Learning-Free Methods for Goal Conditioned Reinforcement Learning from Images

Van de Kleut, Alexander

Learning-Free Methods for Goal Conditioned Reinforcement Learning from Images

dc.contributor.advisor	Orchard, Jeff
dc.contributor.author	Van de Kleut, Alexander
dc.date.accessioned	2021-04-27T14:13:41Z
dc.date.available	2021-04-27T14:13:41Z
dc.date.issued	2021-04-27
dc.date.submitted	2021-04-16
dc.description.abstract	We are interested in training goal-conditioned reinforcement learning agents to reach arbitrary goals specified as images. In order to make our agent fully general, we provide the agent with only images of the environment and the goal image. Prior methods in goal-conditioned reinforcement learning from images use a learned lower-dimensional representation of images. These learned latent representations are not necessary to solve a variety of goal-conditioned tasks from images. We show that a goal-conditioned reinforcement learning policy can be successfully trained end-to-end from pixels by using simple reward functions. In contrast to prior work, we demonstrate that using negative raw pixel distance as a reward function is a strong baseline. We also show that using the negative Euclidian distance between feature vectors produced by a random convolutional neural network outperforms learned latent representations like convolutional variational autoencoders.	en
dc.identifier.uri	http://hdl.handle.net/10012/16908
dc.language.iso	en	en
dc.pending	false
dc.publisher	University of Waterloo	en
dc.subject	reinforcement learning	en
dc.subject	deep reinforcement learning	en
dc.subject	machine learning	en
dc.subject	ai	en
dc.subject	artificial intelligence	en
dc.subject	machine vision	en
dc.subject	computer vision	en
dc.subject	self-supervised	en
dc.subject	goal-conditioned	en
dc.subject	multi-goal	en
dc.subject	rl	en
dc.title	Learning-Free Methods for Goal Conditioned Reinforcement Learning from Images	en
dc.type	Master Thesis	en
uws-etd.degree	Master of Mathematics	en
uws-etd.degree.department	David R. Cheriton School of Computer Science	en
uws-etd.degree.discipline	Computer Science	en
uws-etd.degree.grantor	University of Waterloo	en
uws-etd.embargo.terms	0	en
uws.contributor.advisor	Orchard, Jeff
uws.contributor.affiliation1	Faculty of Mathematics	en
uws.peerReviewStatus	Unreviewed	en
uws.published.city	Waterloo	en
uws.published.country	Canada	en
uws.published.province	Ontario	en
uws.scholarLevel	Graduate	en
uws.typeOfResource	Text	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: VanDeKleut_Alexander.pdf
Size:: 3.19 MB
Format:: Adobe Portable Document Format
Description:: main article

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 6.4 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Theses
Computer Science