UWSpace is currently experiencing technical difficulties resulting from its recent migration to a new version of its software. These technical issues are not affecting the submission and browse features of the site. UWaterloo community members may continue submitting items to UWSpace. We apologize for the inconvenience, and are actively working to resolve these technical issues.
 

Learning-Free Methods for Goal Conditioned Reinforcement Learning from Images

dc.contributor.authorVan de Kleut, Alexander
dc.date.accessioned2021-04-27T14:13:41Z
dc.date.available2021-04-27T14:13:41Z
dc.date.issued2021-04-27
dc.date.submitted2021-04-16
dc.description.abstractWe are interested in training goal-conditioned reinforcement learning agents to reach arbitrary goals specified as images. In order to make our agent fully general, we provide the agent with only images of the environment and the goal image. Prior methods in goal-conditioned reinforcement learning from images use a learned lower-dimensional representation of images. These learned latent representations are not necessary to solve a variety of goal-conditioned tasks from images. We show that a goal-conditioned reinforcement learning policy can be successfully trained end-to-end from pixels by using simple reward functions. In contrast to prior work, we demonstrate that using negative raw pixel distance as a reward function is a strong baseline. We also show that using the negative Euclidian distance between feature vectors produced by a random convolutional neural network outperforms learned latent representations like convolutional variational autoencoders.en
dc.identifier.urihttp://hdl.handle.net/10012/16908
dc.language.isoenen
dc.pendingfalse
dc.publisherUniversity of Waterlooen
dc.subjectreinforcement learningen
dc.subjectdeep reinforcement learningen
dc.subjectmachine learningen
dc.subjectaien
dc.subjectartificial intelligenceen
dc.subjectmachine visionen
dc.subjectcomputer visionen
dc.subjectself-superviseden
dc.subjectgoal-conditioneden
dc.subjectmulti-goalen
dc.subjectrlen
dc.titleLearning-Free Methods for Goal Conditioned Reinforcement Learning from Imagesen
dc.typeMaster Thesisen
uws-etd.degreeMaster of Mathematicsen
uws-etd.degree.departmentDavid R. Cheriton School of Computer Scienceen
uws-etd.degree.disciplineComputer Scienceen
uws-etd.degree.grantorUniversity of Waterlooen
uws-etd.embargo.terms0en
uws.contributor.advisorOrchard, Jeff
uws.contributor.affiliation1Faculty of Mathematicsen
uws.peerReviewStatusUnrevieweden
uws.published.cityWaterlooen
uws.published.countryCanadaen
uws.published.provinceOntarioen
uws.scholarLevelGraduateen
uws.typeOfResourceTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
VanDeKleut_Alexander.pdf
Size:
3.19 MB
Format:
Adobe Portable Document Format
Description:
main article
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
6.4 KB
Format:
Item-specific license agreed upon to submission
Description: