Asking for Help with a Cost in Reinforcement Learning

Vandenhof, Colin

Asking for Help with a Cost in Reinforcement Learning

dc.contributor.advisor	Law, Edith
dc.contributor.author	Vandenhof, Colin
dc.date.accessioned	2020-05-15T19:43:16Z
dc.date.available	2020-05-15T19:43:16Z
dc.date.issued	2020-05-15
dc.date.submitted	2020-04-16
dc.description.abstract	Reinforcement learning (RL) is a powerful tool for developing intelligent agents, and the use of neural networks makes RL techniques more scalable to challenging real-world applications, from task-oriented dialogue systems to autonomous driving. However, one of the major bottlenecks to the adoption of RL is efficiency, as it often takes many time steps to learn an acceptable policy. To address this problem, we investigate the idea of allowing the agent to ask for advice from a teacher. We formalize this concept in a framework called ask-for-help RL, which entails augmenting a Markov decision process with a teacher-query action that can be taken at a fixed cost in any state. In this task, the agent faces a dilemma between exploration, exploitation, and teacher-querying. To make this trade-off, we propose an action selection strategy that is rooted in the classical notion of value-of-information, and suggest a practical implementation that is based on deep Q-learning. This algorithm, called VOE/Q, can jointly decide between taking a particular environment action or querying the teacher, and is sensitive to the query cost. We then perform experiments in two domains: a maze navigation task and the Atari game Freeway. When the teacher is excluded, the algorithm shows substantial gains over many other exploration strategies from the literature. With the teacher included, we again find that the algorithm outperforms baselines. By taking advantage of the teacher, higher cumulative reward can be achieved than with standard RL alone. Together, our results point to a promising approach to both RL and ask-for-help RL.	en
dc.identifier.uri	http://hdl.handle.net/10012/15872
dc.language.iso	en	en
dc.pending	false
dc.publisher	University of Waterloo	en
dc.subject	reinforcement learning	en
dc.subject	apprenticeship learning	en
dc.subject	imitation learning	en
dc.subject	learning from demonstration	en
dc.subject	human-in-the-loop	en
dc.subject	interactive reinforcement learning	en
dc.subject	deep reinforcement learning	en
dc.subject	active learning	en
dc.subject.lcsh	Reinforcement learning	en
dc.subject.lcsh	Active learning	en
dc.title	Asking for Help with a Cost in Reinforcement Learning	en
dc.type	Master Thesis	en
uws-etd.degree	Master of Mathematics	en
uws-etd.degree.department	David R. Cheriton School of Computer Science	en
uws-etd.degree.discipline	Computer Science	en
uws-etd.degree.grantor	University of Waterloo	en
uws.contributor.advisor	Law, Edith
uws.contributor.affiliation1	Faculty of Mathematics	en
uws.peerReviewStatus	Unreviewed	en
uws.published.city	Waterloo	en
uws.published.country	Canada	en
uws.published.province	Ontario	en
uws.scholarLevel	Graduate	en
uws.typeOfResource	Text	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Vandenhof_Colin.pdf
Size:: 1.17 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 6.4 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Theses
Computer Science