Browsing Theses by Author "Song, Haobei"
Now showing items 1-1 of 1
-
Optimal Learning Theory and Approximate Optimal Learning Algorithms
Song, Haobei (University of Waterloo, 2019-09-12)The exploration/exploitation dilemma is a fundamental but often computationally intractable problem in reinforcement learning. The dilemma also impacts data efficiency which can be pivotal when the interactions between the ...