Now showing items 1-1 of 1

    • Linearizing Contextual Multi-Armed Bandit Problems with Latent Dynamics 

      Nelson, Elliot (University of Waterloo, 2022-02-10)
      In many real-world applications of multi-armed bandit problems, both rewards and observed contexts are often influenced by confounding latent variables which evolve stochastically over time. While the observed contexts and ...

      UWSpace

      University of Waterloo Library
      200 University Avenue West
      Waterloo, Ontario, Canada N2L 3G1
      519 888 4883

      All items in UWSpace are protected by copyright, with all rights reserved.

      DSpace software

      Service outages