Now showing items 1-1 of 1

    • Obedience-based Multi-Agent Cooperation for Sequential Social Dilemmas 

      Gupta, Gaurav (University of Waterloo, 2020-05-14)
      We propose a mechanism for achieving cooperation and communication in Multi-Agent Reinforcement Learning (MARL) settings by intrinsically rewarding agents for obeying the commands of other agents. At every timestep, agents ...


      University of Waterloo Library
      200 University Avenue West
      Waterloo, Ontario, Canada N2L 3G1
      519 888 4883

      All items in UWSpace are protected by copyright, with all rights reserved.

      DSpace software

      Service outages