Browsing University of Waterloo by Author "Gupta, Gaurav"

Now showing items 1-1 of 1

Obedience-based Multi-Agent Cooperation for Sequential Social Dilemmas

Gupta, Gaurav (University of Waterloo, 2020-05-14)

We propose a mechanism for achieving cooperation and communication in Multi-Agent Reinforcement Learning (MARL) settings by intrinsically rewarding agents for obeying the commands of other agents. At every timestep, agents ...