The Bounds of Algorithmic Collusion: Q-learning, Gradient Learning, and the Folk Theorem

dc.contributor.authorAskenazi-Golan, Galit
dc.contributor.authorMergoni Cecchelli, Domenico
dc.contributor.authorPlumb, Edward
dc.contributor.authorPossnig, Clemens
dc.date.accessioned2026-06-10T16:18:20Z
dc.date.available2026-06-10T16:18:20Z
dc.date.issued2026-03-03
dc.description.abstractWe explore the behaviour emerging from learning agents repeatedly interacting strategically for a wide range of learning dynamics, including Q-learning, projected gradient, replicator and log-barrier dynamics. Going beyond the better understood classes of potential games and zero-sum games, we consider the setting of a general repeated game with finite recall under different forms of monitoring. We obtain a Folk Theorem-style result and characterise the set of payoff vectors that can be obtained by these dynamics, discovering a wide range of possibilities for the emergence of algorithmic collusion. Achieving this requires a novel technical approach, which, to the best of our knowledge, yields the first convergence result for multi-agent Q-learning algorithms in repeated games.
dc.identifier.urihttps://hdl.handle.net/10012/23583
dc.language.isoen
dc.publisherLondon School of Economics, University of Waterloo
dc.titleThe Bounds of Algorithmic Collusion: Q-learning, Gradient Learning, and the Folk Theorem
dc.typePreprint
uws.contributor.affiliation1Faculty of Arts
uws.contributor.affiliation2Economics
uws.peerReviewStatusUnreviewed
uws.scholarLevelFaculty
uws.typeOfResourceTexten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
AlgorithmicCollusion_WorkingPaper.pdf
Size:
551.74 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
4.47 KB
Format:
Item-specific license agreed upon to submission
Description: