The Algorithmic Advantage: How Reinforcement Learning Generates Rich Communication
| dc.contributor.author | Calvano, Emilio | |
| dc.contributor.author | Possnig, Clemens | |
| dc.contributor.author | Tolvanen, Juha | |
| dc.date.accessioned | 2026-06-10T16:12:36Z | |
| dc.date.available | 2026-06-10T16:12:36Z | |
| dc.date.issued | 2026-02-12 | |
| dc.description.abstract | We analyze strategic communication when advice is generated by a reinforcement-learning algorithm rather than by a fully rational sender. Building on the cheap-talk framework of Crawford and Sobel (1982), an advisor adapts its messages based on payoff feedback, while a decision maker best-responds. We provide a theoretical analysis of the long-run communication outcomes induced by such reward-driven adaptation. With aligned preferences, we establish that learning robustly leads to informative communication even from uninformative initial policies. With misaligned preferences, no stable outcome exists; instead, learning generates cycles that sustain highly informative communication and payoffs exceeding those of any static equilibrium. | |
| dc.identifier.uri | https://hdl.handle.net/10012/23582 | |
| dc.language.iso | en | |
| dc.publisher | Luiss University, University of Waterloo, University of Rome Tor Vergata | |
| dc.title | The Algorithmic Advantage: How Reinforcement Learning Generates Rich Communication | |
| dc.type | Preprint | |
| uws.contributor.affiliation1 | Faculty of Arts | |
| uws.contributor.affiliation2 | Economics | |
| uws.peerReviewStatus | Unreviewed | |
| uws.scholarLevel | Faculty | |
| uws.typeOfResource | Text | en |