Learning to signal in a dynamic world
Sender–receiver games, first introduced by David Lewis ( [1969] ), have received increased attention in recent years as a formal model for the emergence of communication. Skyrms ( [2010] ) showed that simple models of reinforcement learning often succeed in forming efficient, albeit not necessarily minimal, signalling systems for a large family of games. Later, Alexander et al. ( [2012] ) showed that reinforcement learning, combined with forgetting, frequently produced both efficient and minimal signalling systems. In this article, I define a ‘dynamic’ sender–receiver game in which the state–action pairs are not held constant over time and show that neither of these two models of learning learn to signal in this environment. However, a model of reinforcement learning with discounting of the past does learn to signal; it also gives rise to the phenomenon of linguistic drift.
| Item Type | Article |
|---|---|
| Departments | Philosophy, Logic and Scientific Method |
| DOI | 10.1093/bjps/axt044 |
| Date Deposited | 09 Aug 2012 07:25 |
| URI | https://researchonline.lse.ac.uk/id/eprint/45282 |