Comparing Reinforcement Learning and Evolutionary Based Adaptation in Population Games
Abstract
In evolutionary game theory, the main interest normally lies in how the distribution of strategies changes over time and whether a stable strategy arises. In this paper we compare the dynamics of two games in which three populations of agents interact: a three-player version of matching pennies and a game with several Nash equilibria. We carry out this comparison using three methods: continuous replicator dynamics, an evolutionary approach, and reinforcement learning. We show how convergence depends on the nature of the underlying method, as well as on the pace at which agents adjust.
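As a rough illustration of the first of these methods, the sketch below integrates two-strategy replicator dynamics for three populations with a simple Euler scheme. The abstract does not give the payoffs, so the ones used here are an assumption: a commonly cited three-player matching pennies variant in which player 1 wants to match player 2, player 2 wants to match player 3, and player 3 wants to mismatch player 1, with payoffs of +1 and -1. The function names, the step size `dt`, and the starting point are hypothetical and are not taken from the paper.

```python
# Minimal sketch of three-population replicator dynamics.
# ASSUMPTION: payoffs follow a common three-player matching pennies variant
# (1 matches 2, 2 matches 3, 3 mismatches 1; payoffs +1 / -1). The paper's
# actual payoff tables are not shown on this page.

def payoff_gaps(x):
    """Return u_i(Heads) - u_i(Tails) for each population.

    x = (x1, x2, x3), where xi is the share of population i playing Heads.
    """
    x1, x2, x3 = x
    gap1 = 4 * x2 - 2   # population 1 gains by matching population 2
    gap2 = 4 * x3 - 2   # population 2 gains by matching population 3
    gap3 = 2 - 4 * x1   # population 3 gains by mismatching population 1
    return (gap1, gap2, gap3)


def replicator_step(x, dt):
    """One Euler step of dx_i/dt = x_i * (1 - x_i) * gap_i."""
    gaps = payoff_gaps(x)
    return tuple(xi + dt * xi * (1 - xi) * g for xi, g in zip(x, gaps))


def simulate(x0, dt=0.01, steps=1000):
    """Return the list of population states visited, starting from x0."""
    trajectory = [tuple(x0)]
    for _ in range(steps):
        trajectory.append(replicator_step(trajectory[-1], dt))
    return trajectory


if __name__ == "__main__":
    path = simulate((0.6, 0.5, 0.4), dt=0.01, steps=2000)
    for t in range(0, len(path), 500):
        print(t, tuple(round(v, 3) for v in path[t]))
```

Under these assumed payoffs the fully mixed state (0.5, 0.5, 0.5) is a rest point of the dynamics. The step size `dt` is only a numerical integration parameter here; it is a loose stand-in for, not a model of, the pace of adjustment discussed in the abstract.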
Similar resources
Aspiration-Based Reinforcement Learning in Repeated Interaction Games: an Overview
In models of aspiration-based reinforcement learning, agents adapt by comparing payoffs achieved from actions chosen in the past with an aspiration level. Though such models are well-established in behavioural psychology, only recently have they begun to receive attention in game theory and its applications to economics and politics. This paper provides an informal overview of a range of such t...
Reinforcement Learning in Large Population Models: A Continuity Equation Approach
We study an evolutionary model in which strategy revision protocols are based on agent specific characteristics rather than wider social characteristics. We assume that agents are primed to play mixed strategies. At any time, the distribution of mixed strategies over agents in a population is described by a probability measure. In each round, a pair of randomly chosen agents play a game, after ...
Adaptive agents on evolving networks
In this work we study the learning dynamics for agents playing games on networks. We propose a model of network formation in repeated games where players strategically adopt actions and connections simultaneously using a reinforcement learning scheme which is called Boltzmann-Q-learning. This adaptation scheme in the continuous time limit has a proven relation to the evolutionary game theory th...
Spike-based Decision Learning of Nash Equilibria in Two-Player Games
Humans and animals face decision tasks in an uncertain multi-agent environment where an agent's strategy may change over time due to the co-adaptation of others' strategies. The neuronal substrate and the computational algorithms underlying such adaptive decision making, however, are largely unknown. We propose a population coding model of spiking neurons with a policy gradient procedure that succe...
Reinforcement Learning of Intelligent Characters in Fighting Action Games
In this paper, we investigate reinforcement learning (RL) of intelligent characters, based on neural network technology, for fighting action games. RL can be either on-policy or off-policy. We apply both schemes to tabula rasa learning and adaptation. The experimental results show that (1) in tabula rasa learning, off-policy RL outperforms on-policy RL, but (2) in adaptation, on-policy...
Publication date: 2013