Discrete action reinforcement learning automata (DARLA)

Efficient Exploration in Reinforcement Learning

1992

Sebastian B. Thrun

Exploration plays a fundamental role in any active learning system. This study evaluates the role of exploration in active learning and describes several local techniques for exploration in finite, discrete domains, embedded in a reinforcement learning framework (delayed reinforcement). This paper distinguishes between two families of exploration schemes: undirected and directed exploration. Whil...

Modular on-line function approximation for scaling up reinforcement learning

1994

Chen-Khong Tham

Reinforcement learning is a powerful learning paradigm for autonomous agents which interact with unknown environments with the objective of maximizing cumulative payoff. Recent research has addressed issues concerning the scaling up of reinforcement learning methods in order to solve problems with large state spaces, composite tasks and tasks involving non-Markovian situations. In ...

Modified Uni-Vector Field Navigation and Modular Q-learning for Soccer Robots

2001

Kui-Hong Park Yong-Jae Kim Jong-Hwan Kim

The robot soccer system is being used as a test bed to develop the next generation of field robots. In the multiagent system, action selection is important for the cooperation and coordination among agents. There are many techniques in choosing a proper action of the agent. As the environment is dynamic, reinforcement learning is more suitable than supervised learning. Reinforcement learning is...

Continuous and discretized pursuit learning schemes: various algorithms and their comparison

Journal: IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 2001

B. John Oommen M. Agache

A learning automaton (LA) is an automaton that interacts with a random environment, having as its goal the task of learning the optimal action based on its acquired experience. Many learning automata (LAs) have been proposed, with the class of estimator algorithms being among the fastest ones. Thathachar and Sastry, through the pursuit algorithm, introduced the concept of learning algorithms th...

Discrete Recurrent Neural Networks as Pushdown Automata

1997

Zheng Zeng Rodney M. Goodman Padhraic Smyth

In this paper we describe a new discrete recurrent neural network model with discrete external stacks for learning context-free grammars (or pushdown automata). Conventional analog recurrent networks tend to have stability problems when presented with input strings which are longer than those used for training: the network’s internal states become merged and the string cannot be correctly pars...

Habits, action sequences and reinforcement learning

Journal: European Journal of Neuroscience, 2012

Heuristic Dynamic Programming Nonlinear Optimal Controller

2012

Asma Al-tamimi Murad Abu-Khalaf Frank Lewis

This chapter is concerned with the application of approximate dynamic programming techniques (ADP) to solve for the value function, and hence the optimal control policy, in discrete-time nonlinear optimal control problems having continuous state and action spaces. ADP is a reinforcement learning approach (Sutton & Barto, 1998) based on adaptive critics (Barto et al., 1983), (Widrow et al., 1973...

A Comparison of Continuous and Discretized Pursuit Learning Schemes

2000

B. John Oommen Mariana Agache

A Learning Automaton is an automaton that interacts with a random environment, having as its goal the task of learning the optimal action based on its acquired experience. Many learning automata have been proposed, with the class of Estimator Algorithms being among the fastest ones. Thathachar and Sastry [23], through the Pursuit Algorithm, introduced the concept of learning algorithms that pur...

Neural Field Approach to Topological Reinforcement Learning in Continuous Action

1998

H.-M. Gross V. Stephan M. Krabbes

We present a neural field approach to distributed Q-learning in continuous state and action spaces that is based on action coding and selection in dynamic neural fields. It is, to the best of our knowledge, one of the first attempts that combines the advantages of a topological action coding with a distributed action-value learning in one neural architecture. This combination, supplemented by a neu...