نتایج جستجو برای: reinforcement learning

تعداد نتایج: 619520  

1997
Munir-ul M. CHOWDHURY Yun LI

Disadvantages of traditional reinforcement learning techniques are complicated structures and that training algorithms are often reliant on the derivative information of the problem domain and also require a priori information of the network architecture. Such handicaps are overcome in this paper with the use of ‘messy genetic algorithms’, whose main characteristic is a variable length chromoso...

Journal: :Neural computation 2011
Rafal Bogacz Tobias Larsen

This article seeks to integrate two sets of theories describing action selection in the basal ganglia: reinforcement learning theories describing learning which actions to select to maximize reward and decision-making theories proposing that the basal ganglia selects actions on the basis of sensory evidence accumulated in the cortex. In particular, we present a model that integrates the actor-c...

2007
Richard S. Sutton

The RLAI research program pursues an approach to artificial intelligence and engineering problems in which they are formulated as large optimal control problems and approximately solved using reinforcement learning methods. Reinforcement learning is a new body of theory and techniques for optimal control that has been developed in the last twenty years primarily within the machine learning and ...

Journal: :Neural networks : the official journal of the International Neural Network Society 2013
Anne S. Warlaumont Gert Westermann Eugene H. Buder D. Kimbrough Oller

Vocal motor development in infancy provides a crucial foundation for language development. Some significant early accomplishments include learning to control the process of phonation (the production of sound at the larynx) and learning to produce the sounds of one's language. Previous work has shown that social reinforcement shapes the kinds of vocalizations infants produce. We present a neural...

Journal: :Neural networks : the official journal of the International Neural Network Society 2004
R. J. C. Bosman W. A. van Leeuwen Bastian Wemmenhove

A toy model of a neural network in which both Hebbian learning and reinforcement learning occur is studied. The problem of 'path interference', which makes that the neural net quickly forgets previously learned input-output relations is tackled by adding a Hebbian term (proportional to the learning rate nu) to the reinforcement term (proportional to delta) in the learning rule. It is shown that...

2015
Jie SHAO jingru YU

This paper presented a novel approach XCS-FPGRL to research on robot reinforcement learning. XCS-FPGRL combines covering operator and genetic algorithm. The systems is responsible for adjusting precision and reducing search space according to some reward obtained from the environment, acts as an innovation discovery component which is responsible for discovering new better reinforcement learnin...

Journal: :Journal of neurophysiology 2015
Chong Chen

Our understanding of the neural basis of reinforcement learning and intelligence, two key factors contributing to human strivings, has progressed significantly recently. However, the overlap of these two lines of research, namely, how intelligence affects neural responses during reinforcement learning, remains uninvestigated. A mini-review of three existing studies suggests that higher IQ (espe...

2002
Martijn van Otterlo

This paper is about representation in RL. We discuss some of the concepts in representation and generalization in reinforcement learning and argue for higher-order representations, instead of the commonly used propositional representations. The paper contains a small review of current reinforcement learning systems using higher-order representations, followed by a brief discussion. The paper en...

2011
Johannes Friedrich Robert Urbanczik Walter Senn

In learning from trial and error, animals need to relate behavioral decisions to environmental reinforcement even though it may be difficult to assign credit to a particular decision when outcomes are uncertain or subject to delays. When considering the biophysical basis of learning, the credit-assignment problem is compounded because the behavioral decisions themselves result from the spatio-t...

Journal: :Robotics and Autonomous Systems 1997
Hamid Benbrahim Judy A. Franklin

This paper presents some results from a study of biped dynamic walking using reinforcement learning. During this study a hardware biped robot was built, a new reinforcement learning algorithm as well as a new learning architecture were developed. The biped learned dynamic walking without any previous knowledge about its dynamic model. The Self Scaling Reinforcement learning algorithm was develo...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید