نتایج جستجو برای: discrete action reinforcement learning automata darla

تعداد نتایج: 1357117  

1999
Richard S. Sutton

Reinforcement learning (RL) concerns the problem of a learning agent interacting with its environment to achieve a goal. Instead of being given examples of desired behavior, the learning agent must discover by trial and error how to behave in order to get the most reward. The environment is a Markov decision process (MDP) with state set, S, and action set, A. The agent and the environment inter...

2016
Huiru Zhao Yuwei Wang Sen Guo Chao Zhang Robert Lundmark

An important goal of China’s electric power system reform is to create a double-side day-ahead wholesale electricity market in the future, where the suppliers (represented by GenCOs) and demanders (represented by DisCOs) compete simultaneously with each other in one market. Therefore, modeling and simulating the dynamic bidding process and the equilibrium in the double-side day-ahead electricit...

2016
Ji He Jianshu Chen Xiaodong He Jianfeng Gao Lihong Li Li Deng Mari Ostendorf

This paper introduces a novel architecture for reinforcement learning with deep neural networks designed to handle state and action spaces characterized by natural language, as found in text-based games. Termed a deep reinforcement relevance network (DRRN), the architecture represents action and state spaces with separate embedding vectors, which are combined with an interaction function to app...

2015
Kevin N. Gurney Mark D. Humphries Peter Redgrave

Operant learning requires that reinforcement signals interact with action representations at a suitable neural interface. Much evidence suggests that this occurs when phasic dopamine, acting as a reinforcement prediction error, gates plasticity at cortico-striatal synapses, and thereby changes the future likelihood of selecting the action(s) coded by striatal neurons. But this hypothesis faces ...

Journal: :IEEE transactions on neural networks 1994
Zheng Zeng Rodney M. Goodman Padhraic Smyth

Describes a novel neural architecture for learning deterministic context-free grammars, or equivalently, deterministic pushdown automata. The unique feature of the proposed network is that it forms stable state representations during learning-previous work has shown that conventional analog recurrent networks can be inherently unstable in that they cannot retain their state memory for long inpu...

2001
Szilveszter Kovács

Reinforcement learning methods, surviving the control difficulties of the unknown environment, are gaining more and more popularity recently in the autonomous robotics community. One of the possible difficulties of the reinforcement learning applications in complex situations is the huge size of the statevalueor action-value-function representation [2]. The case of continuous environment (conti...

Journal: :IEICE Transactions 2017
Chenxi Li Lei Cao Xiaoming Liu Xiliang Chen Zhixiong Xu Yongliang Zhang

As an important method to solve sequential decisionmaking problems, reinforcement learning learns the policy of tasks through the interaction with environment. But it has difficulties scaling to largescale problems. One of the reasons is the exploration and exploitation dilemma which may lead to inefficient learning. We present an approach that addresses this shortcoming by introducing qualitat...

Journal: :research in medical education 0
حسین کریمی مونقی h karimi mooanaghi mashhad university of medical sciences, mashhadدانشگاه علوم پرشکی زهرا مرضیه حسنیان z m hasanian nursing dept, hamedan university of medical sciences, hamedan, iranگروه آموزشی پرستاری، دانشکده پرستاری و مامائی، دانشگاه علوم پرشکی همدان، همدان ،ایران

the main issues in any society are teaching and learning and main elements of this story are the teacher and the learner. there are different psychology schools in which any of them in turn, have taken many extensive researches about behavior, and facts and theories of learning have been studied from a particular perspective. rationalism asserts that the human intellect has the highest energy a...

Journal: :CoRR 2015
Ji He Jianshu Chen Xiaodong He Jianfeng Gao Lihong Li Li Deng Mari Ostendorf

This paper introduces a novel architecture for reinforcement learning with deep neural networks designed to handle state and action spaces characterized by natural language, as found in text-based games. Termed a deep reinforcement relevance network (DRRN), the architecture represents action and state spaces with separate embedding vectors, which are combined with an interaction function to app...

Journal: :Neurocomputing 1999
Cristiane Salum Antônio C. Roque-da-Silva Alan Pickering

This work describes a neural network model which simulates a discrete part of the dopaminergic striatal circuitry involved in reinforcement learning. We consider the proposal that learning by reinforcement is acquired by a heterosynaptic mechanism a!ecting plasticity of corticostriatal synapses, under the modulatory control of DA neurons. The simulation results seem to be successful since they ...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید