Search results for: critic and theorist
Number of results: 16827658
An online adaptive reinforcement learning-based solution is developed for the infinite-horizon optimal control problem of continuous-time uncertain nonlinear systems. A novel actor–critic–identifier (ACI) architecture is proposed to approximate the Hamilton–Jacobi–Bellman equation using three neural network (NN) structures: the actor and critic NNs approximate the optimal control and the optimal value function,...
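For reference, the Hamilton–Jacobi–Bellman equation that such an architecture approximates is usually stated for an input-affine system with a quadratic-in-control cost; the truncated abstract does not show the paper's exact assumptions, so the following is only the textbook form:

```latex
% Standard infinite-horizon HJB for \dot{x} = f(x) + g(x)u with cost
% \int_t^\infty (Q(x) + u^\top R u)\, d\tau (textbook form; the paper's
% exact system class and cost are not visible in the truncated abstract).
0 = Q(x) + \nabla V^*(x)^\top \big( f(x) + g(x)\, u^*(x) \big) + u^*(x)^\top R\, u^*(x),
\qquad
u^*(x) = -\tfrac{1}{2}\, R^{-1} g(x)^\top \nabla V^*(x).
```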
Multi-step methods are important in reinforcement learning (RL). Eligibility traces, the usual way of handling them, work well with linear function approximators. Recently, van Seijen (2016) introduced a delayed learning approach, without eligibility traces, for handling the multi-step λ-return with nonlinear function approximators. However, this was limited to action-value methods. In thi...
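As context for the multi-step λ-return mentioned above, a minimal sketch of its standard forward-view recursion is shown below. This is only the target that such methods track, not van Seijen's specific delayed-update scheme; the function and argument names are illustrative.

```python
# Forward-view lambda-return computed backward over one episode:
#   G_t = r_{t+1} + gamma * ((1 - lam) * V(s_{t+1}) + lam * G_{t+1})
# rewards[t] holds r_{t+1}; values[t] holds V(s_{t+1}); for a terminal
# final state, pass values[-1] = 0.
def lambda_returns(rewards, values, gamma=0.99, lam=0.9):
    G = values[-1]                      # bootstrap from the last state's value
    targets = [0.0] * len(rewards)
    for t in reversed(range(len(rewards))):
        G = rewards[t] + gamma * ((1 - lam) * values[t] + lam * G)
        targets[t] = G
    return targets
```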
Choosing features for the critic in actor-critic algorithms with function approximation is known to be a challenge. Too few critic features can lead to degeneracy of the actor gradient, and too many features may lead to slower convergence of the learner. In this paper, we show that a well-studied class of actor policies satisfies the known requirements for convergence when the actor features are ...
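The convergence requirements alluded to above are typically the compatible function approximation conditions of Sutton et al. (1999); the truncated snippet does not state the paper's exact conditions, so the following is the standard statement rather than the paper's result:

```latex
% Compatible critic features (Sutton et al., 1999): if the critic is linear in the
% policy's score function,
Q_w(s,a) = w^\top \nabla_\theta \log \pi_\theta(a \mid s),
% and w minimizes \mathbb{E}\big[(Q^{\pi_\theta}(s,a) - Q_w(s,a))^2\big], then the
% policy gradient computed with Q_w is unbiased:
\nabla_\theta J(\theta) = \mathbb{E}_{\pi_\theta}\!\left[\nabla_\theta \log \pi_\theta(a \mid s)\, Q_w(s,a)\right].
```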
Abstract: Fuzzy critic-based learning is a reinforcement learning method based on dynamic programming. In this paper, an adaptive critic-based neuro-fuzzy system is presented for an unmanned bicycle. The only information available to the critic agent is the system feedback, which is interpreted as the last action performed by the controller in the previous state. The signal produced by the c...
This paper presents a new method, adversarial advantage actor-critic (Adversarial A2C), which significantly improves the efficiency of dialogue policy learning in task-completion dialogue systems. Inspired by generative adversarial networks (GAN), we train a discriminator to differentiate responses/actions generated by dialogue agents from responses/actions by experts. Then, we incorporate the ...
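A rough sketch of how a discriminator score can be folded into an advantage actor-critic update is given below; the log-score reward shaping, the `adv_weight` mixing coefficient, and the loss weighting are assumptions for illustration, not the paper's exact objective.

```python
import torch
import torch.nn.functional as F

# Illustrative single-step loss mixing a GAN-style discriminator score D(s, a)
# (probability that the action looks expert-like) into an A2C update.
# All arguments except `action` are 0-dim tensors; `action` is an int index.
def adversarial_a2c_loss(logits, value, next_value, action, reward,
                         disc_score, gamma=0.99, adv_weight=0.5):
    # Shape the environment reward with the (assumed) discriminator signal.
    shaped_reward = reward + adv_weight * torch.log(disc_score + 1e-8)
    td_target = shaped_reward + gamma * next_value.detach()
    advantage = td_target - value
    actor_loss = -F.log_softmax(logits, dim=-1)[action] * advantage.detach()
    critic_loss = advantage.pow(2)
    return actor_loss + 0.5 * critic_loss
```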
In this paper we introduce an online algorithm that uses integral reinforcement knowledge for learning the continuous-time optimal control solution for nonlinear systems with infinite horizon costs and partial knowledge of the system dynamics. This algorithm is a data-based approach to the solution of the Hamilton-Jacobi-Bellman equation and it does not require explicit knowledge of the system’...
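The key identity behind integral reinforcement learning, which lets the value function be evaluated from measured trajectory data without the drift dynamics, is usually written as follows (standard form; the paper's notation may differ):

```latex
% Integral (interval) Bellman equation over a reinforcement interval T:
V\big(x(t)\big) = \int_{t}^{t+T} r\big(x(\tau), u(\tau)\big)\, d\tau + V\big(x(t+T)\big),
% solved for V along measured trajectories, so the drift dynamics f(x)
% never appear explicitly in the policy-evaluation step.
```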
A large number of computational models of information processing in the basal ganglia have been developed in recent years. Prominent in these are actor-critic models of basal ganglia functioning, which build on the strong resemblance between dopamine neuron activity and the temporal difference prediction error signal in the critic, and between dopamine-dependent long-term synaptic plasticity in...
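The temporal difference prediction error that dopamine activity is said to resemble is the critic's standard one-step TD error:

```latex
% One-step TD prediction error used by the critic:
\delta_t = r_{t+1} + \gamma\, V(s_{t+1}) - V(s_t).
```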
The acrobot is a two-link robot, actuated only at the joint between the two links. Controlling the acrobot is one of the difficult tasks in reinforcement learning (RL) because it has nonlinear dynamics and continuous state and action spaces. In this article, we discuss applying RL to the task of balancing control of the acrobot. Our RL method has an architecture similar to the actor-critic. The ...
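Because the task above has continuous state and action spaces, an actor-critic setup for it typically uses a stochastic continuous-action policy. The sketch below shows a generic one-step update with a Gaussian policy and a TD(0) critic; it illustrates the general architecture only, not the article's specific method, and the network sizes and learning setup are arbitrary assumptions.

```python
import torch
import torch.nn as nn

# Generic one-step actor-critic update for a continuous-action task
# (illustrative; not the article's specific controller).
class GaussianActor(nn.Module):
    def __init__(self, obs_dim, act_dim):
        super().__init__()
        self.mu = nn.Sequential(nn.Linear(obs_dim, 64), nn.Tanh(),
                                nn.Linear(64, act_dim))
        self.log_std = nn.Parameter(torch.zeros(act_dim))

    def dist(self, obs):
        return torch.distributions.Normal(self.mu(obs), self.log_std.exp())

def actor_critic_step(actor, critic, opt_actor, opt_critic,
                      obs, action, reward, next_obs, gamma=0.99):
    # TD(0) critic update toward the bootstrapped target.
    td_target = reward + gamma * critic(next_obs).detach()
    advantage = td_target - critic(obs)
    critic_loss = advantage.pow(2).mean()
    opt_critic.zero_grad(); critic_loss.backward(); opt_critic.step()

    # Policy-gradient actor update weighted by the (detached) TD advantage.
    log_prob = actor.dist(obs).log_prob(action).sum(-1)
    actor_loss = -(log_prob * advantage.detach()).mean()
    opt_actor.zero_grad(); actor_loss.backward(); opt_actor.step()
```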