critic and theorist

Search results for: critic and theorist

Number of results: 16827658

The Eigenoption-Critic Framework

Journal: CoRR 2017

Miao Liu Marlos C. Machado Gerald Tesauro Murray Campbell

Eigenoptions (EOs) have recently been introduced as a promising idea for generating a diverse set of options through the graph Laplacian, and have been shown to allow efficient exploration (Machado et al. [2017a]). Despite their promising initial results, a couple of issues in current algorithms limit their application, namely: 1) EO methods require two separate steps (eigenoption discovery and r...


Projected Natural Actor-Critic

2013

Philip S. Thomas William Dabney Stephen Giguere Sridhar Mahadevan

Natural actor-critics form a popular class of policy search algorithms for finding locally optimal policies for Markov decision processes. In this paper we address a drawback of natural actor-critics that limits their real-world applicability—their lack of safety guarantees. We present a principled algorithm for performing natural gradient descent over a constrained domain. In the context of re...


Off-Policy Actor-Critic

Journal: CoRR 2012

Thomas Degris Martha White Richard S. Sutton

This paper presents the first actor-critic algorithm for off-policy reinforcement learning. Our algorithm is online and incremental, and its per-time-step complexity scales linearly with the number of learned weights. Previous work on actor-critic algorithms is limited to the on-policy setting and does not take advantage of the recent advances in off-policy gradient temporal-difference learning....


Mean Actor Critic

Journal: CoRR 2017

Kavosh Asadi Cameron Allen Melrose Roderick Abdel-rahman Mohamed George Konidaris Michael L. Littman

We propose a new algorithm, Mean Actor-Critic (MAC), for discrete-action continuous-state reinforcement learning. MAC is a policy gradient algorithm that uses the agent’s explicit representation of all action values to estimate the gradient of the policy, rather than using only the actions that were actually executed. This significantly reduces variance in the gradient updates and removes the n...


An Adaptive Critic Approach to Reference Model Adaptation

2003

K. KrishnaKumar G. Limes K. Gundy-Burlet D. Bryant

Neural networks have been successfully used for implementing control architectures for different applications. In this work, we examine a neural network augmented adaptive critic as a Level 2 intelligent controller for a C-17 aircraft. This intelligent control architecture utilizes an adaptive critic to tune the parameters of a reference model, which is then used to define the angular rate comm...


Publisher Correction: Numerical methods every atomic and molecular theorist should know

Journal: Nature Reviews Physics 2020


The World, Gulliver, and the Critic

Journal: XVII-XVIII 2020


Granny versus game theorist: ambiguity in experimental games

2006

Jürgen Eichberger David Kelsey Burkhard C. Schipper

We report on an experiment in which subjects choose actions in strategic games with either strategic complements or substitutes against a granny, a game theorist or other subjects. The games are selected in order to test predictions on the comparative statics of equilibrium with respect to changes in strategic ambiguity. We find that subjects face higher ambiguity while playing against the gran...


Beyond Adaptive Critic - Creative Learning for Intelligent Mobile Robots

2001

Xiaoqun Liao Ming Cao

Intelligent industrial and mobile robots may be considered proven technology in structured environments. Teach programming and supervised learning methods permit solutions to a variety of applications. However, we believe that extending the operation of these machines to more unstructured environments requires a new learning method. Both unsupervised learning and reinforcement learning are pote...


Darwin Studies: A Theorist and His Theories in Their Contexts

Journal: Aestimatio: Critical Reviews in the History of Science 2015

