critic

نتایج جستجو برای: critic

تعداد نتایج: 2831 فیلتر نتایج به سال:

Temporal Difference Based Actor Critic Learning - Convergence and Neural Implementation

2008

Dotan Di Castro Dmitry Volkinshtein Ron Meir

Actor-critic algorithms for reinforcement learning are achieving renewed popularity due to their good convergence properties in situations where other approaches often fail (e.g., when function approximation is involved). Interestingly, there is growing evidence that actor-critic approaches based on phasic dopamine signals play a key role in biological learning through cortical and basal gangli...

متن کامل

Impulse control disorders in Parkinson's disease are associated with dysfunction in stimulus valuation but not action valuation.

Journal: :The Journal of neuroscience : the official journal of the Society for Neuroscience 2014

Payam Piray Yashar Zeighami Fariba Bahrami Abeer M Eissa Doaa H Hewedi Ahmed A Moustafa

A substantial subset of Parkinson's disease (PD) patients suffers from impulse control disorders (ICDs), which are side effects of dopaminergic medication. Dopamine plays a key role in reinforcement learning processes. One class of reinforcement learning models, known as the actor-critic model, suggests that two components are involved in these reinforcement learning processes: a critic, which ...

متن کامل

A critic-critic architecture to combine reinforcement and supervised learnings

2003

Fabien Montagne Samuel Delepoulle Philippe Preux

In real life, learning is greatly speeded-up by the intervention of a teacher who gives examples, or shows, how to perform a certain task. In all this abstract, we let apart structural simpli cations of the problem by the designer which to not deal explicitely with learning. The intervention of the teacher can be realized in di erent ways: verbal explanation, demonstration, guidance, shaping th...

متن کامل

Natural Option Critic

Journal: :Proceedings of the AAAI Conference on Artificial Intelligence 2019

متن کامل

Adaptive critics for dynamic optimization

Journal: :Neural networks : the official journal of the International Neural Network Society 2010

Raghavendra V. Kulkarni Ganesh K. Venayagamoorthy

A novel action-dependent adaptive critic design (ACD) is developed for dynamic optimization. The proposed combination of a particle swarm optimization-based actor and a neural network critic is demonstrated through dynamic sleep scheduling of wireless sensor motes for wildlife monitoring. The objective of the sleep scheduler is to dynamically adapt the sleep duration to node's battery capacity ...

متن کامل

An Actor-Critic Algorithm With Second-Order Actor and Critic

Journal: :IEEE Transactions on Automatic Control 2017

متن کامل

Safe option-critic: learning safety in the option-critic architecture

Journal: :The Knowledge Engineering Review 2021

متن کامل

A Comparison of DHP Based Antecedent Parameter Tuning Strategies for Fuzzy Control

2001

Alec M. Rogers

In the context of fuzzy control, antecedent parameters are used to provide a segmentation of the state space so that different regions can be modeled appropriately. In Adaptive Critic methodologies, two modules (the critic and the controller) must properly segment the state space to insure good performance. In this paper, we explore the effects of tuning antecedent parameters that are shared be...

متن کامل

A Courteous Critic.

Journal: :Journal of the American Medical Association 1908

متن کامل

Critic and Historian

Journal: :The Old Testament Student 1885

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید