نتایج جستجو برای: critic

تعداد نتایج: 2831  

2008
Dotan Di Castro Dmitry Volkinshtein Ron Meir

Actor-critic algorithms for reinforcement learning are achieving renewed popularity due to their good convergence properties in situations where other approaches often fail (e.g., when function approximation is involved). Interestingly, there is growing evidence that actor-critic approaches based on phasic dopamine signals play a key role in biological learning through cortical and basal gangli...

Journal: :The Journal of neuroscience : the official journal of the Society for Neuroscience 2014
Payam Piray Yashar Zeighami Fariba Bahrami Abeer M Eissa Doaa H Hewedi Ahmed A Moustafa

A substantial subset of Parkinson's disease (PD) patients suffers from impulse control disorders (ICDs), which are side effects of dopaminergic medication. Dopamine plays a key role in reinforcement learning processes. One class of reinforcement learning models, known as the actor-critic model, suggests that two components are involved in these reinforcement learning processes: a critic, which ...

2003
Fabien Montagne Samuel Delepoulle Philippe Preux

In real life, learning is greatly speeded-up by the intervention of a teacher who gives examples, or shows, how to perform a certain task. In all this abstract, we let apart structural simpli cations of the problem by the designer which to not deal explicitely with learning. The intervention of the teacher can be realized in di erent ways: verbal explanation, demonstration, guidance, shaping th...

Journal: :Proceedings of the AAAI Conference on Artificial Intelligence 2019

Journal: :Neural networks : the official journal of the International Neural Network Society 2010
Raghavendra V. Kulkarni Ganesh K. Venayagamoorthy

A novel action-dependent adaptive critic design (ACD) is developed for dynamic optimization. The proposed combination of a particle swarm optimization-based actor and a neural network critic is demonstrated through dynamic sleep scheduling of wireless sensor motes for wildlife monitoring. The objective of the sleep scheduler is to dynamically adapt the sleep duration to node's battery capacity ...

Journal: :IEEE Transactions on Automatic Control 2017

2001
Alec M. Rogers

In the context of fuzzy control, antecedent parameters are used to provide a segmentation of the state space so that different regions can be modeled appropriately. In Adaptive Critic methodologies, two modules (the critic and the controller) must properly segment the state space to insure good performance. In this paper, we explore the effects of tuning antecedent parameters that are shared be...

Journal: :Journal of the American Medical Association 1908

Journal: :The Old Testament Student 1885

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید