Search results for: critic and theorist

Number of results: 16827658

Journal: Contemporary Political Theory, 2019

Journal: Revista Colombiana de Psicología, 2018

1999
F. L. Lewis

Two feedback control systems are designed that employ the adaptive critic architecture, which consists of two neural networks, one of which (the critic) tunes the other. The first application is a deadzone compensator, where it is shown that the adaptive critic structure is a natural consequence of the mathematical problem of inversion of an unknown function. In this situation the adaptive crit...

2011
Victor Gabillon, Alessandro Lazaric, Mohammad Ghavamzadeh, Bruno Scherrer

In this paper, we study the effect of adding a value function approximation component (critic) to rollout classification-based policy iteration (RCPI) algorithms. The idea is to use a critic to approximate the return after we truncate the rollout trajectories. This allows us to control the bias and variance of the rollout estimates of the action-value function. Therefore, the introduction of a ...
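The truncation idea in the abstract above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names `truncated_rollout_return` and `average_rollout_estimate`, and the representation of the critic as a single number for the truncation state, are assumptions made for the example.

```python
def truncated_rollout_return(rewards, critic_value, gamma):
    """Estimate a return from a rollout cut off after len(rewards) steps.

    rewards      -- rewards observed before truncation (hypothetical input)
    critic_value -- critic's value estimate at the state where the rollout stopped
    gamma        -- discount factor
    """
    total = 0.0
    discount = 1.0
    for r in rewards:
        total += discount * r
        discount *= gamma
    # The critic stands in for everything after the truncation point.
    return total + discount * critic_value


def average_rollout_estimate(rollouts, gamma):
    """Average several truncated rollouts, each given as (rewards, critic_value)."""
    estimates = [truncated_rollout_return(r, v, gamma) for r, v in rollouts]
    return sum(estimates) / len(estimates)


if __name__ == "__main__":
    print(truncated_rollout_return([1.0, 1.0, 1.0], 4.0, 0.5))  # 2.25
```

Shorter rollouts lean more heavily on the critic (more bias if the critic is inaccurate, less variance from sampled rewards), while longer rollouts do the opposite.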

Journal: Journal of Research in Medical Sciences
Shaista Salman Guraya, Salman Yousuf Guraya, Fawzia A. Habib, Khalid I. Khoshhal

Background: Understanding the learning styles of medical students can help institutions adapt instructional materials to enhance students’ learning of knowledge and skills. This study explored the learning styles of undergraduate medical students, comparing gender variations in terms of their significant preferences. Materials and methods: A cross-sectional observational study was perfo...

Journal: SIAM J. Control and Optimization, 2003
Vijay R. Konda, John N. Tsitsiklis

In this article, we propose and analyze a class of actor-critic algorithms. These are two-time-scale algorithms in which the critic uses temporal difference learning with a linearly parameterized approximation architecture, and the actor is updated in an approximate gradient direction, based on information provided by the critic. We show that the features for the critic should ideally span a su...
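As a rough illustration of the structure described above, the sketch below shows one update of a generic one-step actor-critic with a linear critic trained by temporal-difference learning. It is not the algorithm analyzed in the paper: the softmax policy, the use of the TD error as the actor's signal, and all names (`actor_critic_step`, `theta`, `w`, the step sizes) are assumptions chosen for the example. The two time scales appear only as a critic step size that is larger than the actor step size.

```python
import math


def softmax(prefs):
    """Numerically stable softmax over a list of action preferences."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def actor_critic_step(theta, w, phi_s, phi_next, action, reward,
                      gamma, alpha_actor, alpha_critic):
    """Return updated (theta, w, td_error) after one transition.

    theta -- one weight vector per action (softmax policy, assumed form)
    w     -- critic weights, with V(s) approximated by dot(w, phi(s))
    """
    # Critic: TD(0) error under the linear value approximation.
    td_error = reward + gamma * dot(w, phi_next) - dot(w, phi_s)
    new_w = [wi + alpha_critic * td_error * f for wi, f in zip(w, phi_s)]

    # Actor: step along td_error * grad log pi(action | s).
    probs = softmax([dot(t, phi_s) for t in theta])
    new_theta = []
    for b, t in enumerate(theta):
        indicator = 1.0 if b == action else 0.0
        scale = alpha_actor * td_error * (indicator - probs[b])
        new_theta.append([ti + scale * f for ti, f in zip(t, phi_s)])
    return new_theta, new_w, td_error


if __name__ == "__main__":
    theta, w, delta = actor_critic_step(
        theta=[[0.0], [0.0]], w=[0.0], phi_s=[1.0], phi_next=[1.0],
        action=0, reward=1.0, gamma=0.5, alpha_actor=0.1, alpha_critic=0.5)
    print(theta, w, delta)
```

Keeping `alpha_critic` larger than `alpha_actor` lets the critic track the current policy's values faster than the policy itself changes, which is the intuition behind a two-time-scale scheme.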

2002
Xiaoqun Liao

Intelligent industrial and mobile robots may be considered proven technology in structured environments. Teach programming and supervised learning methods permit solutions to a variety of applications. However, we believe that to extend the operation of these machines to more unstructured environments requires a new learning method. Both unsupervised learning and reinforcement learning are pote...

Journal: Artif. Intell., 1993
Michael Thielscher

Thielscher, M., On prediction in Theorist (Research Note), Artificial Intelligence 60 (1993) 283-292. Theorist is a well-known framework and system for nonmonotonic reasoning which provides mechanisms for dealing with both explanations for observations and skeptical prediction. Its current implementation, developed by David Poole and co-workers, uses an algorithm for prediction which holds for ...

Journal: Personality and Individual Differences, 2016
