نتایج جستجو برای: passive critic features

تعداد نتایج: 593035  

2014
Jordan I. Barnes Caitlyn McColeman Ekaterina R. Stepanova Mark R. Blair R. Calen Walshe

Here we introduce a simple actor-critic model of eye movements during category learning that we call RLAttn (Reinforcement Learning of Attention). RLAttn stores the rewards it receives for making decisions or performing actions, while attempting to associate stimuli with particular categories. Over multiple trials, RLAttn learns that a large reward is most likely when the values of the relevant...

2010
Beibei Cheng Joe Stanley Sameer Antani George R. Thoma Jongan Park Waqas Rasheed

In this research, a novel computational intelligencebased algorithm to detect artifacts, specifically arrows, in medical images is presented. Image analyses techniques are developed to find the symbols and text automatically. Features are computed from the shape of arrow for the discrimination of arrows from other artifacts. We investigate a biologically-inspired reinforcement learning (RL) app...

Journal: :The Musical Times 1912

Journal: :Hispanic Review 1964

پایان نامه :وزارت علوم، تحقیقات و فناوری - دانشگاه علامه طباطبایی - دانشکده ادبیات و زبانهای خارجی 1387

the present study is an experimental case study which investigates the impacts, if any, of skopos on syntactic features of the target text. two test groups each consisting of 10 ma students translated a set of sentences selected from advertising texts in the operative and informative mode. the resulting target texts were then statistically analyzed in terms of the number of words, phrases, si...

Journal: :Science 2004
John O'Doherty Peter Dayan Johannes Schultz Ralf Deichmann Karl Friston Raymond J Dolan

Instrumental conditioning studies how animals and humans choose actions appropriate to the affective structure of an environment. According to recent reinforcement learning models, two distinct components are involved: a "critic," which learns to predict future reward, and an "actor," which maintains information about the rewarding outcomes of actions to enable better ones to be chosen more fre...

2012
Thomas Degris Martha White Richard S. Sutton

This paper presents the first actor-critic algorithm for o↵-policy reinforcement learning. Our algorithm is online and incremental, and its per-time-step complexity scales linearly with the number of learned weights. Previous work on actor-critic algorithms is limited to the on-policy setting and does not take advantage of the recent advances in o↵policy gradient temporal-di↵erence learning. O↵...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید