Search results for: relative growth of rl

Number of results: 21246978

Journal: The European journal of neuroscience 2012
Anne G E Collins Michael J Frank

Instrumental learning involves corticostriatal circuitry and the dopaminergic system. This system is typically modeled in the reinforcement learning (RL) framework by incrementally accumulating reward values of states and actions. However, human learning also implicates prefrontal cortical mechanisms involved in higher level cognitive functions. The interaction of these systems remains poorly u...

Journal: Physics in medicine and biology 2015
N Ploquin G Kertzscher E Vandervoort J E Cygler C E Andersen P Francescon

A dosimetry system based on Al2O3:C radioluminescence (RL), and RADPOS, a novel 4D dosimetry system using microMOSFETs, were used to measure total scatter factors, (S(c,p))(f(clin))(det), for the CyberKnife robotic radiosurgery system. New Monte Carlo calculated correction factors are presented and applied for the RL detector response for the 5, 7.5 and 10 mm collimators in order to correct for ...

In this paper, we introduce the idea of generalized relative order (respectively generalized relative lower order) of entire functions of two complex variables. Hence, we study some growth properties of entire functions of two complex variables on the basis of the definition of generalized relative order and generalized relative lower order of entire functions of two complex variables.

Thesis: Ministry of Science, Research and Technology - Gorgan University of Agricultural Sciences and Natural Resources - Faculty of Agriculture 1391

Abstract: By protecting scale insects against their natural enemies, ants can disrupt biological control programs targeting them. During 1389 and 1390, citrus orchards in the counties of Sari and Behshahr were sampled to study the fauna and relative abundance of ants associated with scale insects. In addition, the effect of different densities of the worker caste of the species santschi lasius turcicus on the feeding rate and functional response parameters of female insects and ...

Thesis: Ministry of Science, Research and Technology - University of Guilan 1390

Title: A study of the cultural conditions of Gilan (from the Constitutional Revolution to Reza Shah's rise to power) Author: Afzal Afshari Year: 1389-90 Supervisor: Dr. Abutaleb Soltanian Supervisor: Dr. Hasan Kohansal Vajargah Abstract: This research seeks to examine the cultural conditions of Gilan from the Constitutional period to the emergence of Reza Khan and to clarify their extent and nature. The Constitutional Revolution brought about two important cultural achievements; one was the widespread publication of the press ...

Journal: CoRR 2017
Stephen Tu Benjamin Recht

Reinforcement learning (RL) has been successfully used to solve many continuous control tasks. Despite its impressive results however, fundamental questions regarding the sample complexity of RL on continuous problems remain open. We study the performance of RL in this setting by considering the behavior of the Least-Squares Temporal Difference (LSTD) estimator on the classic Linear Quadratic R...

Journal: Brain and language 2013
Marta Vergara-Martínez Manuel Perea Pablo Gómez Tamara Y Swaab

The encoding of letter position is a key aspect in all recently proposed models of visual-word recognition. We analyzed the impact of lexical frequency on letter position assignment by examining the temporal dynamics of lexical activation induced by pseudowords extracted from words of different frequencies. For each word (e.g., BRIDGE), we created two pseudowords: A transposed-letter (TL: BRIGD...

Journal: Matematychni Studii 2018

Journal: CoRR 2017
Romain Laroche Mehdi Fatemi Joshua Romoff Harm van Seijen

This article deals with a novel branch of Separation of Concerns, called Multi-Advisor Reinforcement Learning (MAd-RL), where a single-agent RL problem is distributed to n learners, called advisors. Each advisor tries to solve the problem with a different focus. Their advice is then communicated to an aggregator, which is in control of the system. For the local training, three off-policy bootst...

2005
Nathaniel D. Daw Yael Niv Peter Dayan

The basal ganglia are widely believed to be involved in the learned selection of actions. Building on this idea, reinforcement learning (RL) theories of optimal control have had some success in explaining the responses of their key dopaminergic afferents. While these model-free RL theories offer a compelling account of a range of neurophysiological and behavioural data, they offer only an incom...
