نتایج جستجو برای: critic
تعداد نتایج: 2831 فیلتر نتایج به سال:
In the present paper a neural network approach called “Adaptive Critic Design” (ACD) was applied to optimal tuning of set point controllers of the three main substrates (sugar, nitrogen source and dissolved oxygen) for PHB production process. For approximation of the critic and the controllers a special kind of recurrent neural networks called Echo state networks (ESN) were used. Their structur...
Instrumental conditioning studies how animals and humans choose actions appropriate to the affective structure of an environment. According to recent reinforcement learning models, two distinct components are involved: a "critic," which learns to predict future reward, and an "actor," which maintains information about the rewarding outcomes of actions to enable better ones to be chosen more fre...
This paper presents the first actor-critic algorithm for o↵-policy reinforcement learning. Our algorithm is online and incremental, and its per-time-step complexity scales linearly with the number of learned weights. Previous work on actor-critic algorithms is limited to the on-policy setting and does not take advantage of the recent advances in o↵policy gradient temporal-di↵erence learning. O↵...
this article discusses the stylistic criticism of the adonis’s poetry, based on the book modern poetic style by salah fadel. for this purpose, this article focuses on the critic himself and investigates the basic issues that he raised, specifies the concept and the expressions that he conceptualized, and the methodological techniques that he employed. this researcher believes that the critic ha...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید