نتایج جستجو برای: action value function

تعداد نتایج: 2342819  

Journal: :Science 2005
K S Lashley

Mass Action in Cerebral Function: PROFESsoR K. S. Scientific Apparatus and Laboratory Methods: LASHLEY ..................................... 245 A New Singing Tube: DR. F. L. ROBESON. A Simple Microscope Eyepiece Pointer: JAMES A. Obituary: LOUNSBURY .......L..265 Ignatius Urban, Erik L. Ekman: PAUL C. STANDLEY. Recent Deaths ................ .................... 254 Special Articles: Observati...

2012
Nathaniel D. Daw Samuel J. Gershman Ben Seymour Peter Dayan Raymond J. Dolan

The task consists of three states (first stage: sA; second stage: sB and sC), each with two actions (aA and aB). The goal of both the model-based and model-free subcomponents of the algorithm is to learn a state-action value function Q(s,a) mapping each state-action pair to its expected future value. On trial t, we denote the first-stage state (always sA) by s1,t, the second-stage state by s2,t...

فروغ, بیژن, معین الدین, رضا, نجاتی, مینا, نجاتی, پریسا, کوهپایه زاده, جلیل,

Bacground and Objective: Foot orthoses are a common intervention for patients with patellofemoral pain syndrome but, limited information is available in the effects of foot orthoses on knee pain and function of athletes with patellofemoral pain syndrome. The aim of our study was to determinate the effects of foot orthoses on reducing pain and increasing function of athletes with patellofemoral ...

Journal: :Soft Comput. 2011
Xin Xu Chunming Liu Dewen Hu

As an important approach to solving complex sequential decision problems, reinforcement learning (RL) has been widely studied in the community of artificial intelligence and machine learning. However, the generalization ability of RL is still an open problem and it is difficult for existing RL algorithms to solve Markov decision problems (MDPs) with both continuous state and action spaces. In t...

2015
Robert H. Sturges

Value Engineering (VE) techniques based on function have been the means to improved products and processes for several decades. It is a social design methodology that is usually episodic in application and often confused with narrow interests, such as cost cutting. This paper addresses the role, or function, of VE in a larger model of design practice to give insight into its use, non-use and mi...

LVAD is a mechanical pump supporting a weak heart function and blood flow. Sometimes, the heart may not recover fast enough to take over the pumping action immediately after surgery, in such patients a temporary support device has been employed to maintain the pumping action until the patient’s own heart recovers. This device can be considered as a temporary alternative before the process of ar...

Journal: :Artif. Intell. 2008
Ronen I. Brafman Carmel Domshlak

Classical work on eliciting and representing preferences over multi-attribute alternatives has attempted to recognize conditions under which value functions take on particularly simple and compact form, making their elicitation much easier. In this paper we consider preferences over discrete domains, and show that for a certain class of simple and intuitive qualitative preference statements, on...

1995

eduction model. We postulate that this implementation can potentially ooer superior performance scalability when compared to the single-master implementations even when the granularity of parallelism is not very coarse. Immediate future work will focus on evaluating the performance scalability of the new implementationus-ing GLU applications with modest granularity of par-allelism. In the long ...

Journal: :Progress of Theoretical Physics Supplement 2003

Journal: :Proceedings of the Japan Academy, Series A, Mathematical Sciences 1953

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید