Search results for: reward situation

Number of results: 163929

2017
Yong Zhao Yongchao Liu Jin Zhang Xinmin Yang

Reward-risk ratio (RR) is a very important stock market definition. In order to capture the situation that the investor does not have complete information on the distribution of the underlying uncertainty, people extend the RR model to the distributionally robust reward-risk ratio (DRR) model. In this paper, we study the DRR problem where the ambiguity on the distributions is defined through Wasserstein...

2009
Florian Pommerening Stefan Wölfl Matthias Westphal

Agents interacting in a dynamically changing spatial environment often need to access the same spatial resources. A typical example is given by moving vehicles that meet at an intersection in a street network. In such situations right-of-way rules regulate the actions the vehicles involved may perform. For this application scenario we show how the Golog framework for reasoning about action and ...

Journal: Artif. Intell. 2011
Gerhard Lakemeyer Hector J. Levesque

Article history: Available online 3 April 2010

2008
Peter O'Hearn David Pym Rob Miller Edmund Robinson Murray Shanahan

Journal: Journal of Machine Learning Research 2004
Shie Mannor Nahum Shimkin

We consider the problem of reinforcement learning in a controlled Markov environment with multiple objective functions of the long-term average reward type. The environment is initially unknown, and furthermore may be affected by the actions of other agents, actions that are observed but cannot be predicted beforehand. We capture this situation using a stochastic game model, where the learning ...

Journal: Learning & memory 2007
Mariana Gil Rodrigo J De Marco Randolf Menzel

The aim of this study was to test whether honeybees develop reward expectations. In our experiment, bees first learned to associate colors with a sugar reward in a setting closely resembling a natural foraging situation. We then evaluated whether and how the sequence of the animals' experiences with different reward magnitudes changed their later behavior in the absence of reinforcement and wit...

Journal: The Journal of neuroscience : the official journal of the Society for Neuroscience 2013
Susan Sangha James Z Chadick Patricia H Janak

Learning to fear and avoid life-threatening stimuli are critical survival skills but are maladaptive when they persist in the absence of a direct threat. Thus, it is important to detect when a situation is safe and to increase behaviors leading to naturally rewarding actions, such as feeding and mating. It is unclear how the brain distinguishes between dangerous and safe situations. Here, we pr...
