reward situation

نتایج جستجو برای: reward situation

تعداد نتایج: 163929 فیلتر نتایج به سال:

Asymptotic Behavior of Multivariate Reward Processes with Nonlinear Reward Functions

Journal: Bulletin of the Iranian Mathematical Society 2011

A. R. Soltani K. Khorshidian

متن کامل

Distributionally Robust Reward-risk Ratio Programming with Wasserstein Metric

2017

Yong Zhao Yongchao Liu Jin Zhang Xinmin Yang

Reward-risk ratio (RR) is a very important stock market definition. In order to capture the situation that the investor does not have complete information on the distribution of the underlying uncertainty, people extend RR model to distributionally robust reward-risk ratio (DRR) model. In this paper, we study the DRR problem where the ambiguity on the distributions is defined through Wassertein...

متن کامل

Right-of-Way Rules as Use Case for Integrating GOLOG and Qualitative Reasoning

2009

Florian Pommerening Stefan Wölfl Matthias Westphal

Agents interacting in a dynamically changing spatial environment often need to access the same spatial resources. A typical example is given by moving vehicles that meet at an intersection in a street network. In such situations right-of-way rules regulate the actions the vehicles involved may perform. For this application scenario we show how the Golog framework for reasoning about action and ...

متن کامل

A Situation Calculus Approach to Modeling and Programming Agents

1999

H. J. LEVESQUE

متن کامل

A semantic characterization of a useful fragment of the situation calculus with knowledge

Journal: :Artif. Intell. 2011

Gerhard Lakemeyer Hector J. Levesque

Article history: Available online 3 April 2010

متن کامل

A Linear Meta-interpreter for the Situation Calculus

2008

Peter O'Hearn David Pym Rob Miller Edmund Robinson Murray Shanahan

متن کامل

Reasoning in the Situation Calculus with Limited Belief

2017

Christoph Schwering

متن کامل

A Geometric Approach to Multi-Criterion Reinforcement Learning

Journal: :Journal of Machine Learning Research 2004

Shie Mannor Nahum Shimkin

We consider the problem of reinforcement learning in a controlled Markov environment with multiple objective functions of the long-term average reward type. The environment is initially unknown, and furthermore may be affected by the actions of other agents, actions that are observed but cannot be predicted beforehand. We capture this situation using a stochastic game model, where the learning ...

متن کامل

Learning reward expectations in honeybees.

Journal: :Learning & memory 2007

Mariana Gil Rodrigo J De Marco Randolf Menzel

The aim of this study was to test whether honeybees develop reward expectations. In our experiment, bees first learned to associate colors with a sugar reward in a setting closely resembling a natural foraging situation. We then evaluated whether and how the sequence of the animals' experiences with different reward magnitudes changed their later behavior in the absence of reinforcement and wit...

متن کامل

Safety encoding in the basal amygdala.

Journal: :The Journal of neuroscience : the official journal of the Society for Neuroscience 2013

Susan Sangha James Z Chadick Patricia H Janak

Learning to fear and avoid life-threatening stimuli are critical survival skills but are maladaptive when they persist in the absence of a direct threat. Thus, it is important to detect when a situation is safe and to increase behaviors leading to naturally rewarding actions, such as feeding and mating. It is unclear how the brain distinguishes between dangerous and safe situations. Here, we pr...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید