Search results for: reward situation

Number of results: 163929

2008
Stavros Vassos Gerhard Lakemeyer Hector J. Levesque

In a seminal paper, Lin and Reiter introduced the notion of progression for basic action theories in the situation calculus. The idea is to replace an initial database by a new set of sentences which reflect the changes due to an action. Unfortunately, progression requires second-order logic in general. In this paper, we introduce the notion of strong progression, a slight variant of Lin and Reit...

2015
Christoph Schwering Gerhard Lakemeyer

A fundamental task in reasoning about action and change is projection, which refers to determining what holds after a number of actions have occurred. A powerful method for solving the projection problem is regression, which reduces reasoning about the future to reasoning about the initial state. In particular, regression has played an important role in the situation calculus and its epistemic ...

2012
Giuseppe De Giacomo Yves Lespérance Fabio Patrizi

We define a notion of bounded action theory in the situation calculus, where the theory entails that in all situations, the number of ground fluent atoms is bounded by a constant. Such theories can still have an infinite domain and an infinite set of states. We argue that such theories are fairly common in applications, either because facts do not persist indefinitely or because one eventually ...

Journal: Journal of Applied Non-Classical Logics 2003
Robert Demolombe

We propose a translation into Modal Logic of the ideas that formalise belief change in the Situation Calculus. This translation is extended to the case of revision. The conclusion presents a set of open issues.

Journal: Proceedings of the National Academy of Sciences of the United States of America 2013
Darby Proctor Rebecca A Williamson Frans B M de Waal Sarah F Brosnan

Is the sense of fairness uniquely human? Human reactions to reward division are often studied by means of the ultimatum game, in which both partners need to agree on a distribution for both to receive rewards. Humans typically offer generous portions of the reward to their partner, a tendency our close primate relatives have thus far failed to show in experiments. Here we tested chimpanzees (Pa...

2010
Gergely Neu András György Csaba Szepesvári

We consider a stochastic extension of the loop-free shortest path problem with adversarial rewards. In this episodic Markov decision problem an agent traverses through an acyclic graph with random transitions: at each step of an episode the agent chooses an action, receives some reward, and arrives at a random next state, where the reward and the distribution of the next state depend on the act...

Journal: Acta medica Okayama 2003
Yutaka Gomita Minehiro Moriyama Yasuyuki Ichimaru Hiroaki Araki Hidenori Sagara

In considering characteristics of action of anxiolytic drugs and the mechanism of drug action in the brain, it may be necessary to study not only the behavioral pharmacology but also the brain site. In the present study, anxiolytic drugs have been examined in various kinds of behaviors induced by stimulating the brain areas with regard to emotional expression such as reward (pleasure) or aversi...

S Bouret SJ Sara

Motivation and attention can have a profound influence on perception, learning and memory. Neuromodulatory systems, especially the noradrenergic (NE) system, co-vary with psychological states to modulate cortical arousal, influence sensory processing and promote synaptic plasticity. There is even some suggestion that the NE system might facilitate functional recovery after brain damage. Post-sy...

2015
Han The Anh Long Tran-Thanh Nicholas R. Jennings

We study the situation of a decision-maker who aims to encourage the players of an evolutionary game theoretic system to follow certain desired behaviours. To do so, she can interfere in the system to reward her preferred behavioural patterns. However, this action requires certain cost (e.g., resource consumption). Given this, her main goal is to maintain an efficient trade-off between achievin...
