reward situation

A concept of value during experimental exchange in brown capuchin monkeys, Cebus apella.

Journal: Folia primatologica; international journal of primatology, 2004

Sarah F Brosnan, Frans B M de Waal

We evaluated the response of brown capuchin monkeys to two differentially valued tokens in an experimental exchange situation akin to a simple barter. Monkeys were given a series of three tests to evaluate their ability to associate tokens with food, then their responses were examined in a barter situation in which tokens were either limited or unlimited. Capuchins did not perform barter in the...


Modeling Sensorimotor Learning in LIDA Using a Dynamic Learning Rate

2015

Daqi Dong, Stan Franklin

We present a new model of sensorimotor learning in a systems-level cognitive model, LIDA. Sensorimotor learning helps an agent properly interact with its environment using past experiences. This new model stores and updates the rewards of pairs of data, motor commands and their contexts, using the concept of reinforcement learning; thus the agent is able to generate (output) effective commands ...


Optimal, heuristic and Q-learning based DSA policies for cellular networks with coordinated access band

Journal: European Transactions on Telecommunications, 2010

Hany Kamal, Marceau Coupechoux, Philippe Godlewski, Jean Marc Kelif

Owing to the increasing demand for higher data rate applications and to current spectrum crowding, DSA (Dynamic Spectrum Access) has become an active research topic. In this paper, we analyze DSA in the context of cellular networks, where a CAB (Coordinated Access Band) is shared between RANs (Radio Access Networks). We propose an SMDP (Semi-Markov Decision Process) approach to derive ...


The neural mechanisms of learning from competitors

Journal: NeuroImage, 2010

Paul A. Howard-Jones, Rafal Bogacz, Jee H. Yoo, Ute Leonards, Skevi Demetriou

Learning from competitors poses a challenge for existing theories of reward-based learning, which assume that rewarded actions are more likely to be executed in the future. Such a learning mechanism would disadvantage a player in a competitive situation because, since the competitor's loss is the player's gain, reward might become associated with an action the player should themselves avoid. Us...


Individual Differences in Delay Discounting Under Acute Stress: The Role of Trait Perceived Stress

2012

Karolina M. Lempert, Anthony J. Porcelli, Mauricio R. Delgado, Elizabeth Tricomi

Delay discounting refers to the reduction of the value of a future reward as the delay to that reward increases. The rate at which individuals discount future rewards varies as a function of both individual and contextual differences, and high delay discounting rates have been linked with problematic behaviors, including drug abuse and gambling. The current study investigated the effects of acu...
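The definition of delay discounting above can be made concrete with a small numerical sketch. The abstract does not say which discounting model the study fitted, so the hyperbolic form V = A / (1 + kD) used here is an assumption, chosen only because it is a commonly used one. The function name `hyperbolic_value` and the parameter values are hypothetical.

```python
def hyperbolic_value(amount: float, delay: float, k: float) -> float:
    """Subjective value of a delayed reward under hyperbolic discounting.

    V = A / (1 + k * D), where A is the reward amount, D the delay,
    and k the discount rate (assumed model; larger k = steeper discounting).
    """
    if delay < 0 or k < 0:
        raise ValueError("delay and k must be non-negative")
    return amount / (1.0 + k * delay)


if __name__ == "__main__":
    # Compare a shallow and a steep discounter on the same delayed reward.
    for k in (0.01, 0.1):
        values = [round(hyperbolic_value(100.0, d, k), 2) for d in (0, 10, 30)]
        print(f"k={k}: {values}")
```

In this sketch a higher `k` corresponds to the "high delay discounting rate" mentioned in the abstract: the same delayed reward loses value faster as the delay grows.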


Regionally distinct processing of rewards and punishments by the primate ventromedial prefrontal cortex.

Journal: The Journal of Neuroscience: the official journal of the Society for Neuroscience, 2012

Ilya E Monosov, Okihide Hikosaka

The ventromedial prefrontal cortex (vmPFC) is thought to be related to emotional experience and to the processing of stimulus and action values. However, little is known about how single vmPFC neurons process the prediction and reception of rewards and punishments. We recorded from monkey vmPFC neurons in an experimental situation with alternating blocks, one in which rewards were delivered and...


Slow cortical potentials capture decision processes during temporal discounting

2013

Felix Oswald, Uta Sailer

Various neuroimaging studies have detected brain regions involved in discounting the value of temporally delayed rewards. This study used slow cortical potentials (SCPs) to elaborate the time course of cognitive processing during temporal discounting. Depending on their strength of discounting, subjects were categorised as low and high impulsive. Low impulsives, but not high impulsives, showed ...


[Title not recovered; the extracted text is figure-label residue: Situation, Action, Planner, Reactive Policy]

1991

Richard S. Sutton

Dyna is an AI architecture that integrates learning, planning, and reactive execution. Learning methods are used in Dyna both for compiling planning results and for updating a model of the effects of the agent's actions on the world. Planning is incremental and can use the probabilistic and ofttimes incorrect world models generated by learning processes. Execution is fully reactive in the sense...
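The interplay of learning, model updating, incremental planning, and reactive execution described above can be sketched in a few lines. This is a minimal tabular illustration in the style of Dyna-Q, not the paper's own implementation. The five-state corridor, the function names `step` and `dyna_q`, and all parameter values are assumptions made for the example.

```python
import random

N_STATES = 5          # hypothetical corridor: states 0..4, goal at state 4
ACTIONS = (-1, +1)    # step left or step right


def step(state, action):
    """Toy deterministic environment; reward 1.0 only on reaching the goal."""
    nxt = min(max(state + action, 0), N_STATES - 1)
    reward = 1.0 if nxt == N_STATES - 1 else 0.0
    return nxt, reward


def dyna_q(episodes=50, planning_steps=10, alpha=0.5, gamma=0.9,
           epsilon=0.1, seed=0):
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}
    model = {}  # learned world model: (state, action) -> (next_state, reward)

    def update(s, a, r, s2):
        # One-step Q-learning backup; the goal state is terminal.
        best_next = 0.0 if s2 == N_STATES - 1 else max(q[(s2, b)] for b in ACTIONS)
        q[(s, a)] += alpha * (r + gamma * best_next - q[(s, a)])

    for _ in range(episodes):
        s = 0
        while s != N_STATES - 1:
            # Reactive execution: act from current values, with no lookahead.
            if rng.random() < epsilon:
                a = rng.choice(ACTIONS)
            else:
                best = max(q[(s, b)] for b in ACTIONS)
                a = rng.choice([b for b in ACTIONS if q[(s, b)] == best])
            s2, r = step(s, a)
            update(s, a, r, s2)              # learn from real experience
            model[(s, a)] = (s2, r)          # update the world model
            for _ in range(planning_steps):  # incremental planning on the model
                ps, pa = rng.choice(list(model))
                ps2, pr = model[(ps, pa)]
                update(ps, pa, pr, ps2)
            s = s2
    return q


if __name__ == "__main__":
    q = dyna_q()
    print({s: max(ACTIONS, key=lambda a: q[(s, a)]) for s in range(N_STATES - 1)})
```

The same `update` routine serves both real and simulated transitions, which is the design point of the sketch: planning is just learning applied to experience replayed from the model.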


The absence of reward induces inequity aversion in dogs.

Journal: Proceedings of the National Academy of Sciences of the United States of America, 2009

Friederike Range, Lisa Horn, Zsófia Viranyi, Ludwig Huber

One crucial element for the evolution of cooperation may be the sensitivity to others' efforts and payoffs compared with one's own costs and gains. Inequity aversion is thought to be the driving force behind unselfish motivated punishment in humans constituting a powerful device for the enforcement of cooperation. Recent research indicates that non-human primates refuse to participate in cooper...


Hypoalgesia Induced by Reward Devaluation in Rats

2016

Ana María Jiménez-García, Leandro Ruíz-Leyva, Cruz Miguel Cendán, Carmen Torres, Mauricio R. Papini, Ignacio Morón

Reduced sensitivity to physical pain (hypoalgesia) has been reported after events involving reward devaluation. Reward devaluation was implemented in a consummatory successive negative contrast (cSNC) task. Food-deprived Wistar rats had access to 32% sucrose during 16 sessions followed by access to 4% sucrose during 3 additional sessions. An unshifted control group had access to 4% sucrose thro...
