نتایج جستجو برای: reward situation

تعداد نتایج: 163929  

Journal: :Folia primatologica; international journal of primatology 2004
Sarah F Brosnan Frans B M de Waal

We evaluated the response of brown capuchin monkeys to two differentially valued tokens in an experimental exchange situation akin to a simple barter. Monkeys were given a series of three tests to evaluate their ability to associate tokens with food, then their responses were examined in a barter situation in which tokens were either limited or unlimited. Capuchins did not perform barter in the...

2015
Daqi Dong Stan Franklin

We present a new model of sensorimotor learning in a systems-level cognitive model, LIDA. Sensorimotor learning helps an agent properly interact with its environment using past experiences. This new model stores and updates the rewards of pairs of data, motor commands and their contexts, using the concept of reinforcement learning; thus the agent is able to generate (output) effective commands ...

Journal: :European Transactions on Telecommunications 2010
Hany Kamal Marceau Coupechoux Philippe Godlewski Jean Marc Kelif

Due to the increasing demands for higher data rate applications, also due to the actual spectrum crowd situation, DSA (Dynamic Spectrum Access) turned into an active research topic. In this paper, we analyze DSA in cellular networks context, where a CAB (Coordinated Access Band) is shared between RANs (Radio Access Networks). We propose an SMDP (Semi Markov Decision Process) approach to derive ...

Journal: :NeuroImage 2010
Paul A. Howard-Jones Rafal Bogacz Jee H. Yoo Ute Leonards Skevi Demetriou

Learning from competitors poses a challenge for existing theories of reward-based learning, which assume that rewarded actions are more likely to be executed in the future. Such a learning mechanism would disadvantage a player in a competitive situation because, since the competitor's loss is the player's gain, reward might become associated with an action the player should themselves avoid. Us...

2012
Karolina M. Lempert Anthony J. Porcelli Mauricio R. Delgado Elizabeth Tricomi

Delay discounting refers to the reduction of the value of a future reward as the delay to that reward increases. The rate at which individuals discount future rewards varies as a function of both individual and contextual differences, and high delay discounting rates have been linked with problematic behaviors, including drug abuse and gambling. The current study investigated the effects of acu...

Journal: :The Journal of neuroscience : the official journal of the Society for Neuroscience 2012
Ilya E Monosov Okihide Hikosaka

The ventromedial prefrontal cortex (vmPFC) is thought to be related to emotional experience and to the processing of stimulus and action values. However, little is known about how single vmPFC neurons process the prediction and reception of rewards and punishments. We recorded from monkey vmPFC neurons in an experimental situation with alternating blocks, one in which rewards were delivered and...

2013
Felix Oswald Uta Sailer

Various neuroimaging studies have detected brain regions involved in discounting the value of temporally delayed rewards. This study used slow cortical potentials (SCPs) to elaborate the time course of cognitive processing during temporal discounting. Depending on their strength of discounting, subjects were categorised as low and high impulsive. Low impulsives, but not high impulsives, showed ...

1991
Richard S. Sutton

Dyna is an AI architecture that integrates learning, planning, and reactive execution. Learning methods are used in Dyna both for compiling planning results and for updating a model of the eeects of the agent's actions on the world. Planning is incre-mental and can use the probabilistic and ofttimes incorrect world models generated by learning processes. Execution is fully reactive in the sense...

Journal: :Proceedings of the National Academy of Sciences of the United States of America 2009
Friederike Range Lisa Horn Zsófia Viranyi Ludwig Huber

One crucial element for the evolution of cooperation may be the sensitivity to others' efforts and payoffs compared with one's own costs and gains. Inequity aversion is thought to be the driving force behind unselfish motivated punishment in humans constituting a powerful device for the enforcement of cooperation. Recent research indicates that non-human primates refuse to participate in cooper...

2016
Ana María Jiménez-García Leandro Ruíz-Leyva Cruz Miguel Cendán Carmen Torres Mauricio R. Papini Ignacio Morón

Reduced sensitivity to physical pain (hypoalgesia) has been reported after events involving reward devaluation. Reward devaluation was implemented in a consummatory successive negative contrast (cSNC) task. Food-deprived Wistar rats had access to 32% sucrose during 16 sessions followed by access to 4% sucrose during 3 additional sessions. An unshifted control group had access to 4% sucrose thro...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید