Search results for: reward beta model
Number of results: 336113
Previous reports have described that neural activities in midbrain dopamine areas are sensitive to unexpected reward delivery and omission. These activities are correlated with reward prediction error in reinforcement learning models, the difference between predicted reward values and the obtained reward outcome. These findings suggest that the reward prediction error signal in the brain update...
In addition, the original version of this Article quoted an incorrect abbreviation for Azadeh HajiHosseini in the 'How to cite this article' section. This has now been corrected in both the PDF and HTML versions of the paper. This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article'...
The relationship between a reinforcement learning (RL) agent and an asynchronous environment is often ignored. Frequently used models of the interaction between an agent and its environment, such as Markov Decision Processes (MDP) or Semi-Markov Decision Processes (SMDP), do not capture the fact that, in an asynchronous environment, the state of the environment may change during computation per...
Zhang and Zafar proposed a video compression scheme based on the wavelet representation and multiresolution motion compensation (MRMC). In this letter, an additional masking module will be created to further enhance its efficiency. Specifically, between the modules of wavelet decomposition and MRMC, the masking module will be inserted which will construct binary images based on the difference o...
Reinforcement learning agents interacting with a complex environment like the real world are unlikely to behave optimally all the time. If such an agent is operating in real-time under human supervision, now and then it may be necessary for a human operator to press the big red button to prevent the agent from continuing a harmful sequence of actions—harmful either for the agent or for the envi...
Background and objective: Gestational trophoblastic neoplasia (GTN) is a broad spectrum of benign and malignant tumors originating from the human placenta. Although rare, the disease can progress rapidly to a fatal illness, so predicting it at an early stage is of great importance. The aim of this study was to identify a suitable marker for the early prediction of GTN based on the titer trend of ...
Associative learning studies have shown that the anticipation of reward and punishment shapes the representation of sensory stimuli, which is further modulated by dopamine. Less is known about whether and how reward delivery activates sensory cortices and the role of dopamine at that time point of learning. We used an appetitive instrumental learning task in which participants had to learn that...
Abstract: The main aim of this article is to examine the relationship between acculturation strategies and the mental health of migrants in the city of Kermanshah. Today, a considerable share of people migrate from their birthplace to regions that offer better prospects, in order to improve conditions for themselves and their children. In many cases these migrants encounter environments that, culturally, socially, and econ...