Learning to Avoid Risky Actions

نویسندگان

  • María Malfaz
  • Miguel Angel Salichs
چکیده

When a reinforcement learning agent executes actions that can cause frequently damages to itself, it can learn, by using Q-learning, that these actions must not be executed again. However, there are other actions that do not cause damage frecuently, only once in a while: risky actions, such as parachuting. These actions may imply a big punishment to the agent and, depending on its personality, it would be better to avoid. Nevertheless, using the standard Q-learning algorithm the agent is not able to learn to avoid them, since the result of these actions can be positive in average. In this paper, an additional mechanism to Q-learning, inspired by the emotion of fear, is introduced in order to deal with those risky actions by considering the worst results of them. Moreover, there is a daring factor for adjusting the consideration of the risk. This mechanism is implemented on an autonomous agent living in a virtual environment. The results present the performance of the agent with different daring degrees.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Agent-based Reinforcement Learning Model for Simulating Driver Heterogeneous Behavior during Safety Critical Events in Traffic

Driving behavior in traffic has been modeled quite successfully in simulation software using predefined car-following models rules. However, because most car-following models assume that vehicles could keep a safety distance away to avoid crash related conflicts; they are not capable to capture naturalistic driving behavior during safety-critical events. Also, vehicle detailed lateral maneuveri...

متن کامل

Policy Improvement through Safe Reinforcement Learning in High-Risk Tasks

Reinforcement Learning (RL) methods are widely used for dynamic control tasks. In many cases, these are high risk tasks where the trial and error process may select actions which execution from unsafe states can be catastrophic. In addition, many of these tasks have continuous state and action spaces, making the learning problem harder and unapproachable with conventional RL algorithms. So, whe...

متن کامل

Learning Pareto-optimal Solutions in 2x2 Conflict Games

Multiagent learning literature has investigated iterated two-player games to develop mechanisms that allow agents to learn to converge on Nash Equilibrium strategy profiles. Such equilibrium configurations imply that no player has the motivation to unilaterally change its strategy. Often, in general sum games, a higher payoff can be obtained by both players if one chooses not to respond myopica...

متن کامل

Precautionary Behavior in Response to Perceived Threat of Pandemic Influenza

Faced with an epidemic of an infectious disease, persons may take precautionary actions to try to reduce their risk. Such actions include avoiding situations that persons perceive to be risky, which can have negative health and economic effects. Therefore, we conducted a population-based survey of persons' precautionary actions in response to a hypothetical influenza pandemic. For the 5 Europea...

متن کامل

The Prevalence of Risky Sexual Behaviors and Awareness of STDs Among Temporary Residents of Homeless Shelters in Tehran

Background: Risky sexual behaviors expose people to sexual transmitted diseases. These behaviors are usually common among homeless people, so educational programs would help them to avoid high risk behaviors. Understanding STD awareness and the common types of risky behaviors among homeless people would provide a good context for designing appropriate educational plans. Aim: This study was perf...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Cybernetics and Systems

دوره 42  شماره 

صفحات  -

تاریخ انتشار 2011