Learning to Avoid Risky Actions
نویسندگان
چکیده
When a reinforcement learning agent executes actions that can cause frequently damages to itself, it can learn, by using Q-learning, that these actions must not be executed again. However, there are other actions that do not cause damage frecuently, only once in a while: risky actions, such as parachuting. These actions may imply a big punishment to the agent and, depending on its personality, it would be better to avoid. Nevertheless, using the standard Q-learning algorithm the agent is not able to learn to avoid them, since the result of these actions can be positive in average. In this paper, an additional mechanism to Q-learning, inspired by the emotion of fear, is introduced in order to deal with those risky actions by considering the worst results of them. Moreover, there is a daring factor for adjusting the consideration of the risk. This mechanism is implemented on an autonomous agent living in a virtual environment. The results present the performance of the agent with different daring degrees.
منابع مشابه
An Agent-based Reinforcement Learning Model for Simulating Driver Heterogeneous Behavior during Safety Critical Events in Traffic
Driving behavior in traffic has been modeled quite successfully in simulation software using predefined car-following models rules. However, because most car-following models assume that vehicles could keep a safety distance away to avoid crash related conflicts; they are not capable to capture naturalistic driving behavior during safety-critical events. Also, vehicle detailed lateral maneuveri...
متن کاملPolicy Improvement through Safe Reinforcement Learning in High-Risk Tasks
Reinforcement Learning (RL) methods are widely used for dynamic control tasks. In many cases, these are high risk tasks where the trial and error process may select actions which execution from unsafe states can be catastrophic. In addition, many of these tasks have continuous state and action spaces, making the learning problem harder and unapproachable with conventional RL algorithms. So, whe...
متن کاملLearning Pareto-optimal Solutions in 2x2 Conflict Games
Multiagent learning literature has investigated iterated two-player games to develop mechanisms that allow agents to learn to converge on Nash Equilibrium strategy profiles. Such equilibrium configurations imply that no player has the motivation to unilaterally change its strategy. Often, in general sum games, a higher payoff can be obtained by both players if one chooses not to respond myopica...
متن کاملPrecautionary Behavior in Response to Perceived Threat of Pandemic Influenza
Faced with an epidemic of an infectious disease, persons may take precautionary actions to try to reduce their risk. Such actions include avoiding situations that persons perceive to be risky, which can have negative health and economic effects. Therefore, we conducted a population-based survey of persons' precautionary actions in response to a hypothetical influenza pandemic. For the 5 Europea...
متن کاملThe Prevalence of Risky Sexual Behaviors and Awareness of STDs Among Temporary Residents of Homeless Shelters in Tehran
Background: Risky sexual behaviors expose people to sexual transmitted diseases. These behaviors are usually common among homeless people, so educational programs would help them to avoid high risk behaviors. Understanding STD awareness and the common types of risky behaviors among homeless people would provide a good context for designing appropriate educational plans. Aim: This study was perf...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Cybernetics and Systems
دوره 42 شماره
صفحات -
تاریخ انتشار 2011