Frames-of-Reference-Based Learning: Overcoming Perceptual Aliasing in Multistep Decision-Making Tasks

Abstract

Perceptual aliasing challenges reinforcement learning agents. They struggle to learn stable policies by failing to identify and disambiguate perceptually identical states in the environment that require different actions to reach a goal. As an agent often has only a local frame of reference, it cannot represent the global environment. Frame-of-reference-based learning is a feature of vertebrate intelligence that allows multiple simultaneous representations of an environment at different levels of abstraction. This enables the resolution of patterns that are made up of features. The evolutionary computation technique of learning classifier systems has shown promise on nested patterns in single-step domains. This work uses the frame-of-reference concept within a learning classifier system to solve non-Markov multistep domains. Considering aliased states at a constituent level enables the system to place them appropriately in holistic-level policies. Instead of enumerating the huge search space, evolution empowers the novel system to evolve fitter rules. The experimental results show that the system effectively solves complex aliased environments that have been challenging for artificial agents. For example, the system utilizes 6.5, 3.71, and 3.22 steps to resolve Maze10, Littman57, and Woods102, respectively.
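The notion of perceptually identical states that require different actions can be illustrated with a minimal sketch. The five-cell corridor, the wall-based percept, and every function name below are hypothetical and chosen only for illustration; they are not the paper's environments (Maze10, Littman57, Woods102) or its learning classifier system.

```python
from collections import defaultdict

CORRIDOR_LENGTH = 5  # hypothetical toy corridor with cells 0..4
GOAL = 2             # the goal sits in the middle cell


def observe(cell):
    """Local percept: (wall immediately to the left, wall immediately to the right)."""
    return (cell == 0, cell == CORRIDOR_LENGTH - 1)


def optimal_action(cell):
    """Action that moves a non-goal cell one step closer to the goal."""
    return "right" if cell < GOAL else "left"


def aliased_percepts():
    """Return the percepts that demand more than one optimal action.

    Collects the optimal action of every non-goal cell under its percept,
    then keeps only the percepts shared by cells with conflicting actions.
    """
    actions_by_percept = defaultdict(set)
    for cell in range(CORRIDOR_LENGTH):
        if cell != GOAL:
            actions_by_percept[observe(cell)].add(optimal_action(cell))
    return {p: acts for p, acts in actions_by_percept.items() if len(acts) > 1}


if __name__ == "__main__":
    print(aliased_percepts())
```

In this toy corridor, cells 1 and 3 produce the same percept (no wall on either side) yet need opposite actions, so a memoryless policy that maps each percept to a single action cannot be optimal in both cells.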

Similar resources

The Impact of Perceptual Aliasing on Exploration and Learning in a Dynamic Decision Making Task

Perceptual aliasing arises in situations where multiple, distinct states of the world give rise to the same percept. In this study, we examine how the degree of perceptual aliasing in a task impacts the ability of human agents to learn reward-maximizing decision strategies. Previous work has shown that the presence of perceptual cues that help signal distinct states of the environment can impro...

The Impact of Perceptual Aliasing on Human Learning in a Dynamic Decision Making Task

A crucial problem facing both human and artificial RL agents is correctly perceiving, and interpreting, the current state of the environment. For instance, imagine a traveler staying in an unfamiliar hotel, with each floor and exit decorated identically. Based on perceptual information alone, this guest might experience difficulty learning how to navigate towards his room, since the various hal...

Reinforcement Learning with Perceptual Aliasing: The Perceptual Distinctions Approach

Lonnie Chrisman, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213. Abstract: It is known that perceptual aliasing may significantly diminish the effectiveness of reinforcement learning algorithms [Whitehead and Ballard, 1991]. Perceptual aliasing occurs when multiple situations that are indistinguishable from immediat...

Ranking Efficient Decision Making Units in Data Envelopment Analysis based on Changing Reference Set

One of the drawbacks of Data Envelopment Analysis (DEA) is the lack of discrimination among efficient Decision Making Units (DMUs). One method for removing this difficulty, called changing the reference set, was proposed by Jahanshahloo et al. (2007). The method has some drawbacks. In this paper, a modified method and a new method to overcome these problems are suggested. The main advantage of...

A Connectionist Formulation of Learning in Dynamic Decision-Making Tasks

A formulation of learning in dynamic decision-making tasks is developed, building on the application of control theory to the study of human performance in dynamic decision making and a connectionist approach to motor control. The formulation is implemented as a connectionist model and compared with human subjects in learning a simulated dynamic decision-making task. When the model is pretraine...

Journal

Journal title: IEEE Transactions on Evolutionary Computation

Year: 2022

ISSN: 1941-0026, 1089-778X

DOI: https://doi.org/10.1109/tevc.2021.3102241