discrete action reinforcement learning automata darla

نتایج جستجو برای: discrete action reinforcement learning automata darla

تعداد نتایج: 1357117 فیلتر نتایج به سال:

State-coupled replicator dynamics

2009

Daniel Hennes Karl Tuyls Matthias Rauterberg

This paper introduces a new model, i.e. state-coupled replicator dynamics, expanding the link between evolutionary game theory and multiagent reinforcement learning to multistate games. More precisely, it extends and improves previous work on piecewise replicator dynamics, a combination of replicators and piecewise models. The contributions of the paper are twofold. One, we identify and explain...

متن کامل

MEBRL: Memory Evolution Based Reinforcement Learning Algorithm of MAS

2001

Le Chang Jiaben Yang

A memory-evolution-based MAS reinforcement learning algorithm (MEBRL) inspired by a psychology memory model is presented. 3 types of different memory stores are used in the design of the algorithm and Learning Automata is used in the processes of agent memory evolution. Through the memory evolution procedure, the agent in the MAS could make a proper decision and share its information indirectly...

متن کامل

web pages ranking algorithm based on reinforcement learning and user feedback

Journal: :journal of ai and data mining 2015

v. derhami j. paksima h. khajah

the main challenge of a search engine is ranking web documents to provide the best response to a user`s query. despite the huge number of the extracted results for user`s query, only a small number of the first results are examined by users; therefore, the insertion of the related results in the first ranks is of great importance. in this paper, a ranking algorithm based on the reinforcement le...

متن کامل

Aspiration-based Perturbed Learning Automata

2018

Georgios C. Chasparis

This paper introduces a novel payoff-based learning scheme for distributed optimization in repeatedly-played strategic-form games. Standard reinforcement-based learning schemes exhibit several limitations with respect to their asymptotic stability. For example, in two-player coordination games, payoff-dominant (or efficient) Nash equilibria may not be stochastically stable. In this work, we pre...

متن کامل

using belbic based optimal controller for omni-directional threewheel robots model identified by lolimot

Journal: :journal of computer and robotics 0

maziar ahmad sharbafi university of tehran caro lucas university of tehran aida mohammadinejad khaje nasir toosi university

in this paper, an intelligent controller is applied to control omni-directional robots motion. first, the dynamics of the three wheel robots, as a nonlinear plant with considerable uncertainties, is identified using an efficient algorithm of training, named lolimot. then, an intelligent controller based on brain emotional learning algorithm is applied to the identified model. this emotional lea...

متن کامل

Multiple Model-Based Reinforcement Learning

Journal: :Neural computation 2002

Kenji Doya Kazuyuki Samejima Ken-ichi Katagiri Mitsuo Kawato

We propose a modular reinforcement learning architecture for nonlinear, nonstationary control tasks, which we call multiple model-based reinforcement learning (MMRL). The basic idea is to decompose a complex task into multiple domains in space and time based on the predictability of the environmental dynamics. The system is composed of multiple modules, each of which consists of a state predict...

متن کامل

An ART-based fuzzy adaptive learning control network

Journal: :IEEE transactions on neural networks 1996

Cheng-Jian Lin Chin-Teng Lin

This paper proposes a reinforcement fuzzy adaptive learning control network (RFALCON), constructed by integrating two fuzzy adaptive learning control networks (FALCON), each of which has a feedforward multilayer network and is developed for the realization of a fuzzy controller. One FALCON performs as a critic network (fuzzy predictor), the other as an action network (fuzzy controller). Using t...

متن کامل

The Self Organization of Context for Learning in MultiAgent Games

2006

Christopher D. White Dave Brogan

Reinforcement learning is an effective machine learning paradigm in domains represented by compact and discrete state-action spaces. In high-dimensional and continuous domains, tile coding with linear function approximation has been widely used to circumvent the curse of dimensionality, but it suffers from the drawback that human-guided identification of features is required to create effective...

متن کامل

Multi-Objective Learning Automata for Design and Optimization a Two-Stage CMOS Operational Amplifier

Journal: Iranian Journal of Electrical and Electronic Engineering 2020

N. Sayyadi Shahraki, S. H. Zahiri,

In this paper, we propose an efficient approach to design optimization of analog circuits that is based on the reinforcement learning method. In this work, Multi-Objective Learning Automata (MOLA) is used to design a two-stage CMOS operational amplifier (op-amp) in 0.25μm technology. The aim is optimizing power consumption and area so as to achieve minimum Total Optimality Index (TOI), as a new...

متن کامل

Mixed Deep Reinforcement Learning Considering Discrete-continuous Hybrid Action Space for Smart Home Energy Management

Journal: :Journal of modern power systems and clean energy 2022

This paper develops deep reinforcement learning (DRL) algorithms for optimizing the operation of home energy system which consists photovoltaic (PV) panels, battery storage system, and household appliances. Model-free DRL can efficiently handle difficulty modeling uncertainty PV generation. However, discrete-continuous hybrid action space considered challenges existing either discrete actions o...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید