نتایج جستجو برای: discrete action reinforcement learning automata darla

تعداد نتایج: 1357117  

2009
Daniel Hennes Karl Tuyls Matthias Rauterberg

This paper introduces a new model, i.e. state-coupled replicator dynamics, expanding the link between evolutionary game theory and multiagent reinforcement learning to multistate games. More precisely, it extends and improves previous work on piecewise replicator dynamics, a combination of replicators and piecewise models. The contributions of the paper are twofold. One, we identify and explain...

2001
Le Chang Jiaben Yang

A memory-evolution-based MAS reinforcement learning algorithm (MEBRL) inspired by a psychology memory model is presented. 3 types of different memory stores are used in the design of the algorithm and Learning Automata is used in the processes of agent memory evolution. Through the memory evolution procedure, the agent in the MAS could make a proper decision and share its information indirectly...

Journal: :journal of ai and data mining 2015
v. derhami j. paksima h. khajah

the main challenge of a search engine is ranking web documents to provide the best response to a user`s query. despite the huge number of the extracted results for user`s query, only a small number of the first results are examined by users; therefore, the insertion of the related results in the first ranks is of great importance. in this paper, a ranking algorithm based on the reinforcement le...

2018
Georgios C. Chasparis

This paper introduces a novel payoff-based learning scheme for distributed optimization in repeatedly-played strategic-form games. Standard reinforcement-based learning schemes exhibit several limitations with respect to their asymptotic stability. For example, in two-player coordination games, payoff-dominant (or efficient) Nash equilibria may not be stochastically stable. In this work, we pre...

Journal: :journal of computer and robotics 0
maziar ahmad sharbafi university of tehran caro lucas university of tehran aida mohammadinejad khaje nasir toosi university

in this paper, an intelligent controller is applied to control omni-directional robots motion. first, the dynamics of the three wheel robots, as a nonlinear plant with considerable uncertainties, is identified using an efficient algorithm of training, named lolimot. then, an intelligent controller based on brain emotional learning algorithm is applied to the identified model. this emotional lea...

Journal: :Neural computation 2002
Kenji Doya Kazuyuki Samejima Ken-ichi Katagiri Mitsuo Kawato

We propose a modular reinforcement learning architecture for nonlinear, nonstationary control tasks, which we call multiple model-based reinforcement learning (MMRL). The basic idea is to decompose a complex task into multiple domains in space and time based on the predictability of the environmental dynamics. The system is composed of multiple modules, each of which consists of a state predict...

Journal: :IEEE transactions on neural networks 1996
Cheng-Jian Lin Chin-Teng Lin

This paper proposes a reinforcement fuzzy adaptive learning control network (RFALCON), constructed by integrating two fuzzy adaptive learning control networks (FALCON), each of which has a feedforward multilayer network and is developed for the realization of a fuzzy controller. One FALCON performs as a critic network (fuzzy predictor), the other as an action network (fuzzy controller). Using t...

2006
Christopher D. White Dave Brogan

Reinforcement learning is an effective machine learning paradigm in domains represented by compact and discrete state-action spaces. In high-dimensional and continuous domains, tile coding with linear function approximation has been widely used to circumvent the curse of dimensionality, but it suffers from the drawback that human-guided identification of features is required to create effective...

In this paper, we propose an efficient approach to design optimization of analog circuits that is based on the reinforcement learning method. In this work, Multi-Objective Learning Automata (MOLA) is used to design a two-stage CMOS operational amplifier (op-amp) in 0.25μm technology. The aim is optimizing power consumption and area so as to achieve minimum Total Optimality Index (TOI), as a new...

Journal: :Journal of modern power systems and clean energy 2022

This paper develops deep reinforcement learning (DRL) algorithms for optimizing the operation of home energy system which consists photovoltaic (PV) panels, battery storage system, and household appliances. Model-free DRL can efficiently handle difficulty modeling uncertainty PV generation. However, discrete-continuous hybrid action space considered challenges existing either discrete actions o...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید