نتایج جستجو برای: discrete action reinforcement learning automata darla
تعداد نتایج: 1357117 فیلتر نتایج به سال:
This paper introduces a new model, i.e. state-coupled replicator dynamics, expanding the link between evolutionary game theory and multiagent reinforcement learning to multistate games. More precisely, it extends and improves previous work on piecewise replicator dynamics, a combination of replicators and piecewise models. The contributions of the paper are twofold. One, we identify and explain...
A memory-evolution-based MAS reinforcement learning algorithm (MEBRL) inspired by a psychology memory model is presented. 3 types of different memory stores are used in the design of the algorithm and Learning Automata is used in the processes of agent memory evolution. Through the memory evolution procedure, the agent in the MAS could make a proper decision and share its information indirectly...
the main challenge of a search engine is ranking web documents to provide the best response to a user`s query. despite the huge number of the extracted results for user`s query, only a small number of the first results are examined by users; therefore, the insertion of the related results in the first ranks is of great importance. in this paper, a ranking algorithm based on the reinforcement le...
This paper introduces a novel payoff-based learning scheme for distributed optimization in repeatedly-played strategic-form games. Standard reinforcement-based learning schemes exhibit several limitations with respect to their asymptotic stability. For example, in two-player coordination games, payoff-dominant (or efficient) Nash equilibria may not be stochastically stable. In this work, we pre...
in this paper, an intelligent controller is applied to control omni-directional robots motion. first, the dynamics of the three wheel robots, as a nonlinear plant with considerable uncertainties, is identified using an efficient algorithm of training, named lolimot. then, an intelligent controller based on brain emotional learning algorithm is applied to the identified model. this emotional lea...
We propose a modular reinforcement learning architecture for nonlinear, nonstationary control tasks, which we call multiple model-based reinforcement learning (MMRL). The basic idea is to decompose a complex task into multiple domains in space and time based on the predictability of the environmental dynamics. The system is composed of multiple modules, each of which consists of a state predict...
This paper proposes a reinforcement fuzzy adaptive learning control network (RFALCON), constructed by integrating two fuzzy adaptive learning control networks (FALCON), each of which has a feedforward multilayer network and is developed for the realization of a fuzzy controller. One FALCON performs as a critic network (fuzzy predictor), the other as an action network (fuzzy controller). Using t...
Reinforcement learning is an effective machine learning paradigm in domains represented by compact and discrete state-action spaces. In high-dimensional and continuous domains, tile coding with linear function approximation has been widely used to circumvent the curse of dimensionality, but it suffers from the drawback that human-guided identification of features is required to create effective...
Multi-Objective Learning Automata for Design and Optimization a Two-Stage CMOS Operational Amplifier
In this paper, we propose an efficient approach to design optimization of analog circuits that is based on the reinforcement learning method. In this work, Multi-Objective Learning Automata (MOLA) is used to design a two-stage CMOS operational amplifier (op-amp) in 0.25μm technology. The aim is optimizing power consumption and area so as to achieve minimum Total Optimality Index (TOI), as a new...
This paper develops deep reinforcement learning (DRL) algorithms for optimizing the operation of home energy system which consists photovoltaic (PV) panels, battery storage system, and household appliances. Model-free DRL can efficiently handle difficulty modeling uncertainty PV generation. However, discrete-continuous hybrid action space considered challenges existing either discrete actions o...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید