MEBRL: Memory Evolution Based Reinforcement Learning Algorithm of MAS
نویسندگان
چکیده
A memory-evolution-based MAS reinforcement learning algorithm (MEBRL) inspired by a psychology memory model is presented. 3 types of different memory stores are used in the design of the algorithm and Learning Automata is used in the processes of agent memory evolution. Through the memory evolution procedure, the agent in the MAS could make a proper decision and share its information indirectly. A multi-agent multiresource stochastic system model is used to illustrate the performance of the algorithm, and the comparison of the memory-evolution-based MAS reinforcement learning algorithm and other MAS learning algorithm is given.
منابع مشابه
A MAS Reinforcement Learning Approach for Indeterministic Multi-Layer Job-Shop Scheduling
The indeterministic multi-layer job-shop scheduling problem, which is the extension of the traditional job-shop scheduling, is introduced in this paper. The framework and some key issues of the problem are discussed. A multi-agent reinforcement learning approach, named memory-evolution-based MAS reinforcement learning algorithm, is breifly introduced too. Experiment results show that our approa...
متن کاملDynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)
In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic Environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...
متن کاملOperation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm
: In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it firstly is formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented t...
متن کاملWeb pages ranking algorithm based on reinforcement learning and user feedback
The main challenge of a search engine is ranking web documents to provide the best response to a user`s query. Despite the huge number of the extracted results for user`s query, only a small number of the first results are examined by users; therefore, the insertion of the related results in the first ranks is of great importance. In this paper, a ranking algorithm based on the reinforcement le...
متن کاملRRLUFF: Ranking function based on Reinforcement Learning using User Feedback and Web Document Features
Principal aim of a search engine is to provide the sorted results according to user’s requirements. To achieve this aim, it employs ranking methods to rank the web documents based on their significance and relevance to user query. The novelty of this paper is to provide user feedback-based ranking algorithm using reinforcement learning. The proposed algorithm is called RRLUFF, in which the rank...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001