Feed-Forward Learning: Fast Reinforcement Learning of Controllers
نویسندگان
چکیده
Reinforcement Learning (RL) approaches are, very often, rendered useless by the statistics of the required sampling process. This paper shows how very fast RL is essentially made possible by abandoning the state feedback during training episodes. The resulting new method, feed-forward learning (FF learning), employs a return estimator for pairs of a state and a feed-forward policy’s parameter vector. FF learning is particularly suitable for the learning of controllers, e.g. for robotics applications, and yields learning rates unprecedented in the RL context. This paper introduces the method formally and proves a lower bound on its performance. Practical results are provided from applying FF learning to several scenarios based on the collision avoidance behavior of a mobile robot.
منابع مشابه
Reinforcement Learning in Associative Memory
A reinforcement learning based associative memory structure (RLAM) is proposed. In this structure, a one-layer feed forward Palm [1] model is applied to the networks. Instead of batch training, an on-line learning method is used to construct the memory. The networks are trained interactively according to reinforcement learning, which is biologically plausible. The experiment results show that t...
متن کاملAn Unsupervised Learning Method for an Attacker Agent in Robot Soccer Competitions Based on the Kohonen Neural Network
RoboCup competition as a great test-bed, has turned to a worldwide popular domains in recent years. The main object of such competitions is to deal with complex behavior of systems whichconsist of multiple autonomous agents. The rich experience of human soccer player can be used as a valuable reference for a robot soccer player. However, because of the differences between real and simulated soc...
متن کاملMini/Micro-Grid Adaptive Voltage and Frequency Stability Enhancement Using Q-learning Mechanism
This paper develops an adaptive control method for controlling frequency and voltage of an islanded mini/micro grid (M/µG) using reinforcement learning method. Reinforcement learning (RL) is one of the branches of the machine learning, which is the main solution method of Markov decision process (MDPs). Among the several solution methods of RL, the Q-learning method is used for solving RL in th...
متن کاملDynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)
In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic Environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...
متن کاملUnsupervised Real Time Obstacle Avoidance Technique Based On ARTMAP And BK-Product Of Fuzzy Relation For Autonomous Underwater Vehicle
The article presents ARTMAP and Fuzzy BKProduct approach underwater obstacle avoidance for the Autonomous underwater Vehicles (AUV). The AUV moves an unstructured area of underwater and obstacles that is might meet in its way and whom AUV might avoid. The AUVs are equipped with complex sensorial systems like camera, aquatic sonar system, and transducers. A Neural integrated Fuzzy BKProduct cont...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007