Feed-Forward Learning: Fast Reinforcement Learning of Controllers

نویسندگان

Marek Musial

Frank Lemke

چکیده

Reinforcement Learning (RL) approaches are, very often, rendered useless by the statistics of the required sampling process. This paper shows how very fast RL is essentially made possible by abandoning the state feedback during training episodes. The resulting new method, feed-forward learning (FF learning), employs a return estimator for pairs of a state and a feed-forward policy’s parameter vector. FF learning is particularly suitable for the learning of controllers, e.g. for robotics applications, and yields learning rates unprecedented in the RL context. This paper introduces the method formally and proves a lower bound on its performance. Practical results are provided from applying FF learning to several scenarios based on the collision avoidance behavior of a mobile robot.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reinforcement Learning in Associative Memory

A reinforcement learning based associative memory structure (RLAM) is proposed. In this structure, a one-layer feed forward Palm [1] model is applied to the networks. Instead of batch training, an on-line learning method is used to construct the memory. The networks are trained interactively according to reinforcement learning, which is biologically plausible. The experiment results show that t...

متن کامل

An Unsupervised Learning Method for an Attacker Agent in Robot Soccer Competitions Based on the Kohonen Neural Network

RoboCup competition as a great test-bed, has turned to a worldwide popular domains in recent years. The main object of such competitions is to deal with complex behavior of systems whichconsist of multiple autonomous agents. The rich experience of human soccer player can be used as a valuable reference for a robot soccer player. However, because of the differences between real and simulated soc...

متن کامل

Mini/Micro-Grid Adaptive Voltage and Frequency Stability Enhancement Using Q-learning Mechanism

This paper develops an adaptive control method for controlling frequency and voltage of an islanded mini/micro grid (M/µG) using reinforcement learning method. Reinforcement learning (RL) is one of the branches of the machine learning, which is the main solution method of Markov decision process (MDPs). Among the several solution methods of RL, the Q-learning method is used for solving RL in th...

متن کامل

Dynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)

In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic Environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...

متن کامل

Unsupervised Real Time Obstacle Avoidance Technique Based On ARTMAP And BK-Product Of Fuzzy Relation For Autonomous Underwater Vehicle

The article presents ARTMAP and Fuzzy BKProduct approach underwater obstacle avoidance for the Autonomous underwater Vehicles (AUV). The AUV moves an unstructured area of underwater and obstacles that is might meet in its way and whom AUV might avoid. The AUVs are equipped with complex sensorial systems like camera, aquatic sonar system, and transducers. A Neural integrated Fuzzy BKProduct cont...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2007

Feed-Forward Learning: Fast Reinforcement Learning of Controllers

نویسندگان

چکیده

منابع مشابه

Reinforcement Learning in Associative Memory

An Unsupervised Learning Method for an Attacker Agent in Robot Soccer Competitions Based on the Kohonen Neural Network

Mini/Micro-Grid Adaptive Voltage and Frequency Stability Enhancement Using Q-learning Mechanism

Dynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)

Unsupervised Real Time Obstacle Avoidance Technique Based On ARTMAP And BK-Product Of Fuzzy Relation For Autonomous Underwater Vehicle

عنوان ژورنال:

اشتراک گذاری