Reinforcement Learning Through Gradient Descent

نویسندگان

  • Leemon C. Baird
  • Tom Mitchell
  • Scott Fahlman
  • Leslie Kaelbling
چکیده

1

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-robot Reinforcement Learning Based On Learning Classifier System with Gradient Descent Methods

This paper proposed a robot reinforcement learning method based on learning classifier system. A Learning Classifier System is a accuracy-based machine learning system with gradient descent that combines reinforcement learning and rule discovery system. The genetic algorithm and the covering operator act as innovation discovery components which are responsible for discovering new better reinfor...

متن کامل

Reinforcement Learning for Adaptive Theory of Mind in the Sigma Cognitive Architecture

One of the most common applications of human intelligence is social interaction, where people must make effective decisions despite uncertainty about the potential behavior of others around them. Reinforcement learning (RL) provides one method for agents to acquire knowledge about such interactions. We investigate different methods of multiagent reinforcement learning within the Sigma cognitive...

متن کامل

Scaled Gradient Descent Learning Rate - Reinforcement Learning with Light-Seeking Robot

Adaptive behaviour through machine learning is challenging in many real-world applications such as robotics. This is because learning has to be rapid enough to be performed in real time and to avoid damage to the robot. Models using linear function approximation are interesting in such tasks because they offer rapid learning and have small memory and processing requirements. Adalines are a simp...

متن کامل

Reinforcement Learning in a Noisy Environment: Light-seeking Robot

Despite many promising results from the use of reinforcement learning in simulated robot worlds, its use in real robot worlds is relatively rare. This paper addresses challenges related to real robot worlds and shows how reinforcement learning combined with linear function approximation can solve many of them. Experiments are performed using a light-seeking robot built with the Lego Mindstorms ...

متن کامل

Reinforcement Learning for Adaptive Routing

Reinforcement learning means learning a policy—a mapping of observations into actions— based on feedback from the environment. The learning can be viewed as browsing a set of policies while evaluating them by trial through interaction with the environment. We present an application of gradient ascent algorithm for reinforcement learning to a complex domain of packet routing in network communica...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999