Efficient Reinforcement Learning via Probabilistic Trajectory Optimization

نویسندگان

چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adaptive Probabilistic Trajectory Optimization via Efficient Approximate Inference

Robotic systems must be able to quickly and robustly make decisions when op-erating in uncertain and dynamic environments. While Reinforcement Learning(RL) can be used to compute optimal policies with little prior knowledge about theenvironment, it suffers from slow convergence. An alternative approach is ModelPredictive Control (MPC), which optimizes policies quickly, but also ...

متن کامل

Model-Free Trajectory Optimization for Reinforcement Learning

Many of the recent Trajectory Optimization algorithms alternate between local approximation of the dynamics and conservative policy update. However, linearly approximating the dynamics in order to derive the new policy can bias the update and prevent convergence to the optimal policy. In this article, we propose a new model-free algorithm that backpropagates a local quadratic time-dependent Q-F...

متن کامل

Efficient Reinforcement Learning with Bayesian Optimization

OF THE DISSERTATION Efficient Reinforcement Learning with Bayesian Optimization By Danyan Ganjali Doctor of Philosophy in Mechanical and Aerospace Engineering University of California, Irvine, 2016 Professor Athanasios Sideris, Chair A probabilistic reinforcement learning algorithm is presented for finding control policies in continuous state and action spaces without a prior knowledge of the d...

متن کامل

Trajectory Optimization using Reinforcement Learning for Map Exploration

Automatically building maps from sensor data is a necessary and fundamental skill for mobile robots; as a result, considerable research attention has focused on the technical challenges inherent in the mapping problem. While statistical inference techniques have led to computationally efficient mapping algorithms, the next major challenge in robotic mapping is to automate the data collection pr...

متن کامل

Using trajectory data to improve bayesian optimization for reinforcement learning

Recently, Bayesian Optimization (BO) has been used to successfully optimize parametric policies in several challenging Reinforcement Learning (RL) applications. BO is attractive for this problem because it exploits Bayesian prior information about the expected return and exploits this knowledge to select new policies to execute. Effectively, the BO framework for policy search addresses the expl...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Neural Networks and Learning Systems

سال: 2018

ISSN: 2162-237X,2162-2388

DOI: 10.1109/tnnls.2017.2764499