Learning to Plan Probabilistically

نویسنده

  • Ron Sun
چکیده

This paper discusses the learning of probabilistic planning without a priori domain-specific knowledge. Different from existing reinforcement learning algorithms that generate only reactive policies and existing probabilistic planning algorithms that requires a substantial amount of a priori knowledge in order to plan, we devise a two-stage bottom-up learning-to-plan process, in which first reinforcement learning/dynamic programming is applied, without the use of a priori domainspecific knowledge, to acquire a reactive policy and then explicit plans are extracted from the learned reactive policy. Plan extraction is based on a beam search algorithm that performs temporal projection in a restricted fashion guided by the value functions resulting from reinforcement learning/dynamic programming.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning to Plan Probabilistically from Neural Networks

| This paper discusses the learning of probabilis-tic planning without a priori domain-speciic knowledge. Diierent from existing reinforcement learning algorithms that generate only reactive policies and existing probabilis-tic planning algorithms that requires a substantial amount of a priori knowledge in order to plan, we devise a two-stage bottom-up learning-to-plan process, in which rst rei...

متن کامل

Probabilistically Reusing Plans in Deterministic Planning

Inspired by probabilistic path planning, we contribute a planning approach that probabilistically balances heuristics and past plans as guidance to planning search. Our ERRT-PLAN algorithm generates multiple search branches probabilistically choosing to extend them towards the current goal or towards actions or goals of a past given plan. We have defined domains to show where current techniques...

متن کامل

KR3L: An Architecture for Knowledge Representation, Reasoning and Learning in Human-Robot Collaboration

This paper describes an architecture that combines the complementary strengths of declarative programming, probabilistic graphical models, and reinforcement learning. Reasoning with different descriptions of incomplete domain knowledge and uncertainty is based on tightly-coupled representations at two different resolutions. For any given goal, non-monotonic logical inference with the coarse-res...

متن کامل

The Effects of Educational Planning on Learning of Occupational Health Students

Background: Presentation of course plan were advised but presentations of  lesson plans were useful too. These items can introduce the course and lessons of each session to students.  The objective of this study was the determination of the effects of educational planning on learning of occupational diseases. Methods: This study was a semi experimental study which was conducted by using the cur...

متن کامل

Real-Time Predictive Modeling and Robust Avoidance of Pedestrians with Uncertain, Changing Intentions

To plan safe trajectories in urban environments, autonomous vehicles must be able to quickly assess the future intentions of dynamic agents. Pedestrians are particularly challenging to model, as their motion patterns are often uncertain and/or unknown a priori. This paper presents a novel changepoint detection and clustering algorithm that, when coupled with offline unsupervised learning of a G...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002