Hybrid POMDP Algorithms

نویسندگان

  • Sébastien Paquet
  • Brahim Chaib-draa
  • Stéphane Ross
چکیده

When an agent evolves in a partially observable environment, it has to deal with uncertainties when choosing its actions. An efficient model for such environments is to use partially observable Markov decision processes (POMDPs). Many algorithms have been developed for POMDPs. Some use an offline approach, learning a complete policy before the execution. Others use an online approach, constructing the policy online for the current belief state. In this article, we present three hybrid algorithms that have been developed to combine the strengths of these two extremes approaches (offline and online). We present results showing that hybrid algorithms can often obtained better results than the online or the offline algorithms alone.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multiagent Teamwork: Hybrid Approaches

Today within the multiagent community, we see at least four competing methods to building multiagent systems: beliefdesire-intention (BDI), distributed constraint optimization (DCOP), distributed POMDPs, and auctions or game-theoretic methods. While there is exciting progress within each approach, there is a lack of cross-cutting research. This article highlights the various hybrid techniques f...

متن کامل

Massively parallel motion planning algorithms under uncertainty using POMDP

We present new parallel algorithms that solve continuous-state partially observable Markov decision process (POMDP) problems using the GPU (gPOMDP) and a hybrid of the GPU and CPU (hPOMDP). We choose the Monte Carlo value iteration (MCVI) method as our base algorithm and parallelize this algorithm using the multi-level parallel formulation of MCVI. For each parallel level, we propose efficient ...

متن کامل

Policy optimization by marginal-map probabilistic inference in generative models

While most current work in POMDP planning focus on the development of scalable approximate algorithms, existing techniques often neglect performance guarantees and sacrifice solution quality to improve efficiency. In contrast, our approach to optimizing POMDP controllers by probabilistic inference and obtaining bounded on solution quality can be summarized as follows: (1) re-formulate POMDP pla...

متن کامل

Two Solvers of Partially Observable Stochastic Hybrid Systems

We consider the problem of verification of safety specifications for hybrid systems with a controller that has access to partially observations of the state. Two kinds of POMDP Solvers with different goals are studied. First, we get a finite state approximation of the stochastic hybrid system in a uniform grid. Then we can solve the dynamic programs using the existed algorithms. Finally, we pro...

متن کامل

POMDP Planning by Marginal-MAP Probabilistic Inference in Generative Models

While most current POMDP planning methods have focused on the development of scalable approximate algorithms, they often neglect the important aspect of solution quality and sacrifice performance guarantees to improve efficiency. In contrast, we propose a novel approach to optimize POMDP controllers by probabilistic inference while obtaining bounds on solution quality as follows: 1) re-formulat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006