
Information Theoretic Approaches for Predictive Models: Results and Analysis

2006

Monica Dinculescu Doina Precup

Learning the internal representation of partially observable environments has proven to be a difficult problem. State representations which rely on prior models, such as partially observable Markov decision processes (POMDPs), are computationally expensive and sensitive to the accuracy of the underlying model dynamics. Recent work by Still and Bialek offers an information-theoretic approach that compre...


Ad Hoc Teamwork by Learning Teammates’ Task (JAAMAS Extended

2016

Francisco S. Melo Alberto Sardinha

We address ad hoc teamwork, where an agent must coordinate with other agents in an unknown common task without pre-defined coordination. We formalize the ad hoc teamwork problem as a sequential decision problem and propose (i) the use of an online learning approach that considers the different tasks depending on their ability to predict the behavior of the teammate; and (ii) a decision-theoreti...


Cops and invisible robbers: The cost of drunkenness

Journal: Theor. Comput. Sci. 2013

Athanasios Kehagias Dieter Mitsche Pawel Pralat

We examine a version of the Cops and Robber (CR) game in which the robber is invisible, i.e., the cops do not know his location until they capture him. Apparently this game (CiR) has received little attention in the CR literature. We examine two variants: in the first the robber is adversarial (he actively tries to avoid capture); in the second he is drunk (he performs a random walk). Our goal ...


Planning in Partially Observable Domains with Fuzzy Epistemic States and Probabilistic Dynamics

2015

Nicolas Drougard Didier Dubois Jean-Loup Farges Florent Teichteil-Königsbuch

A new translation from Partially Observable MDP into Fully Observable MDP is described here. Unlike the classical translation, the resulting problem state space is finite, making MDP solvers able to solve this simplified version of the initial partially observable problem: this approach encodes agent beliefs with fuzzy measures over states, leading to an MDP whose state space is a finite set of...


Accelerated Vector Pruning for Optimal POMDP Solvers

2017

Erwin Walraven Matthijs T. J. Spaan

Partially Observable Markov Decision Processes (POMDPs) are powerful models for planning under uncertainty in partially observable domains. However, computing optimal solutions for POMDPs is challenging because of the high computational requirements of POMDP solution algorithms. Several algorithms use a subroutine to prune dominated vectors in value functions, which requires a large number of l...


Explicit temporal models for decision-theoretic planning of clinical management

Journal: Artificial Intelligence in Medicine 1999

Niels Peek

The management of patients over a prolonged period of time is a complicated task involving both diagnostic and prognostic reasoning with incomplete and often uncertain knowledge. Various formalisations of this type of task exist, but these often conceal one or more essential ingredients of the problem. This article explores the suitability of partially observable Markov decision processes to fo...


Permissive Finite-State Controllers of POMDPs using Parameter Synthesis

Journal: CoRR 2017

Sebastian Junges Nils Jansen Ralf Wimmer Tim Quatmann Leonore Winterer Joost-Pieter Katoen Bernd Becker

We study finite-state controllers (FSCs) for partially observable Markov decision processes (POMDPs). The key insight is that computing (randomized) FSCs on POMDPs is equivalent to synthesis for parametric Markov chains (pMCs). This correspondence enables using parameter synthesis techniques to compute FSCs for POMDPs in a black-box fashion. We investigate how typical restrictions on parameter ...


A Bayesian Framework for Modeling Confidence in Perceptual Decision Making

2015

Koosha Khalvati Rajesh P. Rao

The degree of confidence in one’s choice or decision is a critical aspect of perceptual decision making. Attempts to quantify a decision maker’s confidence by measuring accuracy in a task have yielded limited success because confidence and accuracy are typically not equal. In this paper, we introduce a Bayesian framework to model confidence in perceptual decision making. We show that this model...


Dynamic Decision Making in Stochastic

1997

Milos Hauskrecht

The focus of this paper is the framework of partially observable Markov decision processes (POMDPs) and its role in modeling and solving complex dynamic decision problems in stochastic and partially observable medical domains. The paper summarizes some of the basic features of the POMDP framework and explores its potential in solving the problem of the management of the patient with chronic isc...


Identifying and exploiting weak-information inducing actions in solving POMDPs

2011

Ekhlas Sonu Prashant Doshi

We present a method for identifying actions that lead to observations which are only weakly informative in the context of partially observable Markov decision processes (POMDPs). We call such actions weak- (inclusive of zero-) information inducing. Policy subtrees rooted at these actions may be computed more efficiently. While zero-information inducing actions may be exploited without error, th...
