Search results for: partially non
Number of results: 1430292
In this paper, we describe how techniques from reinforcement learning might be used to approach the problem of acting under uncertainty. We start by introducing the theory of partially observable Markov decision processes (POMDPs) to describe what we call hidden state problems. After a brief review of other POMDP solution techniques, we motivate reinforcement learning by considering an agent wi...
Given the problem of planning actions for situations with uncertainty about the action outcomes, Markov models can effectively model this uncertainty and offer optimal actions. When the information about the world state is itself uncertain, partially observable Markov models are an appropriate extension to the basic Markov model. However, finding optimal actions for partially observable Markov mod...
In this paper we discuss how communication can be used advantageously for cooperative navigation in sparse environments. Specifically, we analyze the tradeoff between the cost of communication and the efficient completion of the navigation task. We make use of a partially observable Markov decision process (POMDP) to model the navigation task, since this model allows us to explicitly consider...
Behavioral tasks are often used to study the different memory systems present in humans and animals. Such tasks are usually designed to isolate and measure some aspect of a single memory system. However, it is not necessarily clear that any given task actually does isolate a system or that the strategy used by a subject in the experiment is the one desired by the experimenter. We have previousl...
In this paper we consider the problem of representing and reasoning about systems, especially probabilistic systems, with hidden state. We consider transition systems where the state is not completely visible to an outside observer. Instead, there are observables that partly identify the state. We show that one can interchange the notions of state and observation and obtain what we call a dual ...
As general purpose robots become more capable, pre-programming of all tasks at the factory will become less practical. We would like for non-technical human owners to be able to communicate, through interaction with their robot, the details of a new task; we call this interaction “task communication”. During task communication the robot must infer the details of the task from unstructured human...
In this paper, we describe the partially observable Markov decision process (POMDP) approach to finding optimal or near-optimal control strategies for partially observable stochastic environments, given a complete model of the environment. The POMDP approach was originally developed in the operations research community and provides a formal basis for planning problems that have been of interest t...
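As a concrete illustration of the POMDP machinery this abstract refers to, the following is a minimal sketch (a toy model in Python, not anything from the cited paper) of the belief-state update at the heart of the approach: after taking action a and observing o, the belief over hidden states is updated by Bayes' rule, b'(s') proportional to O(o | s', a) * sum_s T(s' | s, a) b(s).

import numpy as np

def update_belief(belief, action, obs, T, O):
    # b'(s') ∝ O[a, s', o] * sum_s T[a, s, s'] * b(s)
    predicted = belief @ T[action]                # predicted next-state distribution
    unnormalized = O[action, :, obs] * predicted  # weight by observation likelihood
    return unnormalized / unnormalized.sum()      # renormalize to a valid belief

# Tiny illustrative model: 2 hidden states, 1 action, 2 observations.
T = np.array([[[0.9, 0.1],
               [0.2, 0.8]]])   # T[a, s, s']
O = np.array([[[0.8, 0.2],
               [0.3, 0.7]]])   # O[a, s', o]
belief = np.array([0.5, 0.5])
print(update_belief(belief, action=0, obs=1, T=T, O=O))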
The response to drought stress is a complicated process involving stress sensing, intracellular signal transduction, and the execution of a cellular response. Transcription factors play important roles in the signaling pathways that mediate responses to abiotic stress. In the present study, a rice NF-YA transcription factor gene was partially characterized following dehydration. Disrupting the gene via a T...
We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a model. Our approach is based on the following observation: Any (PO)MDP can be transformed into an “equivalent” POMDP in which all state transitions (given the current state and action) are deterministic. This reduces the...
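The transformation this abstract relies on can be illustrated with a short sketch; this is an assumed reading of the idea rather than the authors' code: stochastic transitions become deterministic once the random draws are fixed in advance and folded into an augmented (hidden) state, so all remaining uncertainty is over the state itself.

import numpy as np

def make_deterministic_step(stochastic_step, horizon, rng=np.random.default_rng(0)):
    # Pre-draw one uniform random number per time step; the augmented state
    # (s, t) together with the fixed draws then evolves deterministically.
    seeds = rng.uniform(size=horizon)

    def step(aug_state, action):
        s, t = aug_state
        s_next = stochastic_step(s, action, seeds[t])  # uses the fixed draw for step t
        return (s_next, t + 1)

    return step

# Illustrative stochastic dynamics: move right with probability 0.7, else stay.
def stochastic_step(s, a, u):
    return s + 1 if u < 0.7 else s

step = make_deterministic_step(stochastic_step, horizon=5)
state = (0, 0)
for _ in range(3):
    state = step(state, action=None)
print(state)  # identical on every run: the randomness now lives in the state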
We present an active-perception strategy to optimize the temperature program of metal-oxide sensors in real time, as the sensor reacts with its environment. We model the problem as a partially observable Markov decision process (POMDP), where actions correspond to measurements at particular temperatures, and the agent is to find a temperature sequence that minimizes the Bayes risk. We validate ...
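The Bayes-risk criterion mentioned above can be made concrete with a hedged one-step sketch (toy likelihoods and hypothetical function names, not the authors' sensor model): for each candidate temperature, compute the expected 0/1 Bayes risk of the posterior after the measurement and choose the temperature that minimizes it.

import numpy as np

def expected_bayes_risk(belief, likelihood):
    # likelihood[c, o] = P(observation o | class c) at this temperature.
    risk = 0.0
    for o in range(likelihood.shape[1]):
        p_o = belief @ likelihood[:, o]               # predictive probability of o
        if p_o > 0:
            posterior = belief * likelihood[:, o] / p_o
            risk += p_o * (1.0 - posterior.max())     # 0/1-loss Bayes risk
    return risk

def choose_temperature(belief, likelihoods_by_temp):
    risks = [expected_bayes_risk(belief, L) for L in likelihoods_by_temp]
    return int(np.argmin(risks))

# Two analyte classes, two candidate temperatures, binary sensor response.
likelihoods = [np.array([[0.5, 0.5], [0.5, 0.5]]),   # uninformative temperature
               np.array([[0.9, 0.1], [0.2, 0.8]])]   # discriminative temperature
print(choose_temperature(np.array([0.5, 0.5]), likelihoods))  # -> 1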