limitedunlimited partially gated

نتایج جستجو برای: limitedunlimited partially gated

تعداد نتایج: 159378 فیلتر نتایج به سال:

RAPID: A Reachable Anytime Planner for Imprecisely-sensed Domains

2010

Emma Brunskill Stuart J. Russell

Despite the intractability of generic optimal partially observable Markov decision process planning, there exist important problems that have highly structured models. Previous researchers have used this insight to construct more efficient algorithms for factored domains, and for domains with topological structure in the flat state dynamics model. In our work, motivated by findings from the edu...

متن کامل

Speeding up Online POMDP Planning - Unification of Observation Branches by Belief-state Compression Via Expected Feature Values

2015

Gavin Rens

A novel algorithm to speed up online planning in partially observable Markov decision processes (POMDPs) is introduced. I propose a method for compressing nodes in beliefdecision-trees while planning occurs. Whereas belief-decision-trees branch on actions and observations, with my method, they branch only on actions. This is achieved by unifying the branches required due to the nondeterminism o...

متن کامل

Dynamic Decision Making in Stochastic Partially Observable Medical Domains: Ischemic Heart Disease Example

2004

Milos Hauskrecht

The focus of this paper is the framework of partially observable Markov decision processes (POMDPs) and its role in modeling and solving complex dynamic decision problems in stochastic and partially observable medical domains. The paper summarizes some of the basic features of the POMDP framework and explores its potential in solving the problem of the management of the patient with chronic isc...

متن کامل

Scalable POMDPs for Diagnosis and Planning in Intelligent Tutoring Systems

2010

Jeremiah T. Folsom-Kovarik Gita Reese Sukthankar Sae Lynne Schatz Denise M. Nicholson

A promising application area for proactive assistant agents is automated tutoring and training. Intelligent tutoring systems (ITSs) assist tutors and tutees by automating diagnosis and adaptive tutoring. These tasks are well modeled by a partially observable Markov decision process (POMDP) since it accounts for the uncertainty inherent in diagnosis. However, an important aspect of making POMDP ...

متن کامل

An Improved Grid-Based Approximation Algorithm for POMDPs

2001

Rong Zhou Eric A. Hansen

Although a partially observable Markov decision process (POMDP) provides an appealing model for problems of planning under uncertainty, exact algorithms for POMDPs are intractable. This motivates work on approximation algorithms, and grid-based approximation is a widely-used approach. We describe a novel approach to grid-based approximation that uses a variable-resolution regular grid, and show...

متن کامل

Hybrid POMDP Algorithms

2006

Sébastien Paquet Brahim Chaib-draa Stéphane Ross

When an agent evolves in a partially observable environment, it has to deal with uncertainties when choosing its actions. An efficient model for such environments is to use partially observable Markov decision processes (POMDPs). Many algorithms have been developed for POMDPs. Some use an offline approach, learning a complete policy before the execution. Others use an online approach, construct...

متن کامل

A Survey of POMDP Applications

2003

Anthony R. Cassandra

An increasing number of researchers in many areas are becoming interested in the application of the partially observable Markov decision process (pomdp) model to problems with hidden state. This model can account for both state transition and observation uncertainty. The majority of recent research interest in the pomdp model has been in the artificial intelligence community and as such, has be...

متن کامل

Learning Policies for Partially Observable Environments: Scaling Up

1995

Michael L. Littman Anthony R. Cassandra Leslie Pack Kaelbling

Partially observable Markov decision processes (pomdp's) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor feedback. While the study of pomdp's is motivated by a need to address realistic problems, existing techniques for nding optimal behavior do not appear to scale well and have been unable to nd satisfactory policies for problem...

متن کامل

Planning under the Uncertainty of the Technical Analysis of Stock Markets

2010

Augusto Cesar Espíndola Baffa Angelo E. M. Ciarlini

The stock market can be considered a nondeterministic and partially observable domain, because investors never know all information that affects prices and the result of an investment is always uncertain. Technical Analysis methods demand only data that are easily available, i.e. the series of prices and trade volumes, and are then very useful to predict current price trends. Analysts have howe...

متن کامل

Regionalized Policy Representation for Reinforcement Learning in POMDPs

2007

Xuejun Liao Hui Li Ronald Parr Lawrence Carin

Many decision-making problems can be formulated in the framework of a partially observable Markov decision process (POMDP) [5]. The optimality of decisions relies on the accuracy of the POMDP model as well as the policy found for the model. In many applications the model is unknown and learned empirically based on experience, and building a model is just as difficult as finding the associated p...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید