A fundamental problem in cognitive neuroscience is how humans make decisions, act, and behave in relation to other humans. Here we adopt the hypothesis that when we are in an interactive social setting, our brains perform Bayesian inference of the intentions and cooperativeness of others using probabilistic representations. We employ the framework of partially observable Markov decision process...
We present a computational framework for understanding Theory of Mind (ToM): the human capacity for reasoning about agents’ mental states such as beliefs and desires. Our Bayesian model of ToM (or BToM) expresses the predictive model of belief- and desire-dependent action at the heart of ToM as a partially observable Markov decision process (POMDP), and reconstructs an agent’s joint belief state ...
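A minimal sketch of the inverse-planning idea behind BToM, assuming a softmax-rational action model and a small candidate goal set (both are illustrative stand-ins, not the paper's actual model):

```python
import numpy as np

# Infer a posterior over an agent's desired goal from its observed moves.
GOALS = [(0, 0), (4, 0), (0, 4)]                # candidate desires (assumed)
ACTIONS = [(1, 0), (-1, 0), (0, 1), (0, -1)]    # grid moves

def action_probs(pos, goal, beta=2.0):
    """Softmax-rational policy: moves that reduce Manhattan distance
    to the goal get exponentially more probability mass."""
    dists = [abs(pos[0] + a[0] - goal[0]) + abs(pos[1] + a[1] - goal[1])
             for a in ACTIONS]
    prefs = np.exp(-beta * np.array(dists))
    return prefs / prefs.sum()

def goal_posterior(trajectory):
    """Bayes rule over goals: P(g | a_1..T) ∝ P(g) Π_t P(a_t | s_t, g)."""
    post = np.full(len(GOALS), 1.0 / len(GOALS))  # uniform prior
    for pos, act in trajectory:
        lik = np.array([action_probs(pos, g)[ACTIONS.index(act)]
                        for g in GOALS])
        post *= lik
        post /= post.sum()
    return post

# An agent at (2, 2) steps right twice, then down: belief mass shifts
# toward the goal at (4, 0).
traj = [((2, 2), (1, 0)), ((3, 2), (1, 0)), ((4, 2), (0, -1))]
print(dict(zip(GOALS, goal_posterior(traj).round(3))))
```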
A new policy iteration algorithm for partially observable Markov decision processes is presented that is simpler and more efficient than an earlier policy iteration algorithm of Sondik (1971, 1978). The key simplification is the representation of a policy as a finite-state controller. This representation makes policy evaluation straightforward. The paper's contribution is to show that the dynamic-pr...
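The "policy evaluation is straightforward" claim can be made concrete: for a fixed finite-state controller, the values V(n, s) solve a linear system, one unknown per (controller node, hidden state) pair. A sketch with toy, randomly generated POMDP parameters:

```python
import numpy as np

gamma = 0.95
S, A, Z, N = 2, 2, 2, 2                             # states, actions, obs, nodes
T = np.random.dirichlet(np.ones(S), size=(S, A))    # T[s, a, s']
O = np.random.dirichlet(np.ones(Z), size=(S, A))    # O[s', a, z]
R = np.random.rand(S, A)                            # R[s, a]
act = [0, 1]                    # action chosen at each controller node
succ = [[0, 1], [1, 0]]         # succ[n][z] = next node on observation z

# Build the linear system (I - gamma * P) v = r over the joint index (n, s):
# V(n, s) = R(s, a_n) + gamma * sum_{s', z} T O V(succ[n][z], s')
P = np.zeros((N * S, N * S))
r = np.zeros(N * S)
for n in range(N):
    for s in range(S):
        i = n * S + s
        r[i] = R[s, act[n]]
        for s2 in range(S):
            for z in range(Z):
                j = succ[n][z] * S + s2
                P[i, j] += T[s, act[n], s2] * O[s2, act[n], z]

V = np.linalg.solve(np.eye(N * S) - gamma * P, r)
print(V.reshape(N, S))   # V[n, s]: value of running node n from state s
```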
This paper presents an approach to building plans using partially observable Markov decision processes. The approach begins with a base solution that assumes full observability. The partially observable solution is incrementally constructed by considering increasing amounts of information from observations. The base solution directs the expansion of the plan by providing an evaluation function ...
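The idea of using a fully observable base solution as an evaluation function is reminiscent of the well-known QMDP heuristic; the sketch below shows that heuristic under toy parameters and should not be read as this paper's exact construction:

```python
import numpy as np

def value_iteration(T, R, gamma=0.95, iters=200):
    """Solve the underlying MDP as if the state were fully observable."""
    S, A = R.shape
    Q = np.zeros((S, A))
    for _ in range(iters):
        V = Q.max(axis=1)
        Q = R + gamma * np.einsum('sap,p->sa', T, V)
    return Q

def qmdp_action(belief, Q):
    """Evaluation function: expected Q under the belief, assuming full
    observability from the next step onward; pick the best action."""
    return int(np.argmax(belief @ Q))

T = np.random.dirichlet(np.ones(3), size=(3, 2))   # T[s, a, s'] (toy)
R = np.random.rand(3, 2)                           # R[s, a] (toy)
Q = value_iteration(T, R)
print(qmdp_action(np.array([0.2, 0.5, 0.3]), Q))
```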
Interactive partially observable Markov decision processes (I-POMDPs) provide a formal framework for planning for a self-interested agent in multiagent settings. An agent operating in a multiagent environment must deliberate about the actions that other agents may take and the effect these actions have on the environment and on the rewards it receives. Traditional I-POMDPs model this dependence on ...
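A rough illustration of the intuition underlying this kind of deliberation: fold a predictive model of the other agent into the environment, turning joint dynamics into effective single-agent dynamics. The joint transition, reward, and the model of agent j below are illustrative stand-ins:

```python
import numpy as np

S, Ai, Aj = 3, 2, 2
T_joint = np.random.dirichlet(np.ones(S), size=(S, Ai, Aj))  # T[s, ai, aj, s']
R_joint = np.random.rand(S, Ai, Aj)                          # R[s, ai, aj]
pi_j = np.random.dirichlet(np.ones(Aj), size=S)              # model of agent j

# Marginalize the other agent out: expectation over its predicted action.
T_eff = np.einsum('sijp,sj->sip', T_joint, pi_j)   # T[s, ai, s']
R_eff = np.einsum('sij,sj->si', R_joint, pi_j)     # R[s, ai]

# The agent can now plan with any single-agent solver on T_eff / R_eff.
print(T_eff.sum(axis=-1).round(6))   # each row still sums to 1
```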
Learning the internal representation of partially observable environments has proven to be a difficult problem. State representations which rely on prior models, such as partially observable Markov decision processes (POMDPs), are computationally expensive and sensitive to the accuracy of the underlying model dynamics. Recent work by Still and Bialek offers an information-theoretic approach that compre...
We address ad hoc teamwork, where an agent must coordinate with other agents on an unknown common task without pre-defined coordination. We formalize the ad hoc teamwork problem as a sequential decision problem and propose (i) the use of an online learning approach that weighs the different tasks according to their ability to predict the behavior of the teammate; and (ii) a decision-theoreti...
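One standard way to realize point (i), weighting candidate task models by predictive accuracy, is Bayesian / multiplicative-weights filtering; the TaskModel interface below is a hypothetical stand-in, not the paper's API:

```python
import numpy as np

class TaskModel:
    """Hypothetical per-task model: predicts the teammate's next action."""
    def __init__(self, action_probs):
        self.action_probs = np.asarray(action_probs)
    def predict(self, state):
        return self.action_probs        # P(teammate action | state, task)

def update_weights(weights, models, state, observed_action, eta=1.0):
    """w_k ∝ w_k * P(observed action | task k)^eta  (eta=1 is Bayes rule)."""
    lik = np.array([m.predict(state)[observed_action] for m in models])
    weights = weights * lik ** eta
    return weights / weights.sum()

models = [TaskModel([0.7, 0.2, 0.1]), TaskModel([0.1, 0.1, 0.8])]
w = np.full(len(models), 0.5)
for a in [0, 0, 1, 0]:                  # teammate keeps choosing action 0
    w = update_weights(w, models, state=None, observed_action=a)
print(w)                                 # mass concentrates on task 0
```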
We examine a version of the Cops and Robber (CR) game in which the robber is invisible, i.e., the cops do not know his location until they capture him. Apparently this game (CiR) has received little attention in the CR literature. We examine two variants: in the first the robber is adversarial (he actively tries to avoid capture); in the second he is drunk (he performs a random walk). Our goal ...
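For the drunk robber, the cops' belief over his location evolves by plain random-walk diffusion, conditioned on each failed search; a sketch on a toy path graph (the graph and the cop schedule are illustrative):

```python
import numpy as np

adj = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}   # a path graph on 4 nodes

def walk_step(belief):
    """One random-walk step: the robber moves to a uniform neighbor."""
    new = np.zeros(len(belief))
    for v, p in enumerate(belief):
        for u in adj[v]:
            new[u] += p / len(adj[v])
    return new

def observe_miss(belief, cop_positions):
    """Condition on 'not captured': remove mass at searched vertices."""
    for c in cop_positions:
        belief[c] = 0.0
    return belief / belief.sum()

belief = np.full(4, 0.25)              # robber equally likely anywhere
for cop in [1, 2, 1]:                  # a fixed cop schedule (toy)
    belief = observe_miss(walk_step(belief), [cop])
    print(belief.round(3))
```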
A new translation from Partially Observable MDPs into Fully Observable MDPs is described here. Unlike the classical translation, the resulting problem's state space is finite, enabling MDP solvers to solve this simplified version of the initial partially observable problem: the approach encodes agent beliefs with fuzzy measures over states, leading to an MDP whose state space is a finite set of...
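The classical translation yields a continuous belief-state MDP; one simple way to obtain a finite state space, loosely analogous to the fuzzy-measure encoding described here, is to snap each updated belief onto a fixed grid of belief points. The grid resolution and toy parameters below are assumptions:

```python
import numpy as np
from itertools import product

def belief_update(b, a, z, T, O):
    """Exact Bayes filter: b'(s') ∝ O[s',a,z] * sum_s T[s,a,s'] * b(s)."""
    b2 = O[:, a, z] * (b @ T[:, a, :])
    return b2 / b2.sum()

def belief_grid(n_states, resolution):
    """All beliefs whose components are multiples of 1/resolution."""
    pts = [p for p in product(range(resolution + 1), repeat=n_states)
           if sum(p) == resolution]
    return np.array(pts, dtype=float) / resolution

def snap(b, grid):
    """Map a continuous belief to the nearest grid point (finite state)."""
    return grid[np.argmin(np.linalg.norm(grid - b, axis=1))]

T = np.random.dirichlet(np.ones(2), size=(2, 2))   # T[s, a, s'] (toy)
O = np.random.dirichlet(np.ones(2), size=(2, 2))   # O[s', a, z] (toy)
grid = belief_grid(n_states=2, resolution=10)      # 11 finite belief states
b = snap(belief_update(np.array([0.5, 0.5]), a=0, z=1, T=T, O=O), grid)
print(b)
```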
Partially Observable Markov Decision Processes (POMDPs) are powerful models for planning under uncertainty in partially observable domains. However, computing optimal solutions for POMDPs is challenging because of the high computational requirements of POMDP solution algorithms. Several algorithms use a subroutine to prune dominated vectors in value functions, which requires a large number of l...
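The pruning subroutine referred to here is typically a Lark-style linear program: keep a vector only if some belief prefers it to every other vector by a positive margin. A minimal sketch with toy vectors:

```python
import numpy as np
from scipy.optimize import linprog

def is_dominated(w, others, eps=1e-9):
    """Solve: max d  s.t.  b·w >= b·u + d for all u, with b in the simplex.
    If the best margin d is <= 0, no belief prefers w: it can be pruned."""
    n = len(w)
    # Variables x = [b_1..b_n, d]; linprog minimizes, so the objective is -d.
    c = np.zeros(n + 1); c[-1] = -1.0
    A_ub = np.array([np.append(u - w, 1.0) for u in others])  # (u-w)·b + d <= 0
    b_ub = np.zeros(len(others))
    A_eq = np.array([np.append(np.ones(n), 0.0)])             # sum(b) == 1
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=[1.0],
                  bounds=[(0, 1)] * n + [(None, None)])
    return res.fun is None or -res.fun <= eps   # max margin d <= 0

vectors = [np.array([1.0, 0.0]), np.array([0.0, 1.0]), np.array([0.4, 0.4])]
for i, v in enumerate(vectors):
    rest = [u for j, u in enumerate(vectors) if j != i]
    print(i, "dominated" if is_dominated(v, rest) else "kept")
# The vector [0.4, 0.4] lies below the upper surface and is pruned.
```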