markov decision process graph theory

Search results for: markov decision process graph theory

Number of results: 2385831

Reasoning under Uncertainty in Social Reputation Systems: The Advisor-POMDP

2005

Kevin Regan

This paper examines approaches to representing uncertainty in reputation systems for electronic markets with the aim of constructing a decision theoretic framework for collecting information about selling agents and making purchase decisions in the context of a social reputation system. A selection of approaches to representing reputation using Dempster-Shafer Theory and Bayesian probability a...


Logical Markov Decision Programs

2003

Kristian Kersting Luc De Raedt

Motivated by the interest in relational reinforcement learning, we introduce a novel representation formalism, called logical Markov decision programs (LOMDPs), that integrates Markov Decision Processes with Logic Programs. Using LOMDPs one can compactly and declaratively represent complex relational Markov decision processes. Within this framework we then develop a theory of reinforcement lear...


A simulation-based learning automata framework for solving semi-Markov decision problems under long-run average reward

2008

Abhijit Gosavi Sudeep Sarkar

Many problems of sequential decision making under uncertainty, whose underlying probabilistic structure has a Markov chain, can be set up as Markov Decision Problems (MDPs). However, when their underlying transition mechanism cannot be characterized by the Markov chain alone, the problems may be set up as Semi-Markov Decision Problems (SMDPs). The framework of dynamic programming has been used ...


Learning Direction Theory and the Winners

2001

Reinhard Selten Klaus Abbink Ricarda Cox

We report an experiment on a decision task by Samuelson and Bazerman (1985). Subjects submit a bid for an item with an unknown value. A winner’s curse phenomenon arises when subjects bid too high and make losses. Learning direction theory can account for this. However, other influences on behaviour can also be identified. We introduce impulse balance theory to make quantitative predictions on t...


Cascade Markov Decision Processes: Theory and Applications

Journal: CoRR 2015

Manish Gupta

This paper considers the optimal control of time-varying continuous-time Markov chains whose transition rates are themselves Markov processes. In one set of problems the solution of an ordinary differential equation is shown to determine the optimal performance and feedback controls, while some other cases are shown to lead to singular optimal control problems which are more difficult to solve....


A Reduction Approach to Decision Procedures

2005

Deepak Kapur Calogero G. Zarba

We present an approach for designing decision procedures based on the reduction of complex theories to simpler ones. Specifically, we define reduction functions as a tool for reducing the satisfiability problem of a complex theory to the satisfiability problem of a simpler one. Reduction functions allow us to reduce the theory of lists to the theory of constructors, the theory of arrays to the ...


A posteriori multi-stage optimal trading under transaction costs and a diversification constraint

Journal: CoRR 2017

Mogens Graf Plessen Alberto Bemporad

This paper presents a method for the evaluation of a posteriori (historical) multi-variate multi-stage optimal trading under transaction costs and a diversification constraint. Starting from a given amount of money in some currency, we analyze the stage-wise optimal allocation over a time horizon with potential investments in multiple currencies and various assets, such as, for example, assets ...


Graph-Grammar Assistance for Automated Generation of Influence Diagrams

1993

John W. Egar Mark A. Musen

One of the most difficult aspects of modeling complex dilemmas in decision-analytic terms is composing a diagram of relevance relations from a set of domain concepts. Decision models in domains such as medicine, however, exhibit certain prototypical patterns that can guide the modeling process. Medical concepts can be classified according to semantic types that have characteristic positio...


Implementing Access Control Markov Decision Processes with GLPK/GMPL

Journal: CoRR 2012

Charles Morisset

In a recent approach, we proposed to model an access control mechanism as a Markov Decision Process, thus claiming that in order to make an access control decision, one can use well-defined mechanisms from decision theory. We present in this paper an implementation of such a mechanism, using the open-source solver GLPK, and we model the problem in the GMPL language. We illustrate our approach wit...


A POMDP Framework to Find Optimal Inspection and Maintenance Policies via Availability and Profit Maximization for Manufacturing Systems

Journal: International Journal of Engineering 2018

M. H. Abooie M. S. Fallah Nezhad R. Ghandali

Maintenance can either increase or decrease a system's availability, so it is valuable to evaluate a maintenance policy from the cost and availability points of view simultaneously, according to the decision maker's priorities. This study proposes a Partially Observable Markov Decision Process (POMDP) framework for a partially observable and stochastically deteriorating syste...

