markovian decision process

نتایج جستجو برای: markovian decision process

تعداد نتایج: 1587387 فیلتر نتایج به سال:

Fahiem Bacchus

1999

Fahiem Bacchus Craig Boutilier Adam Grove

Markov decision processes (MDPs) are a very popular tool for decision theoretic planning (DTP), partly because of the welldeveloped, expressive theory that includes effective solution techniques. But the Markov assumption-that dynamics and rewards depend on the current state only, and not on historyis often inappropriate. This is especially true of rewards: we frequently wish to associate rewar...

متن کامل

Developing a new model for availability optimization applied to a series-parallel system (Quality Engineering Conference Paper)

Journal: International Journal of Industrial Engineering and Productional Research- 2013

Abdolhamid Eshraghnia Jahromi, Ali Yahyatabar Arabi, Mohammad Shabannataj,

Redundancy technique is known as a way to enhance the reliability and availability of non-reparable systems, but for repairable systems, another factor is getting prominent called as the number of maintenance resources. In this study, availability optimization of series-parallel systems is modelled by using Markovian process by which the number of maintenance resources is located into the obje...

متن کامل

On using discretized Cohen-Grossberg node dynamics for model-free actor-critic neural learning in non-Markovian domains

2003

Eiji Mizutani Stuart E. Dreyfus

We describe how multi-stage non-Markovian decision problems can be solved using actor-critic reinforcement learning by assuming that a discrete version of CohenGrossberg node dynamics describes the node-activation computations of a neural network (NN). Our NN (i.e., agent) is capable of rendering the process Markovian implicitly and automatically in a totally model-free fashion without learning...

متن کامل

Deriving Symbolic Representations from Stochastic Process Algebras

2002

Matthias Kuntz Markus Siegle

A new denotational semantics for a variant of the stochastic process algebra TIPP is presented, which maps process terms to Multiterminal binary decision diagrams. It is shown that the new semantics is Markovian bisimulation equivalent to the standard SOS semantics. The paper also addresses the difficult question of keeping the underlying state space minimal at every construction step.

متن کامل

A New Markov Chain Based Acceptance Sampling Policy via the Minimum Angle Method

Journal: Iranian Journal of Operations Research 2012

Akhavan Niaki, Fallah Nezhad,

We develop an optimization model based on Markovian approach to determine the optimum value of thresholds in a proposed acceptance sampling design. Consider an acceptance sampling plan where items are inspected and when the number of conforming items between successive defective items falls below a lower control threshold value, then the batch is rejected, and if it falls above a control thresh...

متن کامل

Duality theorem in Markovian decision problems

Journal: :Journal of Mathematical Analysis and Applications 1975

متن کامل

Incomplete Information in Markovian Decision Models

Journal: :The Annals of Statistics 1974

متن کامل

Routing problems and Markovian decision processes

Journal: :Journal of Mathematical Analysis and Applications 1985

متن کامل

Non-Discounted Denumerable Markovian Decision Models

Journal: :The Annals of Mathematical Statistics 1968

متن کامل

Description and Acquirement of Macro-Actions in Reinforcement Learning

2004

Takeshi Yoshikawa Yuki Kanazawa Masahito Kurihara

Reinforcement learning is a framing of enabling agents to learn from interaction with environments. It has focused generally on Markov decision process (MDP) domains, but a domain may be non-Markovian in the real world. In this paper, we develop a new description of macro-actions for non-Markov decision process (NMDP) domains in reinforcement learning. A macro-action is an action control struct...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید