markov reward models

نتایج جستجو برای: markov reward models

تعداد نتایج: 981365 فیلتر نتایج به سال:

Simulation - Based Optimization of Markov

1998

Peter Marbach

We propose a simulation-based algorithm for optimizing the average reward in a Markov Reward Process that depends on a set of parameters. As a special case, the method applies to Markov Decision Processes where optimization takes place within a parametrized set of policies. The algorithm involves the simulation of a single sample path, and can be implemented on-line. A convergence result (with ...

متن کامل

Modeling Neuronal Interactivity using Dynamic Bayesian Networks

2005

Lei Zhang Dimitris Samaras Nelly Alia-Klein Nora D. Volkow Rita Z. Goldstein

Functional Magnetic Resonance Imaging (fMRI) has enabled scientists to look into the active brain. However, interactivity between functional brain regions, is still little studied. In this paper, we contribute a novel framework for modeling the interactions between multiple active brain regions, using Dynamic Bayesian Networks (DBNs) as generative models for brain activation patterns. This fram...

متن کامل

Combinations and Mixtures of Optimal Policies in Unichain Markov Decision Processes are Optimal

Journal: :CoRR 2005

Ronald Ortner

We show that combinations of optimal (stationary) policies in unichain Markov decision processes are optimal. That is, let M be a unichain Markov decision process with state space S, action space A and policies π◦ j : S → A (1 ≤ j ≤ n) with optimal average infinite horizon reward. Then any combination π of these policies, where for each state i ∈ S there is a j such that π(i) = π◦ j (i), is opt...

متن کامل

Bounded Parameter Markov Decision Processes with Average Reward Criterion

2007

Ambuj Tewari Peter L. Bartlett

Bounded parameter Markov Decision Processes (BMDPs) address the issue of dealing with uncertainty in the parameters of a Markov Decision Process (MDP). Unlike the case of an MDP, the notion of an optimal policy for a BMDP is not entirely straightforward. We consider two notions of optimality based on optimistic and pessimistic criteria. These have been analyzed for discounted BMDPs. Here we pro...

متن کامل

Strong, Weak and Branching Bisimulation for Transition Systems and Markov Reward Chains: A Unifying Matrix Approach

2009

Nikola Trcka

We first study labeled transition systems with explicit successful termination. We establish the notions of strong, weak, and branching bisimulation in terms of boolean matrix theory, introducing thus a novel and powerful algebraic apparatus. Next we consider Markov reward chains which are standardly presented in real matrix theory. By interpreting the obtained matrix conditions for bisimulatio...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید

Simulation - Based Optimization of Markov

Modeling Neuronal Interactivity using Dynamic Bayesian Networks

Combinations and Mixtures of Optimal Policies in Unichain Markov Decision Processes are Optimal

Bounded Parameter Markov Decision Processes with Average Reward Criterion

Strong, Weak and Branching Bisimulation for Transition Systems and Markov Reward Chains: A Unifying Matrix Approach

Ancestral graph Markov models

Logical Hidden Markov Models

Scoring hidden Markov models

Hidden semi-Markov models

Profile hidden Markov models