markovian decision process

Search results for: markovian decision process

Number of results: 1587387

MARKOVIAN DECISION PROCESSES WITH RECURSIVE REWARD FUNCTIONS

Journal: Bulletin of Mathematical Statistics 1973

Markovian and Non-Markovian Protein Sequence Evolution: Aggregated Markov Process Models

Journal: Journal of Molecular Biology 2011

Human learning in non-Markovian decision making

2013

Aaron Clarke Johannes Friedrich Walter Senn Elisa Tartaglia Silvia Marchesotti Michael H. Herzog

Humans can learn under a wide variety of feedback conditions. Particularly important types of learning fall under the category of reinforcement learning (RL) where a series of decisions must be made and a sparse feedback signal is obtained. Computational and behavioral studies of RL have focused mainly on Markovian decision processes (MDPs), where the next state and reward depend only on the c...

Generalized polynomial approximations in Markovian decision processes

Journal: Journal of Mathematical Analysis and Applications 1985

Human and Machine Learning in Non-Markovian Decision Making

2015

Aaron Michael Clarke Johannes Friedrich Elisa M. Tartaglia Silvia Marchesotti Walter Senn Michael H. Herzog

Humans can learn under a wide variety of feedback conditions. Reinforcement learning (RL), where a series of rewarded decisions must be made, is a particularly important type of learning. Computational and behavioral studies of RL have focused mainly on Markovian decision processes, where the next state depends on only the current state and action. Little is known about non-Markovian decision m...

A Partially Observable Markovian Maintenance Process with Continuous Cost Functions

Journal: International Journal of Engineering 1988

M.B. Aryanezhad

In this paper a two-state Markovian maintenance process where the true state is unknown will be considered. The operating cost per period is a continuous random variable which depends on the state of the process. If investigation cost is incurred at the beginning of any period, the system will be returned to the "in-control" state instantaneously. This problem is solved using the average crite...

Reduction semantics in Markovian process algebra

Journal: Journal of Logical and Algebraic Methods in Programming 2018

Recurrent Neural State Estimation in Domains with Long-Term Dependencies

2012

Siegmund Düll Lina Weichbrodt Alexander Hans Steffen Udluft

This paper presents a state estimation approach for reinforcement learning (RL) of a partially observable Markov decision process. It is based on a special recurrent neural network architecture, the Markov decision process extraction network with shortcuts (MPEN-S). In contrast to previous work regarding this topic, we address the problem of long-term dependencies, which cause major problems in...

Probabilistic Branching in Markovian Process Algebras

Journal: The Computer Journal 1995

On the Expressiveness of Markovian Process Calculi with Durational and Durationless Actions

2010

Marco Bernardo

Several Markovian process calculi have been proposed in the literature, which differ from each other in various aspects. With regard to the action representation, we distinguish between integrated-time Markovian process calculi, in which every action has an exponentially distributed duration associated with it, and orthogonal-time Markovian process calculi, in which action execution is separat...
