random modified sp iteration

نتایج جستجو برای: random modified sp iteration

تعداد نتایج: 683930 فیلتر نتایج به سال:

The geometric rate of convergence of random iteration in the Hutchinson distance

Journal: :Aequationes mathematicae 2018

متن کامل

Q-learning and policy iteration algorithms for stochastic shortest path problems

Journal: :Annals OR 2013

Huizhen Yu Dimitri P. Bertsekas

We consider the stochastic shortest path problem, a classical finite-state Markovian decision problem with a termination state, and we propose new convergent Q-learning algorithms that combine elements of policy iteration and classical Q-learning/value iteration. These algorithms are related to the ones introduced by the authors for discounted problems in Bertsekas and Yu (Math. Oper. Res. 37(1...

متن کامل

Solving large test-day models by iteration on data and preconditioned conjugate gradient.

Journal: :Journal of dairy science 1999

M Lidauer I Strandén E A Mäntysaari J Pösö A Kettunen

A preconditioned conjugate gradient method was implemented into an iteration on a program for data estimation of breeding values, and its convergence characteristics were studied. An algorithm was used as a reference in which one fixed effect was solved by Gauss-Seidel method, and other effects were solved by a second-order Jacobi method. Implementation of the preconditioned conjugate gradient ...

متن کامل

An adaptation of the Newton iteration method to solve symmetric positive definite Toeplitz systems

2002

Marc Van Barel Gianni Codevico

The classical Newton iteration method for matrices can be modified into an efficient algorithm when structured matrices are involved. The difficulty, however, is the importance of the choice of the starting matrix. In this paper, we propose a new initial iteration step which makes the choice of the starting matrix less critical. The validity of the approach is illustrated by numerical experiments.

متن کامل

Random Logistic Maps and Lyapunov Exponents

2000

DAVID STEINSALTZ

We prove that under certain basic regularity conditions, a random iteration of logistic maps converges to a random point attractor when the Lyapunov exponent is negative, and does not converge to a point when the Lyapunov exponent is positive.

متن کامل

Polynomial Value Iteration Algorithms for Detrerminstic MDPs

2002

Omid Madani

Value iteration is a commonly used and em pirically competitive method in solving many Markov decision process problems. However, it is known that value iteration has only pseudo polynomial complexity in general. We estab lish a somewhat surprising polynomial bound for value iteration on deterministic Markov decision (DMDP) problems. We show that the basic value iteration procedure converges...

متن کامل

Polynomial Value Iteration Algorithms for Deterministic MDPs

2002

Omid Madani

Value iteration is a commonly used and empirically competitive method in solving many Markov decision process problems. However, it is known that value iteration has only pseudopolynomial complexity in general. We establish a somewhat surprising polynomial bound for value iteration on deterministic Markov decision (DMDP) problems. We show that the basic value iteration procedure converges to th...

متن کامل

Accelerating of Modified Policy Iteration in Probabilistic Model Checking

2016

Mohammadsadegh Mohagheghi

Markov Decision Processes (MDPs) are used to model both non-deterministic and probabilistic systems. Probabilistic model checking is an approach for verifying quantitative properties of probabilistic systems that are modeled by MDPs. Value and Policy Iteration and modified version of them are well-known approaches for computing a wide range of probabilistic properties. This paper tries to impro...

متن کامل

Approximation of Fixed Points of Asymptotically Demicontractive Mappings in Arbitrary Banach Spaces

2002

D. I. IGBOKWE D. I. Igbokwe

Let E be a real Banach Space and K a nonempty closed convex (not necessarily bounded) subset of E. Iterative methods for the approximation of fixed points of asymptotically demicontractive mappings T : K → K are constructed using the more general modified Mann and Ishikawa iteration methods with errors. Our results show that a recent result of Osilike [3] (which is itself a generalization of a ...

متن کامل

Factored Value Iteration Converges

Journal: :Acta Cybern. 2008

István Szita András Lörincz

In this paper we propose a novel algorithm, factored value iteration (FVI), for the approximate solution of factored Markov decision processes (fMDPs). The traditional approximate value iteration algorithm is modified in two ways. For one, the least-squares projection operator is modified so that it does not increase max-norm, and thus preserves convergence. The other modification is that we un...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید