Stability and Convergence of Stochastic Approximation using the O.D.E. Method - Decision and Control, 1998. Proceedings of the 37th IEEE Conference on

نویسنده

  • V. S. Borkar
چکیده

It is shown here that stability of the stochastic approximation algorithm is implied by the asymptotic stability of the origin for an associated 0.d.e. This in turn implies convergence of the algorithm. Several specific classes of algorithms are considered as applications. It is found that the results provide (i) a simpler derivation of known results for reinforcement learning algorithms; (ii) a proof for the first time that a class of asynchronous stochastic approximation algorithms are convergent without using any a priori assumption of stability. (iii) a proof for the first time that asynchronous adaptive critic and &-learning algorithms are convergent for the average cost optimal control problem.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

APPROXIMATION OF STOCHASTIC PARABOLIC DIFFERENTIAL EQUATIONS WITH TWO DIFFERENT FINITE DIFFERENCE SCHEMES

We focus on the use of two stable and accurate explicit finite difference schemes in order to approximate the solution of stochastic partial differential equations of It¨o type, in particular, parabolic equations. The main properties of these deterministic difference methods, i.e., convergence, consistency, and stability, are separately developed for the stochastic cases.

متن کامل

Minimax-Based Reinforcement Learning with State Aggregation - Decision and Control, 1998. Proceedings of the 37th IEEE Conference on

One of the most important issues in scaling up reinforcement learning for practical problems is how to represent and store cost-to-go functions with more compact representations than lookup tables . In this paper , we address the issue of combining the simple function approximation method-state aggregation with minimaxbased reinforcement learning algorithms and present the convergence theory fo...

متن کامل

The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning

It is shown here that stability of the stochastic approximation algorithm is implied by the asymptotic stability of the origin for an associated ODE. This in turn implies convergence of the algorithm. Several specific classes of algorithms are considered as applications. It is found that the results provide (i) a simpler derivation of known results for reinforcement learning algorithms; (ii) a ...

متن کامل

Approximation of stochastic advection diffusion equations with finite difference scheme

In this paper, a high-order and conditionally stable stochastic difference scheme is proposed for the numerical solution of $rm Ithat{o}$ stochastic advection diffusion equation with one dimensional white noise process. We applied a finite difference approximation of fourth-order for discretizing space spatial derivative of this equation. The main properties of deterministic difference schemes,...

متن کامل

Stability and sensitivity analysis of periodic orbits in Tapping Mode Atomic Force microscopy - Decision and Control, 1998. Proceedings of the 37th IEEE Conference on

In this paper, the most widely used mode of atomic force microscopy imaging where the cantilever is oscillated at its resonant frequency is studied. It is shown that the amplitude and the sine of the phase of the orbit vary linearly with respect to the cantilever-sample distance. Experiments conducted on a silicon cantilever agree with the theory developed.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004