Reinforcement Learning for Adaptive Optimal Stationary Control of Linear Stochastic Systems

نویسندگان

چکیده

This article studies the adaptive optimal stationary control of continuous-time linear stochastic systems with both additive and multiplicative noises, using reinforcement learning techniques. Based on policy iteration, a novel off-policy algorithm, named optimistic least-squares-based is proposed, which able to find iteratively near-optimal policies problem directly from input/state data without explicitly identifying any system matrices, starting an initial admissible policy. The solutions given by proposed iteration are proved converge small neighborhood solution probability one, under mild conditions. application algorithm triple inverted pendulum example validates its feasibility effectiveness.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reinforcement learning based computational adaptive optimal control and system identification for linear systems

The duality of estimation and control problems is a wellknown fact in control theory literature. Parameter convergence and closed loop stability are usually of paramount interest for a given adaptive control scheme. However, the controller thus designed doesn’t guarantee any performance other than ensuring closed loop stability and signal boundedness. As far as system identification goes, this ...

متن کامل

Optimal Adaptive Control of Uncertain Stochastic Discrete Linear Systems

The problem of optimal control of stochastic discrete linear time-invariant uncertain systems on finite time interval is formulated and partially solved. This optimal solution shows that previously published adaptive optimal control schemes and indirect adaptive control schemes do not need heuristics for their rationalization. It is shown that these schemes are suboptimal causal approximations ...

متن کامل

Reinforcement Learning Based PID Control of Wind Energy Conversion Systems

In this paper an adaptive PID controller for Wind Energy Conversion Systems (WECS) has been developed. Theadaptation technique applied to this controller is based on Reinforcement Learning (RL) theory. Nonlinearcharacteristics of wind variations as plant input, wind turbine structure and generator operational behaviordemand for high quality adaptive controller to ensure both robust stability an...

متن کامل

Path Integral Stochastic Optimal Control for Reinforcement Learning

Path integral stochastic optimal control based learning methods are among the most efficient and scalable reinforcement learning algorithms. In this work, we present a variation of this idea in which the optimal control policy is approximated through linear regression. This connection allows the use of well-developed linear regression algorithms for learning of the optimal policy, e.g. learning...

متن کامل

Optimal Finite-time Control of Positive Linear Discrete-time Systems

This paper considers solving optimization problem for linear discrete time systems such that closed-loop discrete-time system is positive (i.e., all of its state variables have non-negative values) and also finite-time stable. For this purpose, by considering a quadratic cost function, an optimal controller is designed such that in addition to minimizing the cost function, the positivity proper...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Automatic Control

سال: 2023

ISSN: ['0018-9286', '1558-2523', '2334-3303']

DOI: https://doi.org/10.1109/tac.2022.3172250