Reinforcement Learning for Adaptive Optimal Stationary Control of Linear Stochastic Systems
نویسندگان
چکیده
This article studies the adaptive optimal stationary control of continuous-time linear stochastic systems with both additive and multiplicative noises, using reinforcement learning techniques. Based on policy iteration, a novel off-policy algorithm, named optimistic least-squares-based is proposed, which able to find iteratively near-optimal policies problem directly from input/state data without explicitly identifying any system matrices, starting an initial admissible policy. The solutions given by proposed iteration are proved converge small neighborhood solution probability one, under mild conditions. application algorithm triple inverted pendulum example validates its feasibility effectiveness.
منابع مشابه
Reinforcement learning based computational adaptive optimal control and system identification for linear systems
The duality of estimation and control problems is a wellknown fact in control theory literature. Parameter convergence and closed loop stability are usually of paramount interest for a given adaptive control scheme. However, the controller thus designed doesn’t guarantee any performance other than ensuring closed loop stability and signal boundedness. As far as system identification goes, this ...
متن کاملOptimal Adaptive Control of Uncertain Stochastic Discrete Linear Systems
The problem of optimal control of stochastic discrete linear time-invariant uncertain systems on finite time interval is formulated and partially solved. This optimal solution shows that previously published adaptive optimal control schemes and indirect adaptive control schemes do not need heuristics for their rationalization. It is shown that these schemes are suboptimal causal approximations ...
متن کاملReinforcement Learning Based PID Control of Wind Energy Conversion Systems
In this paper an adaptive PID controller for Wind Energy Conversion Systems (WECS) has been developed. Theadaptation technique applied to this controller is based on Reinforcement Learning (RL) theory. Nonlinearcharacteristics of wind variations as plant input, wind turbine structure and generator operational behaviordemand for high quality adaptive controller to ensure both robust stability an...
متن کاملPath Integral Stochastic Optimal Control for Reinforcement Learning
Path integral stochastic optimal control based learning methods are among the most efficient and scalable reinforcement learning algorithms. In this work, we present a variation of this idea in which the optimal control policy is approximated through linear regression. This connection allows the use of well-developed linear regression algorithms for learning of the optimal policy, e.g. learning...
متن کاملOptimal Finite-time Control of Positive Linear Discrete-time Systems
This paper considers solving optimization problem for linear discrete time systems such that closed-loop discrete-time system is positive (i.e., all of its state variables have non-negative values) and also finite-time stable. For this purpose, by considering a quadratic cost function, an optimal controller is designed such that in addition to minimizing the cost function, the positivity proper...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Automatic Control
سال: 2023
ISSN: ['0018-9286', '1558-2523', '2334-3303']
DOI: https://doi.org/10.1109/tac.2022.3172250