A novel policy iteration based deterministic Q-learning for discrete-time nonlinear systems
Related resources
Fault Detection for Nonlinear Discrete-Time Systems via Deterministic Learning
Recently, an approach for the rapid detection of small oscillation faults based on deterministic learning theory was proposed for continuous-time systems. In this paper, a fault detection scheme is proposed for a class of nonlinear discrete-time systems via deterministic learning. By using a discrete-time extension of the deterministic learning algorithm, the general fault functions (i.e., the inte...
Integral Q-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems
This paper proposes an integral Q-learning for continuous-time (CT) linear time-invariant (LTI) systems, which solves a linear quadratic regulation (LQR) problem in real time for a given system and a value function, without knowledge about the system dynamics A and B. Here, Q-learning is referred to as a family of reinforcement learning methods which find the optimal policy by interaction with ...
A Robust Control Design Technique for Discrete-Time Systems
A robust state feedback design subject to placement of the closed loop eigenvalues in a prescribed region of the unit circle is presented. Quantitative measures of robustness and disturbance rejection are investigated. A stochastic optimization algorithm is used to effect a trade-off between the free design parameters and to accomplish all the design criteria. A numerical example is given to illustra...
Preference-Based Policy Iteration: Leveraging Preference Learning for Reinforcement Learning
This paper makes a first step toward the integration of two subfields of machine learning, namely preference learning and reinforcement learning (RL). An important motivation for a “preference-based” approach to reinforcement learning is a possible extension of the type of feedback an agent may learn from. In particular, while conventional RL methods are essentially confined to deal with numeri...
Q-learning-based optimal digital feedback control with computation time delay of linear discrete-time systems
In embedded computers, there are delays due to computation time. Unless they are considered, a controlled system may be unstable. If the system is unknown, Q-learning-based optimal control is one of the useful approaches. Applying it to a system, we can obtain the optimal feedback gain for the unknown system. In this paper, we propose Q-learning-based optimal feedback control taking the delay in...
Journal
Journal title: Science China Information Sciences
Year: 2015
ISSN: 1674-733X,1869-1919
DOI: 10.1007/s11432-015-5462-z