Certainty equivalence control with forcing: revisited
نویسندگان
چکیده
Certainty equivalence control with forcing has been shown to be optimal for several stochastic adaptive control problems with the average cost per unit time criterion. Recently researchers have started looking at stochastic adaptive control problems with a view to minimizing the rate of increase of the learning loss. This criterion is stronger than the average cost per unit time criterion. Certainty equivalence control with forcing does not usually suffice for the learning loss criterion and one has to develop fairly complicated schemes in order to achieve optimality. The objective of this paper is to see how well one might be able to do with a certainty-equivalence-control-with-forcing type of scheme. In particular we construct a class of such schemes whose learning loss is O((log n) 1+8) for 8 > 0, whereas optimal schemes typically have a O(log n) learning loss.
منابع مشابه
Linear Systems with Actuator Nonlinearities
This paper applies nonlinear 3t, control techniques to linear systems with actuator nonlinearities. In [3], this was treated under the certainty equivalence assumption. The control design produces a finite dimensional measurement feedback controller which is computable online provided certain conditions are met. In this paper, we do not use certainty equivalence theory. Instead, we use the more...
متن کاملBackstepping-Based Adaptive PID Control
This paper addresses analysis and design issues in adaptive PID control for linear second order minimal phase processes using the backstepping algorithm. The first step consists in adding an integral action to the basic backstepping algorithm to obtain a zero static error. An integrator is therefore added to the plant model and is then slid back to the controller equation at the end of the desi...
متن کاملCertainty equivalence implies detectability 1
It is shown that any stabilizing, certainty equivalence control used within an adaptive control system, causes the familiar interconnection of a controlled process and associated output estimator to be detectable through the estimator's output error ep, for every frozen value of the index or parameter vector p upon which both the estimator and controller dynamics depend. The fact that certainty...
متن کاملMin-Max Certainty Equivalence Principle and Differential Games
This paper presents a version of the Certainty Equivalence Principle, usable for nonlinear, variable end-time, partial observation Zero-Sum Differential Games, which states that under the unicity of the solution to the auxiliary problem, optimal controllers can be derived from the solution of the related perfect observation game. An example is provided where in one region, the new extended resu...
متن کاملCertainty Equivalence Implies Detectability
It is shown that any stabilizing, certainty equivalence control used within an adaptive control system, causes the familiar interconnection of a controlled process and associated output estimator to be detectable through the estimator's output error e p , for every frozen value of the index or parameter vector p upon which both the estimator and controller dynamics depend. The fact that certain...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1989