eece 574 - adaptive control - overview - phoneoximeter · eece 574 - adaptive control overview guy...

49
EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia Lectures: Thursday 09h30-12h00 Location: MCML 158 Guy Dumont (UBC) EECE 574 Overview 1 / 49

Upload: lyquynh

Post on 15-Jul-2018

235 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

EECE 574 - Adaptive ControlOverview

Guy Dumont

Department of Electrical and Computer EngineeringUniversity of British Columbia

Lectures: Thursday 09h30-12h00Location: MCML 158

Guy Dumont (UBC) EECE 574 Overview 1 / 49

Page 2: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Introduction Objectives

Objectives

Course Objective:

To give an overview of the theory and practice of themainstream adaptive control techniques

Four assignments: 15% eachProject: 40%Textbook:

K.J. Åström and B. Wittenmark, Adaptive Control, Addison-WesleyPublishing Co., Inc., Reading, Massachusetts, 1995. (This book isout of print, but is downloadable from the internet)

Guy Dumont (UBC) EECE 574 Overview 2 / 49

Page 3: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Introduction Books

Related Books

N. Hovakimyan, C. Cao, L1 Adaptive Control theory, SIAM Press, Philadelphia, 2010.

P. Ioannou and B. Fidan, Adaptive Control Tutorial, SIAM Press, Philadelphia, 2006.

V. Bobal, J. Bohm, J. Fessl and J. Macacek, Digital Self-Tuning Controllers, Springer-Verlag, Berlin, 2005.

Landau, Lozano and M’Saad, Adaptive Control, Springer-Verlag, Berlin, 1998.

Isermann, Lachmann and Matko, Adaptive Control Systems, Prentice-Hall, Englewood Cliffs, NJ, 1992.

Wellstead and Zarrop, Self-Tuning Systems Control and Signal Processing, J. Wiley and Sons, NY, 1991.

Bitmead, Gevers and Wertz, Adaptive Optimal Control, Prentice-Hall, Englewood Cliffs, NJ, 1990.

Goodwin and Sin, Adaptive Filtering, Prediction, and Control, Prentice-Hall, Englewood Cliffs, NJ, 1984.

Ljung, and Söderström, Theory and Practice of Recursive Identification, MIT Press, Cambridge, MA, 1983.

Guy Dumont (UBC) EECE 574 Overview 3 / 49

Page 4: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Introduction Books

Course Outline

1 Introduction2 Identification3 Control Design4 Self-Tuning Control5 Model-Reference Adaptive Control6 Properties of Adaptive Controllers7 Auto-Tuning and Gain Scheduling8 Implementation and Practical Considerations9 Extensions

Guy Dumont (UBC) EECE 574 Overview 4 / 49

Page 5: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Background Definitions

What Is Adaptive Control?

According to the Webster’s dictionary, to adapt means:to adjust oneself to particular conditionsto bring oneself in harmony with a particular environmentto bring one’s acts, behaviour in harmony with a particularenvironment

According to the Webster’s dictionary, adaptation means:adjustment to environmental conditionsalteration or change in form or structure to better fit the environment

Guy Dumont (UBC) EECE 574 Overview 5 / 49

Page 6: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Background Definitions

When is a Controller Adaptive?

Linear feedback can cope with parameter changes (within somelimits)According to G. Zames1:

A non-adaptive controller is based solely on a-priori informationAn adaptive controller is based on a posteriori information as well

135th CDC, Kobe, Dec 1996Guy Dumont (UBC) EECE 574 Overview 6 / 49

Page 7: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Background Definitions

A Narrow Definition of Adaptive Control

An adaptive controller is a fixed-structure controller withadjustable parameters and a mechanism for automaticallyadjusting those parametersIn this sense, an adaptive controller is one way of dealing withparametric uncertaintyAdaptive control theory essentially deals with finding parameterajustment algorithms that guarantee global stability andconvergence

Guy Dumont (UBC) EECE 574 Overview 7 / 49

Page 8: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Background Definitions

Why Use Adaptive Control?

Control of systems with time-varying dynamicsIf dynamics change with operating conditions in a known,predictable fashion, use gain schedulingIf the use of a fixed controller cannot achieve a satisfactorycompromise between robustness and performance, then andonly then, should adaptive control be used

Use the simplest technique that meets the specifications 2

2. . . or as A. Einstein apparently once said: “make things as simple aspossible, but no simpler”

Guy Dumont (UBC) EECE 574 Overview 8 / 49

Page 9: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Background Process Variations

Feedback and Process Variations

Consider the feedback loop:

ysp u y

+-

C P

Controller Process

The closed-loop transfer function is

T =PC

1 + PC

Differentiating T with respect to P:dT

T=

1

1 + PC

dP

P= S

dP

P

T and S are respectively known as the complementary sensitivity and the sensitivity functions. Note that

S + T = 1

Guy Dumont (UBC) EECE 574 Overview 9 / 49

Page 10: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Background Process Variations

Feedback and Process Variations

The closed-loop transfer function is NOT sensitive to processvariations at those frequencies where the loop transfer functionL = PC is largeGenerally L >> 1 at low frequencies, and L << 1 at highfrequenciesHowever, L >> 1 can only be achieved in a limited bandwidth,particularly when unstable zeros are present

Guy Dumont (UBC) EECE 574 Overview 10 / 49

Page 11: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Background Process Variations

Judging the Severity of Process Variations

Difficult to judge impact of process variations on closed-loopbehaviour from open-loop time responses

Significant changes in open-loop responses may have little effecton closed-loop responseSmall changes in open-loop responses may have significant effecton closed-loop response

Effect depends on the desired closed-loop bandwidthBetter to use frequency responses

Guy Dumont (UBC) EECE 574 Overview 11 / 49

Page 12: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Background Example 1

Effect of Process Variations

Consider the system given by

G(s) =1

(s + 1)(s + a)Open loop step responses for a = −0.01, 0, 0.01:

0 50 100 1500

50

100

150

200

250

300

350

Step Response

Time (sec)

Ampl

itude

a=−0.01a=0a=0.01

Figure: Open-loop responses

Guy Dumont (UBC) EECE 574 Overview 12 / 49

Page 13: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Background Example 1

Effect of Process Variations

0 5 10 150

0.2

0.4

0.6

0.8

1

1.2

1.4

Step Response

Time (sec)

Ampl

itude

a=−0.01a=0a=0.01

Figure: Closed-loop responses for unit feedback

Guy Dumont (UBC) EECE 574 Overview 13 / 49

Page 14: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Background Example 1

Effect of Process Variations

−100

−50

0

50

100

150

Mag

nitu

de (d

B)

10−4 10−3 10−2 10−1 100 101 102−180

−135

−90

−45

0

Phas

e (d

eg)

Bode Diagram

Frequency (rad/sec)

a=−0.01a=0a=0.01

Figure: Open-loop Bode plots

Guy Dumont (UBC) EECE 574 Overview 14 / 49

Page 15: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Background Example 1

Effect of Process Variations

−80

−60

−40

−20

0

20

Mag

nitu

de (d

B)

10−2 10−1 100 101 102−180

−135

−90

−45

0

Phas

e (d

eg)

Bode Diagram

Frequency (rad/sec)

a=−0.01a=0a=0.01

Figure: Closed-loop Bode plots

Guy Dumont (UBC) EECE 574 Overview 15 / 49

Page 16: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Background Example 2

Effect of Process Variations

Consider now the system

G(s) =400(1− sT )

(s + 1)(s + 20)(1 + sT )

Open-loop responses for T = 0, 0.015, 0.03:

0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5−5

0

5

10

15

20

Step Response

Time (sec)

Ampl

itude

T=0T=0.015T=0.03

Figure: Open-loop responses

Guy Dumont (UBC) EECE 574 Overview 16 / 49

Page 17: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Background Example 2

Effect of Process Variations

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1−0.5

0

0.5

1

1.5

2

Step Response

Time (sec)

Ampl

itude

T=0T=0.015T=0.03

Figure: Unit-feedback closed-loop responses

Guy Dumont (UBC) EECE 574 Overview 17 / 49

Page 18: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Background Example 2

Effect of Process Variations

−150

−100

−50

0

50

Mag

nitu

de (d

B)

10−2 10−1 100 101 102 103 104−180

0

180

360

Phas

e (d

eg)

Bode Diagram

Frequency (rad/sec)

T=0T=0.015T=0.03

Figure: Open-loop Bode plots

Guy Dumont (UBC) EECE 574 Overview 18 / 49

Page 19: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Background Example 2

Effect of Process Variations

−150

−100

−50

0

50

Mag

nitu

de (d

B)

100 101 102 103 104−180

0

180

360

Phas

e (d

eg)

Bode Diagram

Frequency (rad/sec)

T=0T=0.015T=0.03

Figure: Unit-feedback closed-loop Bode plots

Guy Dumont (UBC) EECE 574 Overview 19 / 49

Page 20: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Background Example 2

Effect of Process Variations

Consider now the same system but with a controllerC(s) = 0.075/(s + 1):

0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5−2

0

2

4

6

8

10

12

Step Response

Time (sec)

Ampl

itude

T=0T=0.015T=0.03

Figure: New closed-loop responses

Guy Dumont (UBC) EECE 574 Overview 20 / 49

Page 21: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Background Example 2

Effect of Process Variations

−150

−100

−50

0

50

Mag

nitu

de (d

B)

10−1 100 101 102 103 104−180

0

180

360

540

Phas

e (d

eg)

Bode Diagram

Frequency (rad/sec)

T=0T=0.015T=0.03

Figure: New closed-loop Bode plots

Guy Dumont (UBC) EECE 574 Overview 21 / 49

Page 22: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Background Example 2

Mechanisms for Process Dynamics Changes

Nonlinear actuators or sensorsNonlinear valvespH probes

Flow and speed variationsConcentration controlSteel rolling millsPaper machinesRotary kilns

Guy Dumont (UBC) EECE 574 Overview 22 / 49

Page 23: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Background Example 2

Mechanisms for Process Dynamics Changes

Wide operating range with a nonlinear systemFlight control

Variations in Disturbance DynamicsWave characteristics in ship steeringRaw materials in process industries

Guy Dumont (UBC) EECE 574 Overview 23 / 49

Page 24: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Adaptive Schemes Gain Scheduling

Gain Scheduling

In many cases, process dynamics change with operatingconditions in a known fashion

Flight control systemsCompensation for production rate changesCompensation for paper machine speed

Controller parameters change in a predetermined fashion withthe operating conditionsIs gain scheduling adaptive?

Guy Dumont (UBC) EECE 574 Overview 24 / 49

Page 25: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Adaptive Schemes Gain Scheduling

Gain Scheduling

SetpointInput Output

Controllerparameters Gain

schedule

Controller Process

Guy Dumont (UBC) EECE 574 Overview 25 / 49

Page 26: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Adaptive Schemes Development of Adaptive Control

Development of Adaptive Control

Mid 1950s: Flight control systems (eventually solved by gainscheduling)1957: Bellman develops dynamic programming1958: Kalman develops the self-optimizing controller “whichadjusts itself automatically to control an arbitrary dynamicprocess”1960: Feldbaum develops the dual controller in which the controlaction serves a dual purpose as it is “directing as well asinvestigating”

Guy Dumont (UBC) EECE 574 Overview 26 / 49

Page 27: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Adaptive Schemes Development of Adaptive Control

Development of Adaptive Control

Mid 60s-early 70s: Model reference adaptive systems

But now came a technical problem that spelled the end. The Honeywell adaptive flight control system began alimit-cycle oscillation just as the plane came out of the spin, preventing the system’s gain changer from reducingpitch as dynamic pressure increased. The X-15 began a rapid pitching motion of increasing severity. All thewhile, the plane shot downward at 160,000 feet per minute, dynamic pressure increasing intolerably. . . . As theX-15 neared 65,000 feet, it was speeding downward at Mach 3.93 and experiencing over 15 g vertically, bothpositive and negative, and 8 g laterally. It broke up into many pieces amid loud sonic rumblings, . . . Then an AirForce pilot, . . . , spotted the main wreckage . . . . Mike Adams was dead and the X-15 destroyed.

Guy Dumont (UBC) EECE 574 Overview 27 / 49

Page 28: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Adaptive Schemes Development of Adaptive Control

Development of Adaptive Control

Late 60s-early 70s: System identification approach with recursiveleast-squaresEarly 1980s: Convergence and stability analysisMid 1980s: Robustness analysis1990s: Multimodel adaptive control1990s: Iterative control2000s: L1 adaptive control: fast adaptation with guaranteedrobustness.

Guy Dumont (UBC) EECE 574 Overview 28 / 49

Page 29: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Adaptive Schemes Model Reference Adaptive Control

Model Reference Adaptive Control

Performance specifications given in terms of reference modelOriginally introduced for flight control systems (MIT rule)Nontrivial adjusment mechanism

Guy Dumont (UBC) EECE 574 Overview 29 / 49

Page 30: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Adaptive Schemes Model Reference Adaptive Control

Model Reference Adaptive Control

Setpoint

Input Output

Controllerparameters

Adjustment mechanism

Controller Process

Model Modeloutput

Guy Dumont (UBC) EECE 574 Overview 30 / 49

Page 31: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Adaptive Schemes Self-Tuning Control

Self-Tuning Controller

Model-based tuning consists of two operations:Model building via identificationController design using the identified model

Self-tuning control can be thought of as an automation of thisprocedure when these two operations are performed on-line

Guy Dumont (UBC) EECE 574 Overview 31 / 49

Page 32: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Adaptive Schemes Self-Tuning Control

Self-Tuning Controller

Setpoint

Input Output

Controllerparameters

Recursiveestimation

Controller Process

Controldesign

Process parameters

Guy Dumont (UBC) EECE 574 Overview 32 / 49

Page 33: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Adaptive Schemes Self-Tuning Control

Self-Tuning vs. Auto-Tuning

Self-tuningContinuous updating of controller parametersUsed for truly time-varying plants

Auto-tuningOnce controller parameters near convergence, adaptation isstoppedUsed for time invariant or very slowly varying processesUsed for periodic, usually on-demand tuning

Guy Dumont (UBC) EECE 574 Overview 33 / 49

Page 34: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Adaptive Schemes Self-Tuning Control

Final Motivation...

Guy Dumont (UBC) EECE 574 Overview 34 / 49

Page 35: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Adaptive Schemes Self-Tuning Control

Final Motivation...

Guy Dumont (UBC) EECE 574 Overview 35 / 49

Page 36: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Dual Control

Dual Control: A Rigorous Approach to AdaptiveControl

Use of nonlinear stochastic control theory to derive an adaptivecontrollerNo distinction between parameters and state variables –HyperstateThe controller is a nonlinear mapping from the hyperstate to thecontrol variable

Setpoint

Input Output

Hyperstateestimation

Nonlinearmapping

Process

Hyperstate

Guy Dumont (UBC) EECE 574 Overview 36 / 49

Page 37: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Dual Control

A Rigorous Approach to Adaptive Control

Can handle very rapid parameter changesResulting controller has very interesting features:

RegulationCautionProbing

Unfortunately solution is untractable for most systems

Guy Dumont (UBC) EECE 574 Overview 37 / 49

Page 38: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Dual Control

Illustration of Dual Control

Consider the simple process

y(t + 1) = y(t) + bu(t) + e(t + 1)

where e(t) is zero-mean white noise N(0, σ), y(t)and u(t) are theoutput and the input signals.

One-stage control

Find u(t) that minimizes

I1 = E [y2(t + 1)|y(t),u(t)]

Guy Dumont (UBC) EECE 574 Overview 38 / 49

Page 39: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Dual Control Certainty equivalence controller

Certainty Equivalence Controller

In case b is known, the solution is trivial:

min I1 = min [y(t − 1) + bu(t − 1)]2 + σ2 = σ2

since e(t) is independent of y(t − 1), u(t − 1) and b.

u(t) = −y(t)b

I1opt = σ2

Guy Dumont (UBC) EECE 574 Overview 39 / 49

Page 40: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Dual Control Certainty equivalence controller

Certainty Equivalence Controller

Now, assume that b is unknown.We now have an estimate b̂ with covariance pbIf least-squares is used:

b̂ =

{t∑

s=1

[y(s)− y(s − 1)]u(s − 1)

}/

t∑s=1

u2(s − 1)

pb = σ2/

t∑s=1

u2(s − 1)

Guy Dumont (UBC) EECE 574 Overview 40 / 49

Page 41: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Dual Control Certainty equivalence controller

Certainty Equivalence Controller

The most direct way to control the system is simply to replace b by b̂ inthe controller above, thus ignoring the uncertainty:

uce(t) = −y(t)

thenI1ce = σ2 +

pb

b̂2y2(t − 1)

Guy Dumont (UBC) EECE 574 Overview 41 / 49

Page 42: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Dual Control Cautious Controller

Cautious Controller

Performing the minimization of I1 actually gives:

u(t) = − b̂b̂2 + pb

y(t)

and the minimum performance index

I1caut = σ2 +pb

b̂2 + pby2(t − 1)

Guy Dumont (UBC) EECE 574 Overview 42 / 49

Page 43: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Dual Control Cautious Controller

Cautious Controller

Because pb is positive, the cautious controller has a smaller gainthan the certainty equivalence one, which by ignoring uncertaintymay be at times too boldTurn-off phenomenon:

When the uncertainty pb is large, controller gain is small and sodoes the control actionSo, unless an external perturbation is added to the input, nolearning can take place and the uncertainty pb cannot be reduced

This highlights the importance of probing signals

Guy Dumont (UBC) EECE 574 Overview 43 / 49

Page 44: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Dual Control Dual Controller

Dual Controller

N-stage control

Find u(t) that minimizes

IN = E [N∑1

y2(t + i)|y(t),u(t)]

By using the N-stage control problem with N > 1, it can be shownthat the effect of present inputs on the future values of b̂ and pbenters the minimization of INIndeed it is sometimes beneficial to sacrifice short termperformance by sending a probing signal to reduce theuncertainty, and thus improve performance in the long term

Guy Dumont (UBC) EECE 574 Overview 44 / 49

Page 45: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Dual Control Dual Controller

Dual Controller

Using dynamic programming, a functional equation (Bellmanequation) can be derivedHowever, this equation can only be solved numerically and forvery simple casesFor large N, the control tends towards a steady-state control law

Guy Dumont (UBC) EECE 574 Overview 45 / 49

Page 46: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Dual Control Dual Controller

Dual Controller

Define

η =yσ

β =b̂pb

µ = − b̂uy

µ = 1 corresponds to the certainty-equivalence controllerµ = β2/(1 + β2) corresponds to the cautious controllerDual controller for large N is:

µ =β2 − 0.56β

β2 + 0.08β + 2.2+

(1.9β

β4 + 1.7

)1η

Guy Dumont (UBC) EECE 574 Overview 46 / 49

Page 47: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Dual Control Dual Controller

Dual Controller

0

0.5

1

00.2

0.40.6

0.81

−0.5

0

0.5

1

1.5

2

Increasing Control Error

Dual Control Map

Increasing Estimate Accuracy

Figure: Dual control map

Guy Dumont (UBC) EECE 574 Overview 47 / 49

Page 48: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Dual Control Dual Controller

Properties of the Dual Controller

Dual control finds the best compromise betweenboldnesscautionprobing

Low uncertainty→ boldness prevailsLarge uncertainty + large control error→ caution prevailsLarge uncertainty + small control error→ probing prevails

Guy Dumont (UBC) EECE 574 Overview 48 / 49

Page 49: EECE 574 - Adaptive Control - Overview - PhoneOximeter · EECE 574 - Adaptive Control Overview Guy Dumont Department of Electrical and Computer Engineering University of British Columbia

Implications forAdaptive Control

Implications for Adaptive Control

The dual controller is in general impossible to computeMost current adaptive control methods enforce certaintyequivalenceThus, learning is passive rather than activePassive learning is a shortcoming of current adaptive controlmethodsPractical methods of active learning attractive for

Commissioning of adaptive controllersAdaptive control of processes with rapidly time-varying dynamics

Guy Dumont (UBC) EECE 574 Overview 49 / 49