linear quadratic gausssian control design with loop transfer recovery [lqg…€¦ · ·...

Linear Quadratic Gausssian Control Design withLoop Transfer Recovery ∗

Leonid Freidovich

Department of MathematicsMichigan State University

MI 48824, USA

e-mail:leonid@math.msu.edu

http://www.math.msu.edu/˜leonid/

December 4, 2003

∗Presentation for ME-891, Fall 2003

Outline

(a) Linear quadratic regulation [LQR]

Outline

(b) H2 design [LQG]

Outline

(b) H2 design [LQG]

(c) Idea of loop transfer recovery [LTR]

Outline

(b) H2 design [LQG]

(d) Technical implementations of LQR/LTR

Outline

(b) H2 design [LQG]

(d) Technical implementations of LQR/LTR

(e) References

[LQR/LQG/LTR]

Linear Quadratic Regulation [LQR] ←x = Ax + B2u,x(0) = x0.

• (A,B2) is controllable.

Linear Quadratic Regulation [LQR] ←x = Ax + B2u,x(0) = x0.

• (A,B2) is controllable.

Goal: Design state feedback controller

u = K(s)x, [ K(s) is a proper real rational transfer function] (2)

depending on A and B2 such that for arbitrary x0 :

1. the closed-loop system (1), (2) is internally stable,

2. the solution x(t) and the control signal u(t) satisfy certainspecifications.

[LQR/LQG/LTR]

Let us choose C1, D12 and define performance index:

Jlqr = ‖z‖22 = ‖C1x + D12u‖22∆=

∞∫0

[C1x(t) + D12u(t)]∗[C1x(t) + D12u(t)] dt (3)

Jlqr = ‖z‖22 = ‖C1x + D12u‖22∆=

∞∫0

[C1x(t) + D12u(t)]∗[C1x(t) + D12u(t)] dt (3)

If C∗1D12 = 0 then Jlqr = ‖C1x‖22 + ‖D12u‖22.

Jlqr = ‖z‖22 = ‖C1x + D12u‖22∆=

∞∫0

[C1x(t) + D12u(t)]∗[C1x(t) + D12u(t)] dt (3)

If C∗1D12 = 0 then Jlqr = ‖C1x‖22 + ‖D12u‖22.

LQR problem: Design state feedback controller (2) depending on A, B2,C1, and D12 such that for arbitrary x0 :

2. the performance index (3) is the smallest.

[LQR/LQG/LTR]

The solution exists if

• D∗12D12 > 0,

• (C1, A) is detectable;

A− jωI B2

C1 D12

]has full column rank for all ω.

[LQR/LQG/LTR]

Properties of the solution:

1. optimal controller K(s) = Klqr is the constant gain:

Klqr = −(D∗12D12)−1(B∗2X + D∗

12C1),

where X ≥ 0 is the stabilizing solution of the Riccati equation:

A∗X + XA−XB2(D∗12D12)−1B∗2X + C∗1 [I −D12(D∗

12D12)−1D∗12]C1 = 0,

with A = A−B2(D∗12D12)−1D∗

Klqr = −(D∗12D12)−1(B∗2X + D∗

12C1),

A∗X + XA−XB2(D∗12D12)−1B∗2X + C∗1 [I −D12(D∗

12D12)−1D∗12]C1 = 0,

with A = A−B2(D∗12D12)−1D∗

2. A + B2Klqr is Hurwitz and limt→∞

x(t) = 0,

Klqr = −(D∗12D12)−1(B∗2X + D∗

12C1),

A∗X + XA−XB2(D∗12D12)−1B∗2X + C∗1 [I −D12(D∗

12D12)−1D∗12]C1 = 0,

with A = A−B2(D∗12D12)−1D∗

2. A + B2Klqr is Hurwitz and limt→∞

x(t) = 0,

3. guaranteed 6 dB (= 20 log 2) gain margin and 60o phase margin in bothdirections.

[LQR/LQG/LTR]

Linear Quadratic Gaussian [LQG or H2 ] problem ←Model:

x = Ax + B2u,y = C2x + D21u,x(0) = x0.

Assumptions:

• (A,B2) is controllable,

• (C2, A) is detectable.

[LQR/LQG/LTR]

Goal: Design output feedback controller

u = K(s)y, [ K(s) is a proper real rational transfer function] (5)

depending on A, B2, C2, and D21 such that for arbitrary x0 :

Let us choose C1, D12, as before.

Let us choose C1, D12, as before. It could be shown that if D12 6= 0,then the output controller minimizing Jlqr = ‖z‖2 = ‖C1x + D12u‖22 cannotbe proper (i.e. solution does not exist).

[LQR/LQG/LTR]

Considerx = Ax + B1w + B2u,y = C2x + D21u,x(0) = x0.

Define performance index

Jlqg = ‖Tw→z‖22 = max‖w‖2≤1

{‖z‖2∞‖w‖22

}= max‖w‖2≤1

{‖C1x + D12u‖2∞

‖w‖22

}. (7)

Considerx = Ax + B1w + B2u,y = C2x + D21u,x(0) = x0.

Define performance index

Jlqg = ‖Tw→z‖22 = max‖w‖2≤1

{‖z‖2∞‖w‖22

}= max‖w‖2≤1

{‖C1x + D12u‖2∞

‖w‖22

}. (7)

LQG problem: Design output feedback controller (5) depending on A, B1,B2, C1, C2, D12, and D21 such that for arbitrary x0 :

2. the performance index (7) is the smallest.[LQR/LQG/LTR]

The solution exists if

• D∗12D12 > 0,

A− jωI B2

C1 D12

]has full column rank for all ω.

• D21D∗21 > 0 ,

A− jωI B1

C2 D21

]has full row rank for all ω.

[LQR/LQG/LTR]

1. optimal K(s) is the observer based controller:

u = Klqgx,˙x = Ax + B2u + Llqg(C2x + D21u− y),x(0) = x0,

1. optimal K(s) is the observer based controller:

u = Klqgx,˙x = Ax + B2u + Llqg(C2x + D21u− y),x(0) = x0,

whereKlqg = Klqr

andLlqg = −(Y C∗2 + B1D

∗21)(D21D

∗21)−1,

where Y ≥ 0 is the stabilizing solution of the Riccati equation:

A∗Y + Y A− Y C∗2(D21D∗21)−1C2Y + B1[I −D∗

21(D21D∗21)−1D21]B∗1 = 0,

with A = A−B1D∗21(D21D

∗21)−1C2,

[LQR/LQG/LTR]

2. observer cannot be too fast since transient with peaking is not ‘compatible’with small values of ‖C1x + D12u‖∞,

3. if w(t) ≡ 0 then limt→∞

x(t) = 0,

2. observer cannot be too fast since transient with peaking is not ‘compatible’with small values of ‖C1x + D12u‖∞,

3. if w(t) ≡ 0 then limt→∞

x(t) = 0,

4. in general, neither gain nor phase margins are guaranteed (have to checkrobustness for each particular design).

[LQR/LQG/LTR]

LQG with loop transfer recovery ←Consider the observer based controller for the system (4) with D21 = 0

x = Ax + B2u,y = C2x,x(0) = x0,

LQG with loop transfer recovery ←Consider the observer based controller for the system (4) with D21 = 0

x = Ax + B2u,y = C2x,x(0) = x0,

u = Klqrx,˙x = Ax + B2u + L(C2x− y),x(0) = x0.

Assumptions:

• (A,B2) is controllable,

• (C2, A) is detectable,

• Klqr is the optimal gain from the LQR problem.

[LQR/LQG/LTR]

Goal: Design the observer gain L so that the closed-loop system (8)recovers internal stability and some of the robustness properties (gain andphase margins) of the LQR design.

Compare

-��

- x = Ax + B2u

�Klqr

Figure 1: u = {Lt(s)}u

u = Klqr (sI −A)−1B2u︸︷︷︸x=Ax+B2u

def= {Lt(s)}u

[LQR/LQG/LTR]

-��

x = Ax + B2u -

��Klqr

observer

Figure 2: u = {Lo(s)}u

u = Klqr [−(sI −A− LC2 −B2Klqr)−1LC2]︸︷︷︸observer: ˙x=Ax+B2[Klqrx]+LC2(x−x)

(sI −A)−1B2u︸︷︷︸x=Ax+B2u

def= {Lo(s)}u

-��

x = Ax + B2u -

��Klqr

observer

Figure 2: u = {Lo(s)}u

u = Klqr [−(sI −A− LC2 −B2Klqr)−1LC2]︸︷︷︸observer: ˙x=Ax+B2[Klqrx]+LC2(x−x)

(sI −A)−1B2u︸︷︷︸x=Ax+B2u

def= {Lo(s)}u

We want Lo(s) ≡ Lt(s), but Lo(s) is biproper and Lt(s) is strictly proper.[LQR/LQG/LTR]

LQG/LTR problem: Design the observer gain L so that

1. the closed-loop system (8) is internally stable,

2. for all ω ∈ [0, ωmax] :Lo(jω) ≈ Lt(jω),

where ωmax � 1,target (under state feedback) open-loop transfer function is

Lt(s) = Klqr(sI −A)−1B2,

achieved (under output feedback) open-loop transfer function is

Lo(s) = KlqrTx→x(s)(sI −A)−1B2,

and ‘excited’ observer transfer function is

Tx→x(s) = [−(sI −A− LC2 −B2Klqr)−1LC2].[LQR/LQG/LTR]

Doyle – Stein formula ←It could be shown that if

−L[I − C2(jωI −A)−1L]−1 = B2[C2(jωI −A)−1B2]−1

thenLo(jω) = Lt(jω).

Supposelimε→0{εL(ε)} = −B2W,

where W is not singular and ε� 1.

Supposelimε→0{εL(ε)} = −B2W,

where W is not singular and ε� 1. Then

limε→0{B2W [εI + C2(jωI −A)−1B2W ]−1} = B2[C2(jωI −A)−1B2]−1

and hencelimε→0{Lo(jω)} = Lt(jω).

[LQR/LQG/LTR]

Remarks on(−L(ε)[I − C2(jωI −A)−1L(ε)]−1 ≈ B2[C2(jωI −A)−1B2]−1

)• In general, the choice L(ε) = −B2W/ε does not guarantee internal

stability of the closed-loop system.

• Since

−L(ε)[I −C2(jωI −A)−1L(ε)]−1 = −(jωI −A)(jωI − [A + L(ε)C2])−1L(ε),

• Since

if εL(ε) = −B2W + O(ε), then some of the eigenvalues of [A + L(ε)C2](i.e. poles of the observer error dynamics) approach the zeros of[C2(jωI −A)−1B2] (i.e. zeros of the plant) and the others approach infinityas ε→ 0.

• Since

• If the plant is nonminimum-phase then A + L(ε)C2 is not Hurwitz forsmall values of ε.

• Since

• If the plant is nonminimum-phase then A + L(ε)C2 is not Hurwitz forsmall values of ε.

• Some of the observer’s modes are fast. Therefore peaking phenomenonmust occur.

[LQR/LQG/LTR]

LQR design of L(ε) ←Consider LQR problem for the system

˙x = Ax + B2u,x(0) = x0,

where A = A∗ and B2 = C∗2 .

˙x = Ax + B2u,x(0) = x0,

Take C1(ε) = B∗2/ε (cheap control)

˙x = Ax + B2u,x(0) = x0,

Take C1(ε) = B∗2/ε (cheap control) and D12 independent of ε so thatD∗

12D12 = R > 0 and C∗1D12 = 0.

˙x = Ax + B2u,x(0) = x0,

Take C1(ε) = B∗2/ε (cheap control) and D12 independent of ε so thatD∗

12D12 = R > 0 and C∗1D12 = 0.

Suppose Klqr(ε) is the solution gain of the problem above for each ε > 0.

TakeL(ε) =

[Klqr(ε)

[LQR/LQG/LTR]

Then [A + B2Klqr(ε)

]∗ = A + L(ε)C2

is Hurwitz

]∗ = A + L(ε)C2

is Hurwitz andL(ε) = −X(ε)C∗2R−1,

where X(ε) is the stabilizing solution of the Riccati equation

AX(ε) + X(ε)A∗ − X(ε)C∗2R−1C2X(ε) + B2B∗2/ε2 = 0.

]∗ = A + L(ε)C2

is Hurwitz andL(ε) = −X(ε)C∗2R−1,

where X(ε) is the stabilizing solution of the Riccati equation

AX(ε) + X(ε)A∗ − X(ε)C∗2R−1C2X(ε) + B2B∗2/ε2 = 0.

It could be shown that if the plant is of minimum phase (and left invertible)then lim

ε→0{ε2[AX(ε) + X(ε)A∗]} = 0 and

limε→0{εL(ε)} = lim

ε→0{−εX(ε)C∗2R−1} = −B2R

−1/2 = −B2W

[LQR/LQG/LTR]

Alternative designs of L(ε) ←

For the ‘broken’ closed-loop system

x = Ax + B2u,y = C2x + D21u,x(0) = x0,

Alternative designs of L(ε) ←

For the ‘broken’ closed-loop system

x = Ax + B2u,y = C2x + D21u,x(0) = x0,

u = Klqrx,˙x = Ax + B2u + L(C2x + D21u− y),x(0) = x0.

we have: Lt(s) = Klqr(sI −A)−1B2 and

Lo(s) = Klqr[−(sI−A−LC2−B2Klqr+LD21Klqr)−1]L[C2(sI−A)−1B2+D21].[LQR/LQG/LTR]

It could be shown that

Lt(s)− Lo(s) = M(s)[I + M(s)]−1(I −KlqrB2),

where M(s) = −Klqr([sI −A]−1 − LC2)−1(B2 + LD21).

Therefore it is reasonable to search for L that minimizes ‖M(s)‖2 or‖M(s)‖∞.

Also, a pole-placement technique can be used to design the observer gain aswell. However, it is not trivial and the system need to be transformed into thespecial coordinate basis.

[LQR/LQG/LTR]

References

[1] H. Kwakernaak and R. Sivan, Linear optimal control systems, N.Y.:Wiley-Interscience, 1972.

[2] J.C. Doyle, “Guaranteed margins for LQG regulators,” IEEE Trans. onAC, Vol. 23, pp. 756–757, 1978.

[3] J.C. Doyle and G. Stein, “Robustness with observers,” IEEE Trans. onAC, Vol. 24, pp. 607–611, 1979.

[4] J.C. Doyle and G. Stein, “Multivariable feedback design: concepts for aclassical/Modern synthesis,” IEEE Trans. on AC, Vol. 26, pp. 4–16, 1981.

[5] G. Stein and M. Athans, “The LQG/LTR procedure for multivariablecontrol design,” IEEE Trans. on AC, Vol. 32, pp. 105–114, 1987.

[LQR/LQG/LTR]

[6] J.C. Doyle, K. Glover, P.P. Khargonekar, and B.A. Francis, “State-spacesolution to standard H2 and H∞ control problems,” IEEE Trans. onAC, Vol. 34, pp. 831–847, 1989.

[7] A. Saberi, B.M. Chen, and P. Sanuti, Loop transfer recovery: analysisand design, London: Springer-Verlag, 1993.

[8] K. Zhou and J.C. Doyle, Essentials of robust control, N.J.: Prentice-Hall,1998.

[9] J.S. Freudenberg, Robustness with observers, Lecture notes, Universityof Michigan, 1999.

[10] C. Gokcek, ME-891: Robust and Optimal Control, Lecture notes,Michigan State University, 2003.

[11] B.M. Chen, Loop transfer recovery, Lecture notes, 2003.←

[LQR/LQG/LTR]

linear quadratic gausssian control design with loop transfer recovery [lqg…€¦ · ·...

Documents

lecture notes on lqr/lqg controller design · lecture notes...

research article the lqr controller design of two-wheeled...

me233 advanced control ii lecture 1 dynamic programming...

lqg-lqr controller design

high warehouse racks: optimal feedback control and ... ·...

cbe507 lecture iv multivariable and optimal control ·...

linear-quadratic regulator (lqr) controlling the operation...

controllers design of an active suspension...

chapter 2 robust lqr design for stabilization of inverted...

a hybrid fuzzy/lqr based oscillation damping controller...

linear quadratic regulator (lqr) c ontroller design...

fuzzy control of vehicle active suspension system ·...

linear quadratic regulator (lqr) controller design for dc...

high bandwidth suspention modelling and design lqr full...

design and implementation of cascade lqr controller for

skripsi desin kontroler menggunakan metode linear quadratic...

regret bounds for the adaptive control of linear quadratic...

linear quadratic regulator (lqr) controller design

linear quadratic regulator (lqr) c ontroller design … ·...

math4406 (control theory) unit 6: the linear quadratic...