Optimal Control and Applications
V. M. Becerra - Session 1 - 2nd AstroNet-II Summer School
Optimal Control and Applications: Session 1
2nd AstroNet-II Summer School
Victor M. Becerra, School of Systems Engineering, University of Reading
5-6 June 2013
Contents of Session 1
Introduction to optimal control
A brief on constrained nonlinear optimisation
Introduction to calculus of variations
The optimal control problem
Optimality conditions
Pontryagin’s minimum principle
Equality path constraints
Singular arcs
Inequality path constraints
Contents of Session 2
Direct collocation methods for optimal control
Local discretisations
Global (pseudospectral) discretisations
Practical aspects
Multi-phase problems
Efficient sparse differentiation
Scaling
Estimating the discretisation accuracy
Mesh refinement
Introduction to optimal control
For the purposes of these lectures, a dynamical system can be described by a system of ordinary differential equations, as follows:

ẋ(t) = f[x(t), u(t), t]

where x : [t0, tf] → R^nx is the state, u : [t0, tf] → R^nu is the control, t is time, and f : R^nx × R^nu × R → R^nx. Suppose that we are interested in an interval of time such that t ∈ [t0, tf].
We are also interested in a measure of the performance of the system along a trajectory.

We are going to quantify the performance of the system by means of a functional J, which is a rule of correspondence that assigns a value to a function or set of functions.

The functional of interest depends on the state history x(t) and the control history u(t) over the period of interest t ∈ [t0, tf], the initial time t0 and the final time tf:

J = J[x(·), u(·), t0, tf]
The performance index J might include, for example, measures of:
control effort
fuel consumption
energy expenditure
time taken to reach a target
What is optimal control?

Optimal control is the process of finding control and state histories for a dynamic system over a period of time that minimise or maximise a performance index.
Optimal control theory has been developed over a number of centuries.

It is closely related to the theory of the calculus of variations, which started in the late 1600s.

It is also related to developments in dynamic programming and nonlinear programming after the Second World War.

The digital computer has enabled many complex optimal control problems to be solved.

Optimal control is a fundamental tool in space mission design, as well as in many other fields.
As a form of motivation, consider the design of a zero propellant manoeuvre for the International Space Station by means of control moment gyroscopes (CMGs).

The example work was reported by Bhatt (2007) and Bedrossian and co-workers (2009).

The original 90 and 180 degree manoeuvres were computed using a pseudospectral method.

Image courtesy of nasaimages.org.
Implemented on the International Space Station on 5 November 2006 and 2 January 2007.

Savings for NASA of around US$1.5m in propellant costs.

The case described below corresponds to a 90 degree manoeuvre lasting 7200 seconds and using 3 CMGs.
The problem is formulated as follows. Find qc(t) = [qc,1(t), qc,2(t), qc,3(t), qc,4(t)]^T, t ∈ [t0, tf], and the scalar parameter γ to minimise

J = 0.1 γ + ∫_{t0}^{tf} ||u(t)||^2 dt

subject to the dynamical equations:

q̇(t) = (1/2) T(q) (ω(t) − ωo(q))
ω̇(t) = J^{-1} (τd(q) − ω(t) × (Jω(t)) − u(t))
ḣ(t) = u(t) − ω(t) × h(t)

where J is a 3 × 3 inertia matrix, q = [q1, q2, q3, q4]^T is the quaternion vector, ω is the spacecraft angular rate relative to an inertial reference frame and expressed in the body frame, h is the momentum, t0 = 0 s, tf = 7200 s.
The path constraints:

||q(t)||_2^2 = 1
||qc(t)||_2^2 = 1
||h(t)||_2^2 ≤ γ

The parameter bounds:

0 ≤ γ ≤ h_max^2

And the boundary conditions:

q(t0) = q0,  ω(t0) = ωo(q0),  h(t0) = h0
q(tf) = qf,  ω(tf) = ωo(qf),  h(tf) = hf
T(q) is given by:

T(q) = [ −q2  −q3  −q4
          q1  −q4   q3
          q4   q1  −q2
         −q3   q2   q1 ]
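A property worth noting is that q^T T(q) = 0, so the kinematics q̇ = (1/2) T(q) (ω − ωo) cannot change the quaternion norm, consistent with the path constraint ||q(t)||_2^2 = 1. A minimal numerical check (plain Python sketch; the sample q and ω values are arbitrary):

```python
# Check that qdot = 0.5 * T(q) * w leaves the quaternion norm unchanged,
# i.e. d/dt ||q||^2 = 2 q^T qdot = q^T T(q) w = 0 for any rate vector w.

def T(q):
    """The 4x3 quaternion kinematics matrix defined above."""
    q1, q2, q3, q4 = q
    return [[-q2, -q3, -q4],
            [ q1, -q4,  q3],
            [ q4,  q1, -q2],
            [-q3,  q2,  q1]]

q = [0.4, -0.2, 0.7, 0.5]      # arbitrary sample quaternion
w = [0.01, -0.03, 0.02]        # arbitrary sample angular rate (rad/s)

Tq = T(q)
qdot = [0.5 * sum(Tq[i][j] * w[j] for j in range(3)) for i in range(4)]
ddt_norm2 = 2.0 * sum(q[i] * qdot[i] for i in range(4))
print(ddt_norm2)  # ~ 0.0 (up to rounding)
```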
u is the control torque, which is given by:

u(t) = J (KP ε(q, qc) + KD ω̃(ω, ωc))

where

ε(q, qc) = 2 T(qc)^T q
ω̃(ω, ωc) = ω − ωc
ωo is given by:

ωo(q) = n C2(q)

where n is the orbital rotation rate and Cj is the j-th column of the rotation matrix:

C(q) = [ 1 − 2(q3^2 + q4^2)   2(q2q3 + q1q4)       2(q2q4 − q1q3)
         2(q2q3 − q1q4)       1 − 2(q2^2 + q4^2)   2(q3q4 + q1q2)
         2(q2q4 + q1q3)       2(q3q4 − q1q2)       1 − 2(q2^2 + q3^2) ]

τd is the disturbance torque, which in this case incorporates only the gravity gradient torque τgg and the aerodynamic torque τa:

τd = τgg + τa = 3n^2 C3(q) × (J C3(q)) + τa
Image courtesy of nasaimages.org
Video animation of the zero propellant manoeuvre using real telemetry: http://www.youtube.com/watch?v=MIp27Ea9_2I
Numerical results obtained by the author with PSOPT, using a pseudospectral discretisation method with 60 grid points and neglecting the aerodynamic torque.

[Figure: Zero Propellant Manoeuvre of the ISS: control torques u1, u2, u3 (ft-lbf) vs time (s)]

[Figure: Zero Propellant Manoeuvre of the ISS: momentum norm h and limit hmax (ft-lbf-sec) vs time (s)]
Ingredients for solving realistic optimal control problems
Indirect methods

Optimality conditions of the continuous problem are derived.
Attempt to numerically satisfy the optimality conditions.
Requires numerical root finding and ODE methods.

Direct methods

The continuous optimal control problem is discretised.
Need to solve a nonlinear programming problem.

Later in these lectures, we will concentrate on some direct methods.
A brief on constrained nonlinear optimisation
Consider a twice continuously differentiable scalar function f : R^ny → R. Then y∗ is called a strong minimiser of f if there exists δ > 0 such that

f(y∗) < f(y∗ + Δy)

for all 0 < ||Δy|| ≤ δ.

Similarly, y∗ is called a weak minimiser if it is not strong and there exists δ > 0 such that f(y∗) ≤ f(y∗ + Δy) for all 0 < ||Δy|| ≤ δ. The minimum is said to be local if δ < ∞ and global if δ = ∞.
The gradient of f(y) with respect to y is a column vector constructed as follows:

∇y f(y) = [∂f/∂y]^T = fy^T = [∂f/∂y1, …, ∂f/∂yny]^T
The second derivative of f(y) with respect to y is an ny × ny symmetric matrix known as the Hessian, expressed as follows:

∇y^2 f(y) = fyy = [ ∂^2 f/∂y1^2      ⋯  ∂^2 f/∂y1 ∂yny
                     ⋮                ⋱   ⋮
                    ∂^2 f/∂yny ∂y1   ⋯  ∂^2 f/∂yny^2 ]
Consider the second order Taylor expansion of f about y∗:

f(y∗ + Δy) ≈ f(y∗) + fy(y∗) Δy + (1/2) Δy^T fyy(y∗) Δy

A basic result in unconstrained nonlinear programming is the following set of conditions for a local minimum.

Necessary conditions:

fy(y∗) = 0, i.e. y∗ must be a stationary point of f.
fyy(y∗) ≥ 0, i.e. the Hessian matrix is positive semi-definite.

Sufficient conditions:

fy(y∗) = 0, i.e. y∗ must be a stationary point of f.
fyy(y∗) > 0, i.e. the Hessian matrix is positive definite.
For example, the paraboloid f(x) = x1^2 + x2^2 has a global minimum at x∗ = [0, 0]^T, where the gradient is fx(x∗) = [0, 0] and the Hessian is positive definite:

fxx(x∗) = [ 2  0
            0  2 ]
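These conditions are straightforward to check numerically. A small sketch in plain Python using central finite differences (the step sizes and the Sylvester-criterion check are implementation choices, not from the slides):

```python
# Verify stationarity and positive definiteness for f(x) = x1^2 + x2^2
# at x* = (0, 0) using central finite differences.

def f(x):
    return x[0]**2 + x[1]**2

def grad(f, x, h=1e-5):
    g = []
    for i in range(len(x)):
        xp = list(x); xm = list(x)
        xp[i] += h; xm[i] -= h
        g.append((f(xp) - f(xm)) / (2 * h))
    return g

def hessian(f, x, h=1e-4):
    n = len(x)
    H = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            xpp = list(x); xpm = list(x); xmp = list(x); xmm = list(x)
            xpp[i] += h; xpp[j] += h
            xpm[i] += h; xpm[j] -= h
            xmp[i] -= h; xmp[j] += h
            xmm[i] -= h; xmm[j] -= h
            H[i][j] = (f(xpp) - f(xpm) - f(xmp) + f(xmm)) / (4 * h * h)
    return H

xstar = [0.0, 0.0]
g = grad(f, xstar)       # ~ [0, 0]: stationary point
H = hessian(f, xstar)    # ~ [[2, 0], [0, 2]]
# Sylvester's criterion for a 2x2 matrix: H[0][0] > 0 and det(H) > 0
det = H[0][0] * H[1][1] - H[0][1] * H[1][0]
print(g, H, det)
```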
Constrained nonlinear optimisation involves finding a decision vector y ∈ R^ny to minimise

f(y)

subject to

hL ≤ h(y) ≤ hU,
yL ≤ y ≤ yU,

where f : R^ny → R is a twice continuously differentiable scalar function, and h : R^ny → R^nh is a vector function with twice continuously differentiable elements hi. Problems such as the above are often called nonlinear programming (NLP) problems.
A particular case that is simple to analyse occurs when the nonlinear programming problem involves only equality constraints, so that the problem can be expressed as follows. Choose y to minimise

f(y)

subject to

c(y) = 0,

where c : R^ny → R^nc is a vector function with twice continuously differentiable elements ci, and nc < ny.
Introduce the Lagrangian function:

L[y, λ] = f(y) + λ^T c(y)   (1)

where L : R^ny × R^nc → R is a scalar function and λ ∈ R^nc is a vector of Lagrange multipliers.
The first derivative of c(y) with respect to y is an nc × ny matrix known as the Jacobian, expressed as follows:

∂c(y)/∂y = cy(y) = [ ∂c1/∂y1   ⋯  ∂c1/∂yny
                      ⋮         ⋱   ⋮
                     ∂cnc/∂y1  ⋯  ∂cnc/∂yny ]
The first order necessary conditions for a point (y∗, λ) to be a local minimiser are as follows:

Ly[y∗, λ]^T = fy(y∗)^T + [cy(y∗)]^T λ = 0
Lλ[y∗, λ]^T = c(y∗) = 0        (2)

Note that equation (2) is a system of ny + nc equations in ny + nc unknowns.
For a scalar function f(x) and a single equality constraint c(x) = 0, the figure illustrates the collinearity between the gradient of f and the gradient of c at a stationary point, i.e. ∇x f(x∗) = −λ ∇x c(x∗).
For example, consider the minimisation of the function f(x) = x1^2 + x2^2 subject to the linear equality constraint c(x) = 2x1 + x2 + 4 = 0.

The first order necessary conditions lead to a system of three linear equations whose solution is the minimiser x∗.
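For this example the necessary conditions (2) are linear: 2x1 + 2λ = 0, 2x2 + λ = 0 and 2x1 + x2 + 4 = 0. A plain Python sketch that solves this 3 × 3 system by Gaussian elimination:

```python
# Necessary conditions for min x1^2 + x2^2  s.t.  2*x1 + x2 + 4 = 0,
# written as the linear system A z = b with z = (x1, x2, lam).

def solve3(A, b):
    """Gaussian elimination with partial pivoting for a 3x3 system."""
    n = 3
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for k in range(n):
        p = max(range(k, n), key=lambda r: abs(M[r][k]))
        M[k], M[p] = M[p], M[k]
        for r in range(k + 1, n):
            m = M[r][k] / M[k][k]
            for c in range(k, n + 1):
                M[r][c] -= m * M[k][c]
    z = [0.0] * n
    for k in reversed(range(n)):
        z[k] = (M[k][n] - sum(M[k][c] * z[c] for c in range(k + 1, n))) / M[k][k]
    return z

A = [[2.0, 0.0, 2.0],   # dL/dx1 = 2*x1 + 2*lam = 0
     [0.0, 2.0, 1.0],   # dL/dx2 = 2*x2 + lam   = 0
     [2.0, 1.0, 0.0]]   # constraint: 2*x1 + x2 = -4
b = [0.0, 0.0, -4.0]

x1, x2, lam = solve3(A, b)
print(x1, x2, lam)  # -> -1.6 -0.8 1.6
```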
In fairly simple cases, the necessary conditions may help to find the solution. In general, however:

There is no analytic solution.
It is necessary to solve the problem numerically.
Numerical methods approach the solution iteratively.
If the nonlinear programming problem also includes a set of inequality constraints, then the resulting problem is as follows. Choose y to minimise

f(y)

subject to

c(y) = 0,
q(y) ≤ 0,

where q : R^ny → R^nq is a twice continuously differentiable vector function.
The Lagrangian function is redefined to include the inequality constraint function:

L[y, λ, μ] = f(y) + λ^T c(y) + μ^T q(y)   (3)

where μ ∈ R^nq is a vector of Karush-Kuhn-Tucker multipliers.
Definition (Active and inactive constraints)

An inequality constraint qi(y) is said to be active at y if qi(y) = 0. It is inactive at y if qi(y) < 0.

Definition (Regular point)

Let y∗ satisfy c(y∗) = 0 and q(y∗) ≤ 0, and let Ic(y∗) be the index set of active inequality constraints. Then we say that y∗ is a regular point if the gradient vectors

∇y ci(y∗), 1 ≤ i ≤ nc,   ∇y qj(y∗), j ∈ Ic(y∗)

are linearly independent.
In this case, the first order necessary conditions for a local minimiser are the well known Karush-Kuhn-Tucker (KKT) conditions, which are stated as follows. If y∗ is a regular point and a local minimiser, then there exist multipliers λ and μ such that:

μ ≥ 0
Ly[y∗, λ, μ]^T = fy(y∗)^T + [cy(y∗)]^T λ + [qy(y∗)]^T μ = 0
Lλ[y∗, λ, μ]^T = c(y∗) = 0
μ^T q(y∗) = 0        (4)
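The KKT conditions can be verified numerically at a candidate point. A sketch for a hand-built toy problem (not from the slides): minimise f(y) = y1^2 + y2^2 subject to c(y) = y1 + y2 − 2 = 0 and q(y) = −y1 ≤ 0, with the hand-derived candidate y∗ = (1, 1), λ = −2, μ = 0:

```python
# KKT check at a candidate point:
#   min y1^2 + y2^2   s.t.  y1 + y2 - 2 = 0  (equality)
#                           -y1 <= 0          (inequality, inactive here)

def grad_f(y):  return [2 * y[0], 2 * y[1]]
def c(y):       return y[0] + y[1] - 2.0
def grad_c(y):  return [1.0, 1.0]
def q(y):       return -y[0]
def grad_q(y):  return [-1.0, 0.0]

y, lam, mu = [1.0, 1.0], -2.0, 0.0   # candidate point and multipliers

stationarity = [grad_f(y)[i] + lam * grad_c(y)[i] + mu * grad_q(y)[i]
                for i in range(2)]
kkt_ok = (mu >= 0.0
          and all(abs(s) < 1e-12 for s in stationarity)  # Ly = 0
          and abs(c(y)) < 1e-12                          # feasibility
          and q(y) <= 1e-12                              # feasibility
          and abs(mu * q(y)) < 1e-12)                    # complementarity
print(stationarity, kkt_ok)  # -> [0.0, 0.0] True
```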
Methods for the solution of nonlinear programming problems are well established.

Well known methods include Sequential Quadratic Programming (SQP) and interior point methods.

A quadratic programming problem is a special type of nonlinear programming problem where the constraints are linear and the objective is a quadratic function.
SQP methods involve the solution of a sequence of quadratic programming sub-problems, each of which gives a search direction that is used to find the next iterate through a line search based on a merit (penalty) function.

Interior point methods iteratively enforce the KKT conditions, performing a line search at each major iteration based on a barrier function to find the next iterate.
Current NLP implementations incorporate developments in numerical linear algebra that exploit sparsity in matrices.

It is common to numerically solve NLP problems involving variables and constraints numbering in the hundreds of thousands.

Well known large scale NLP solvers include SNOPT (SQP) and IPOPT (interior point).
A quick introduction to calculus of variations
Definition (Functional)

A functional J is a rule of correspondence that assigns to each function x a unique real number.

For example, suppose that x is a continuous function of t defined on the interval [t0, tf] and

J(x) = ∫_{t0}^{tf} x(t) dt

is the real number assigned by the functional J (in this case, J is the area under the x(t) curve).
The increment of a functional J is defined as follows:

ΔJ = J(x + δx) − J(x)   (5)

where δx is called the variation of x.
The variation δJ of a functional J is the part of the increment ΔJ that is linear in the variation δx, so that

ΔJ = δJ + (terms of higher order in δx)

Example: Find the variation δJ of the following functional:

J(x) = ∫_0^1 x^2(t) dt

Solution:

ΔJ = J(x + δx) − J(x) = ∫_0^1 [2x δx + (δx)^2] dt

δJ = ∫_0^1 2x δx dt
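The first order character of δJ can be checked numerically: the difference between ΔJ and δJ should be of second order in δx. A plain Python sketch (trapezoidal quadrature; the choices x(t) = t and δx(t) = ε sin t are arbitrary):

```python
import math

# Compare Delta_J = J(x + dx) - J(x) with the variation
# delta_J = int_0^1 2*x*dx dt for the functional J(x) = int_0^1 x(t)^2 dt.

def trapz(vals, h):
    return h * (0.5 * vals[0] + sum(vals[1:-1]) + 0.5 * vals[-1])

N = 2001
h = 1.0 / (N - 1)
t = [i * h for i in range(N)]

x = [ti for ti in t]                    # nominal trajectory x(t) = t
eps = 1e-4
dx = [eps * math.sin(ti) for ti in t]   # small variation

J      = trapz([xi**2 for xi in x], h)
J_pert = trapz([(xi + di)**2 for xi, di in zip(x, dx)], h)

Delta_J = J_pert - J
delta_J = trapz([2 * xi * di for xi, di in zip(x, dx)], h)

# The leftover Delta_J - delta_J equals int (dx)^2 dt, which is O(eps^2).
print(delta_J, Delta_J - delta_J)
```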
In general, the variation of a differentiable functional of the form

J = ∫_{t0}^{tf} L(x) dt

can be calculated as follows for fixed t0 and tf:

δJ = ∫_{t0}^{tf} [∂L(x)/∂x] δx dt

If t0 and tf are also subject to small variations δt0 and δtf, then the variation of J can be calculated as follows:

δJ = L(x(tf)) δtf − L(x(t0)) δt0 + ∫_{t0}^{tf} [∂L(x)/∂x] δx dt
If x is a vector function of dimension n, then we have, for fixed t0 and tf:

δJ = ∫_{t0}^{tf} [∂L(x)/∂x] δx dt = ∫_{t0}^{tf} Σ_{i=1..n} [∂L(x)/∂xi] δxi dt
A functional J has a local minimum at x∗ if for all functions x in the vicinity of x∗ we have:

ΔJ = J(x) − J(x∗) ≥ 0

For a local maximum, the condition is:

ΔJ = J(x) − J(x∗) ≤ 0
Important result

If x∗ is a local minimiser or maximiser, then the variation of J is zero:

δJ[x∗, δx] = 0
A basic problem in the calculus of variations is the minimisation or maximisation of the following integral with fixed t0, tf, x(t0), x(tf):

J = ∫_{t0}^{tf} L[x(t), ẋ(t), t] dt

for a given function L. To solve this problem, we need to compute the variation of J for a variation in x. This results in:

δJ = ∫_{t0}^{tf} [∂L/∂x − (d/dt)(∂L/∂ẋ)] δx dt
Setting δJ = 0 results in the Euler-Lagrange equation:

∂L/∂x − (d/dt)(∂L/∂ẋ) = 0

This equation needs to be satisfied by the function x∗ which minimises or maximises J. This type of solution is called an extremal solution.
Example

Find an extremal for the following functional with x(0) = 0, x(π/2) = 1:

J(x) = ∫_0^{π/2} [ẋ^2(t) − x^2(t)] dt

Solution: the Euler-Lagrange equation gives −2x − 2ẍ = 0, so that ẍ + x = 0.

The solution of this second order ODE is x(t) = c1 cos t + c2 sin t. Using the boundary conditions gives c1 = 0 and c2 = 1, so that the extremal is:

x∗(t) = sin t
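The extremal can be checked numerically: x∗(t) = sin t gives J = 0 (the integrand reduces to cos 2t), and perturbing it by an admissible variation that vanishes at both boundary points increases J, since the second variation here is positive. A plain Python sketch (the perturbation ε sin 2t is an arbitrary admissible choice):

```python
import math

def J(eps, N=20000):
    """J = int_0^{pi/2} (xdot^2 - x^2) dt for x = sin t + eps*sin 2t."""
    a, b = 0.0, math.pi / 2
    h = (b - a) / N
    total = 0.0
    for i in range(N + 1):
        t = a + i * h
        x  = math.sin(t) + eps * math.sin(2 * t)       # sin 2t vanishes at 0, pi/2
        xd = math.cos(t) + 2 * eps * math.cos(2 * t)   # analytic derivative
        w = 0.5 if i in (0, N) else 1.0                # trapezoid weights
        total += w * (xd**2 - x**2)
    return total * h

J_star = J(0.0)        # extremal: J = 0
J_pert = J(0.1)        # perturbed admissible trajectory
print(J_star, J_pert)  # J_star ~ 0, J_pert ~ 0.0236 > J_star
```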
A general optimal control problem
Optimal control problem

Determine the state x(t), the control u(t), as well as t0 and tf, to minimise

J = Φ[x(t0), t0, x(tf), tf] + ∫_{t0}^{tf} L[x(t), u(t), t] dt

subject to

Differential constraints:
ẋ = f[x, u, t]

Algebraic path constraints:
hL ≤ h[x(t), u(t), t] ≤ hU

Boundary conditions:
ψ[x(t0), t0, x(tf), tf] = 0
Important features include:
It is a functional minimisation problem, as J is a functional.
Hence we need to use the calculus of variations to derive its optimality conditions.
All optimal control problems have constraints.
There are various classes of optimal control problems:
Free and fixed time problems
Free and fixed terminal/initial states
Path constrained and unconstrained problems
. . .
For the sake of simplicity, the first order optimality conditions for the general optimal control problem will be given ignoring the path constraints.

The derivation involves constructing an augmented performance index:

Ja = Φ + ν^T ψ + ∫_{t0}^{tf} [L + λ^T (f − ẋ)] dt

where ν and λ(t) are Lagrange multipliers for the boundary conditions and the differential constraints, respectively. λ(t) is also known as the co-state vector. The first variation of Ja, δJa, needs to be calculated. We assume that u(t), x(t), x(t0), x(tf), t0, tf, λ(t), λ(t0), λ(tf), and ν are free to vary.
Define the Hamiltonian function as follows:

H[x(t), u(t), λ(t), t] = L[x(t), u(t), t] + λ(t)^T f[x(t), u(t), t]

After some work, the variation δJa can be expressed as follows:

δJa = δΦ + δν^T ψ + ν^T δψ + δ ∫_{t0}^{tf} [H − λ^T ẋ] dt

where:

δΦ = [∂Φ/∂x(t0)] δx0 + [∂Φ/∂t0] δt0 + [∂Φ/∂x(tf)] δxf + [∂Φ/∂tf] δtf

δψ = [∂ψ/∂x(t0)] δx0 + [∂ψ/∂t0] δt0 + [∂ψ/∂x(tf)] δxf + [∂ψ/∂tf] δtf

δx0 = δx(t0) + ẋ(t0) δt0,   δxf = δx(tf) + ẋ(tf) δtf
and

δ ∫_{t0}^{tf} [H − λ^T ẋ] dt = λ^T(t0) δx0 − λ^T(tf) δxf + H(tf) δtf − H(t0) δt0
    + ∫_{t0}^{tf} [ (∂H/∂x + λ̇^T) δx + (∂H/∂λ − ẋ^T) δλ + (∂H/∂u) δu ] dt
Assuming that the initial and terminal times are free, and that the control is unconstrained, setting δJa = 0 results in the following first order necessary optimality conditions:

ẋ = [∂H/∂λ]^T = f[x(t), u(t), t]   (state dynamics)
λ̇ = −[∂H/∂x]^T   (costate dynamics)
∂H/∂u = 0   (stationarity condition)
In addition we have the following conditions at the boundaries:

ψ[x(t0), t0, x(tf), tf] = 0

[λ(t0) + [∂Φ/∂x(t0)]^T + [∂ψ/∂x(t0)]^T ν]^T δx0 + [−H(t0) + ∂Φ/∂t0 + ν^T ∂ψ/∂t0] δt0 = 0

[−λ(tf) + [∂Φ/∂x(tf)]^T + [∂ψ/∂x(tf)]^T ν]^T δxf + [H(tf) + ∂Φ/∂tf + ν^T ∂ψ/∂tf] δtf = 0

Note that δx0 depends on δt0, so in the second equation above the terms that multiply the variations δx0 and δt0 cannot be made zero independently. Something similar happens in the third equation.
The conditions above are necessary conditions, so solutions (x∗(t), λ∗(t), u∗(t), ν∗) that satisfy them are extremal solutions.

They are only candidate solutions.

Each extremal solution can be a minimum, a maximum or a saddle point.
Note the dynamics of the state and co-state system:

ẋ = ∇λ H
λ̇ = −∇x H

This is a Hamiltonian system.

Boundary conditions are given at the beginning and at the end of the interval, so this is a two point boundary value problem.

If H is not an explicit function of time, H = constant along an extremal.

If the Hamiltonian is not an explicit function of time and tf is free, then H(t) = 0.
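As a tiny illustration of such a two point boundary value problem, consider the hand-built example (not from the slides) of minimising J = ∫_0^1 u^2 dt subject to ẋ = u, x(0) = 0, x(1) = 1. Here H = u^2 + λu, so ∂H/∂u = 0 gives u = −λ/2 and the costate equation is λ̇ = 0. A single-shooting sketch in plain Python, adjusting the unknown initial costate until x(1) = 1 (the analytic answer is λ = −2, u = 1, x = t):

```python
def shoot(lam0, N=1000):
    """Integrate xdot = u = -lam/2 from t = 0 to 1; return x(1)."""
    h = 1.0 / N
    x, lam = 0.0, lam0
    for _ in range(N):
        u = -lam / 2.0
        x += h * u          # forward Euler on the state
        # lam is constant: lamdot = -dH/dx = 0
    return x

# Secant iteration on the unknown initial costate lam(0)
target = 1.0
l0, l1 = 0.0, -1.0
f0, f1 = shoot(l0) - target, shoot(l1) - target
for _ in range(50):
    if abs(f1) < 1e-12:
        break
    l2 = l1 - f1 * (l1 - l0) / (f1 - f0)
    l0, f0 = l1, f1
    l1 = l2
    f1 = shoot(l1) - target

print(l1, shoot(l1))  # -> lam(0) = -2.0, x(1) = 1.0
```

Since x(1) is linear in λ(0) here, the secant iteration converges in a single correction; realistic problems require a proper ODE integrator and a nonlinear root finder.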
Example: The brachistochrone problem
This problem was solved by Bernoulli in 1696. A mass m moves in a constant force field of magnitude g, starting at rest at the origin (x, y) = (x0, y0) at time 0. It is desired to find a path of minimum time to a specified final point (x1, y1).
The equations of motion are:

ẋ = V cos θ
ẏ = V sin θ

where V(t) = √(2gy). The performance index (we are after minimum time) can be expressed as follows:

J = ∫_0^{tf} 1 dt = tf

and the boundary conditions on the states are (x(0), y(0)) = (x0, y0), (x(tf), y(tf)) = (x1, y1).
In this case, the Hamiltonian function is:

H = 1 + λx(t) V(t) cos θ(t) + λy(t) V(t) sin θ(t)

The costate equations are:

λ̇x = 0
λ̇y = −(g/V)(λx cos θ + λy sin θ)

and the stationarity condition is:

0 = −λx V sin θ + λy V cos θ  ⟹  tan θ = λy / λx
We need to satisfy the boundary conditions:

ψ = [x(0) − x0,  y(0) − y0,  x(tf) − x1,  y(tf) − y1]^T = [0, 0, 0, 0]^T

Also, since the Hamiltonian is not an explicit function of time and tf is free, we have that H(t) = 0. Using this together with the stationarity condition, we can write:

λx = −cos θ / V
λy = −sin θ / V
After some work with these equations, we can write:

y = [y1 / cos^2 θ(tf)] cos^2 θ

x = x1 + [y1 / (2 cos^2 θ(tf))] [2(θ(tf) − θ) + sin 2θ(tf) − sin 2θ]

tf − t = √(2y1/g) (θ − θ(tf)) / cos θ(tf)

Given the current position (x, y), these three equations can be used to solve for θ(t) and θ(tf), and hence the time to go tf − t.
Using the above results, it is possible to find the states:

y(t) = a (1 − cos φ(t))
x(t) = a (φ(t) − sin φ(t))

where

φ = π − 2θ
a = y1 / (1 − cos φ(tf))

The solution describes a cycloid in the x-y plane that passes through (x1, y1).
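This can be checked numerically for the data used below, (x1, y1) = (2, 2) with a start at the origin: solving the endpoint condition for φ(tf) fixes a, the descent time along the cycloid is then φ(tf)·√(a/g) (a standard property of this parametrisation), and it should beat a straight ramp to the same point. A plain Python sketch (bisection; the straight-line comparison is an added illustration, not from the slides):

```python
import math

g, x1, y1 = 9.8, 2.0, 2.0

# Endpoint condition: x1/y1 = (phi - sin phi) / (1 - cos phi); the left side
# is monotonically increasing on (0, 2*pi), so bisection applies.
def f(phi):
    return (phi - math.sin(phi)) / (1.0 - math.cos(phi)) - x1 / y1

lo, hi = 0.1, 2.0 * math.pi - 0.1
for _ in range(200):
    mid = 0.5 * (lo + hi)
    if f(lo) * f(mid) <= 0:
        hi = mid
    else:
        lo = mid
phi_f = 0.5 * (lo + hi)
a = y1 / (1.0 - math.cos(phi_f))

t_cycloid = phi_f * math.sqrt(a / g)                         # time along cycloid
t_line = 2.0 * math.hypot(x1, y1) / math.sqrt(2.0 * g * y1)  # straight ramp

print(t_cycloid, t_line)  # the cycloid is faster: ~0.82 s vs ~0.90 s
```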
The figures illustrate numerical results with (x(0), y(0)) = (0, 0), (x(tf), y(tf)) = (2, 2) and g = 9.8.

[Figure: Brachistochrone problem: x-y plot of the optimal path]

[Figure: Brachistochrone problem: control vs time (s)]
Pontryagin’s minimum principle
Assume that the optimal control is u∗, and that the states and co-states are fixed at their optimal trajectories x∗(t), λ∗(t). We have that for all u close to u∗:

J(u∗) ≤ J(u)  ⟹  J(u) − J(u∗) ≥ 0

Then we have that J(u) = J(u∗) + δJ[u∗, δu].
From the variation of the augmented functional δJa, we have in this case:

δJ[u∗, δu] = ∫_{t0}^{tf} [∂H/∂u] δu dt
But, using a Taylor series, we know that:

H[x∗, u∗ + δu, λ∗] = H[x∗, u∗, λ∗] + [∂H/∂u]|_{u∗} δu + …

So that, to first order:

δJ[u∗, δu] = ∫_{t0}^{tf} (H[x∗, u∗ + δu, λ∗] − H[x∗, u∗, λ∗]) dt ≥ 0
This leads to:

H[x∗, u∗ + δu, λ∗] − H[x∗, u∗, λ∗] ≥ 0

for all admissible variations in u. This in turn leads to the minimum principle:

u∗ = arg min_{u ∈ U} H[x∗, u, λ∗]

where U is the admissible control set.
Assuming that the admissible control set is closed then, if the control lies in the interior of the set, the minimum principle implies that:

∂H/∂u = 0

Otherwise, if the optimal control is at the boundary of the admissible control set, the minimum principle requires that the optimal control be chosen to minimise the Hamiltonian:

u∗ = arg min_{u ∈ U} H[x∗, u, λ∗]
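Numerically, the minimum principle can be applied pointwise: for fixed x∗ and λ∗, minimise H over a discretised admissible set. A brute-force sketch in plain Python (the Hamiltonian H = 1 + λ1 x2 + λ2 u and the set U = [−1, 1] anticipate the example that follows; since H is linear in u, the minimiser lands on the boundary at −sign(λ2)):

```python
# Pointwise minimisation of the Hamiltonian over a discretised set U = [-1, 1].

def H(u, x2, lam1, lam2):
    return 1.0 + lam1 * x2 + lam2 * u

def u_star(x2, lam1, lam2, n=2001):
    grid = [-1.0 + 2.0 * i / (n - 1) for i in range(n)]
    return min(grid, key=lambda u: H(u, x2, lam1, lam2))

print(u_star(0.5, 1.0, -0.3))  # -> 1.0  (lam2 < 0)
print(u_star(0.5, 1.0, 0.7))   # -> -1.0 (lam2 > 0)
```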
Example: Minimum time control with constrained input
Consider the following optimal control problem. Choose tf and u(t) ∈ U = [−1, 1], t ∈ [t0, tf], to minimise:

J = tf = ∫_0^{tf} 1 dt

subject to

ẋ1 = x2
ẋ2 = u

with the boundary conditions:

x1(0) = x10, x1(tf) = 0
x2(0) = x20, x2(tf) = 0
In this case, the Hamiltonian is:

H = 1 + λ1 x2 + λ2 u

Calculating ∂H/∂u gives no information on the control, so we need to apply the minimum principle:

u∗ = arg min_{u ∈ [−1,1]} H[x∗, u, λ∗] = +1 if λ2 < 0, −1 if λ2 > 0, undefined if λ2 = 0

This form of control is called bang-bang. The control switches value when the co-state λ2 changes sign. The actual form of the solution depends on the values of the initial states.
The costate equations are:

λ̇1 = 0  ⟹  λ1(t) = λ10 = constant
λ̇2 = −λ1  ⟹  λ2(t) = λ20 − λ10 t

There are four cases:

1. λ2 is always positive.
2. λ2 is always negative.
3. λ2 switches from positive to negative.
4. λ2 switches from negative to positive.

Since λ2 is a linear function of time, there will be at most one switch in the solution. The minimum principle helps us to understand the nature of the solution.
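For instance (a hand-worked case, not from the slides), starting from x1(0) = 1, x2(0) = 0, the time-optimal control is u = −1 on [0, 1) followed by u = +1 on [1, 2): a single switch reaching the origin at tf = 2. A quick forward-Euler check in plain Python:

```python
# Simulate xdot1 = x2, xdot2 = u with the one-switch bang-bang control
# u = -1 on [0, 1), u = +1 on [1, 2), from (x1, x2) = (1, 0).

dt = 1e-4
N = int(2.0 / dt)
x1, x2 = 1.0, 0.0
for k in range(N):
    t = k * dt
    u = -1.0 if t < 1.0 else 1.0   # switch at t = 1
    x1 += dt * x2
    x2 += dt * u

print(x1, x2)  # both ~ 0: the origin is reached at tf = 2
```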
Equality path constraints
Let us assume that we impose equality path constraints of the form:

c[x(t), u(t), t] = 0

This type of constraint may be part of the model of the system.

The treatment of path constraints depends on the Jacobian matrix of the constraint function.
If the matrix ∂c/∂u is full rank, then the system of differential and algebraic equations formed by the state dynamics and the path constraints is a DAE of index 1. In this case, the Hamiltonian can be replaced by:

H = L + λ^T f + μ^T c

which results in new optimality conditions.
If the matrix ∂c/∂u is rank deficient, the index of the DAE is higher than one.

This may require repeatedly differentiating c with respect to time until a derivative of c is obtained whose Jacobian with respect to u has full rank, a procedure known as index reduction.

This may be difficult to perform and may be prone to numerical errors.
Singular arcs
In some optimal control problems, extremal arcs satisfying the stationarity condition ∂H/∂u = 0 occur where the Hessian matrix Huu is singular.

These are called singular arcs.

Additional tests are required to verify whether a singular arc is optimising.
Singular arcs
A particular case of practical relevance occurs when the Hamiltonian function is linear in at least one of the control variables.

In such cases, the control is not determined in terms of the state and co-state by the stationarity condition ∂H/∂u = 0.

Instead, the control is determined by repeatedly differentiating ∂H/∂u with respect to time until it is possible to obtain the control by setting the k-th derivative to zero.
Singular arcs
In the case of a single control u, once the control is obtained by setting the k-th time derivative of ∂H/∂u to zero, then additional conditions known as the generalized Legendre-Clebsch conditions must be checked:

(−1)^k (∂/∂u) [ d^(2k)/dt^(2k) (∂H/∂u) ] ≥ 0, k = 0, 1, 2, . . .

If the above conditions are satisfied, then the singular arc will be optimal.

The presence of singular arcs may cause difficulties for computational optimal control methods in finding accurate solutions if the appropriate conditions are not enforced a priori.
Inequality path constraints
These are constraints of the form:
hL ≤ h[x(t),u(t), t] ≤ hU
with hL,i ≠ hU,i.

Inequality constraints may be active (h at one of the limits) or inactive (h within the limits) at any one time.

As a result, the time domain t ∈ [t0, tf ] may be partitioned into constrained and unconstrained arcs.

Different optimality conditions apply during constrained and unconstrained arcs.
Inequality path constraints
Three major complications:
The number of constrained arcs is not necessarily known a priori.

The location of the junction points between constrained and unconstrained arcs is not known a priori.

At the junction points it is possible that both the controls u and the costates λ are discontinuous.

Additional boundary conditions need to be satisfied at the junction points (⟹ multipoint boundary value problem).
Summary
Optimal control theory for continuous time systems is based on calculus of variations.

Using this approach, we formulate the first order optimality conditions of the problem.

This leads to a Hamiltonian boundary value problem.
Summary
Even if we use numerical methods, a sound understanding of control theory is important.

It provides a systematic way of formulating the problem.
It gives us ways of analysing the numerical solution.
It gives us conditions that we can incorporate into the formulation of the numerical problem.
Further reading
D.E. Kirk (1970). Optimal Control Theory. Dover.
E.K. Chong and S.H. Zak (2008). An Introduction to Optimization. Wiley.

F.L. Lewis and V.L. Syrmos (1995). Optimal Control. Wiley-Interscience.

J.T. Betts (2010). Practical Methods for Optimal Control and Estimation Using Nonlinear Programming. SIAM.

A.E. Bryson and Y.C. Ho (1975). Applied Optimal Control. Halsted Press.
Optimal Control and Applications
V. M. Becerra - Session 2 - 2nd AstroNet-II Summer School
Optimal Control and Applications
Session 2
2nd AstroNet-II Summer School
Victor M. Becerra
School of Systems Engineering
University of Reading
5-6 June 2013
Contents of Session 2
Direct collocation methods for optimal control
Local discretisations
Global (pseudospectral) discretisations
Practical aspects
Multi-phase problems
Efficient sparse differentiation
Scaling
Estimating the discretisation accuracy
Mesh refinement
Direct Collocation methods for optimal control
This session introduces a family of methods for the numerical solution of optimal control problems known as direct collocation methods.

These methods involve the discretisation of the ODEs of the problem.

The discretised ODEs become a set of algebraic equality constraints.

The problem time interval is divided into segments, and the decision variables include the values of controls and state variables at the grid points.
Direct Collocation methods for optimal control
ODE discretisation can be done, for example, using trapezoidal, Hermite-Simpson, or pseudospectral approximations.

This involves defining a grid of points covering the time interval [t0, tf ], t0 = t1 < t2 < · · · < tN = tf .
Direct Collocation methods for optimal control
The nonlinear optimization problems that arise from direct collocation methods may be very large.

For instance, Betts (2003) describes a spacecraft low thrust trajectory design with 211,031 variables and 146,285 constraints.
Direct Collocation methods for optimal control
It is interesting that such large NLP problems may be easier to solve than boundary-value problems.

The reason for the relative ease of computation of direct collocation methods is that the associated NLP problem is often quite sparse, and efficient software exists for its solution.
Direct Collocation methods for optimal control
With direct collocation methods there is no need to explicitly derive the necessary conditions of the continuous problem.

For a given problem, the region of attraction of the solution is usually larger with direct methods than with indirect methods.

Moreover, direct collocation methods do not require an a priori specification of the sequence of constrained arcs in problems with inequality path constraints.
Direct Collocation methods for optimal control
Hence the range of problems that can be solved with direct methods is larger than the range of problems that can be solved via indirect methods.

Direct methods can be used to find a starting guess for an indirect method to improve the accuracy of the solution.
Direct Collocation methods for optimal control
Consider the following optimal control problem:

Problem 1. Find all the free functions and variables (the controls u(t), t ∈ [t0, tf ], the states x(t), t ∈ [t0, tf ], the parameter vector p, and the times t0 and tf ) to minimise the following performance index:

J = ϕ[x(t0), t0, x(tf ), tf ] + ∫_{t0}^{tf} L[x(t),u(t), t] dt

subject to:

ẋ(t) = f[x(t),u(t),p, t], t ∈ [t0, tf ],
Direct Collocation methods for optimal control
Boundary conditions (often called event constraints):
ψL ≤ ψ[x(t0),u(t0), x(tf ),u(tf ),p, t0, tf ] ≤ ψU .
Inequality path constraints:
hL ≤ h[x(t),u(t),p, t] ≤ hU , t ∈ [t0, tf ].
Direct Collocation methods for optimal control
Bound constraints on controls, states and static parameters:
uL ≤ u(t) ≤ uU , t ∈ [t0, tf ],
xL ≤ x(t) ≤ xU , t ∈ [t0, tf ],
pL ≤ p ≤ pU .
The following constraints allow for cases where the initial and/or final times are not fixed but are to be chosen as part of the solution to the problem:

t0,L ≤ t0 ≤ t0,U ,

tf,L ≤ tf ≤ tf,U ,

tf − t0 ≥ 0.
Direct Collocation methods for optimal control
Consider the continuous domain [t0, tf ] ⊂ R, and break this domain using intermediate nodes as follows:

t0 < t1 < · · · < tN = tf

such that the domain has been divided into the N segments [tk , tk+1], k = 0, . . . ,N − 1.
[Figure: a time grid t0, t1, t2, . . . , tN−1, tN = tf with the values x(tk) and u(tk) marked at each node.]

The figure illustrates the idea that the control u and the state variables x are discretised over the resulting grid.
Direct Collocation methods for optimal control
By using direct collocation methods, it is possible to discretise Problem 1 to obtain a nonlinear programming problem.

Problem 2. Choose y ∈ R^ny to minimise:

F(y)

subject to:

0 ≤ Z(y) ≤ 0,
HL ≤ H(y) ≤ HU ,
ΨL ≤ Ψ(y) ≤ ΨU ,
yL ≤ y ≤ yU
Direct Collocation methods for optimal control
where
F is simply a mapping resulting from the evaluation of the performance index,

Z is the mapping that arises from the discretisation of the differential constraints over the grid points,

H arises from the evaluation of the path constraints at each of the grid points, with associated bounds HL, HU ,

Ψ is simply a mapping resulting from the evaluation of the event constraints.
Direct Collocation methods for optimal control
The form of Problem 2 is the same regardless of the method used to discretize the dynamics.

What may change depending on the discretisation method used is the decision vector y and the differential defect constraints.
Local discretisations
Suppose that the solution of the ODE is approximated by a polynomial x̃(t) of degree M over each interval t ∈ [tk , tk+1], k = 0, . . . ,N − 1:

x̃(t) = a0(k) + a1(k)(t − tk) + · · · + aM(k)(t − tk)^M

where the polynomial coefficients a0(k), a1(k), . . . , aM(k) are chosen such that the approximation matches the function at the beginning and at the end of the interval:

x̃(tk) = x(tk)
x̃(tk+1) = x(tk+1)
Local discretisations
and the time derivative of the approximation matches the time derivative of the function at tk and tk+1:

dx̃(tk)/dt = f[x(tk),u(tk),p, tk ]
dx̃(tk+1)/dt = f[x(tk+1),u(tk+1),p, tk+1]

The last two equations are called collocation conditions.
Local discretisations
[Figure: the interval [tk , tk+1], with x(tk), u(tk) at tk and x(tk+1), u(tk+1) at tk+1, showing the polynomial approximation against the true solution.]

The figure illustrates the collocation concept. The solution of the ODE is approximated by a polynomial of degree M over each interval, such that the approximation matches at the beginning and the end of the interval.
Local discretisations
Many of the local discretisations used in practice for solving optimal control problems belong to the class of so-called classical Runge-Kutta methods. A general K-stage Runge-Kutta method discretises the differential equation as follows:

x_{k+1} = x_k + h_k ∑_{i=1}^{K} b_i f_{ki}

where

x_{ki} = x_k + h_k ∑_{l=1}^{K} a_{il} f_{kl}
f_{ki} = f[x_{ki}, u_{ki}, p, t_{ki}]

with u_{ki} = u(t_{ki}), t_{ki} = t_k + h_k ρ_i , and ρ_i ∈ [0, 1].
Trapezoidal method
The trapezoidal method is based on a quadratic interpolation polynomial:

x̃(t) = a0(k) + a1(k)(t − tk) + a2(k)(t − tk)²

Let fk ≡ f[x(tk),u(tk),p, tk ] and fk+1 ≡ f[x(tk+1),u(tk+1),p, tk+1].

The compressed form of the trapezoidal method is as follows:

ζ(tk) = x(tk+1) − x(tk) − (hk/2)(fk + fk+1) = 0

where hk = tk+1 − tk , and ζ(tk) = 0 is the differential defect constraint at node tk associated with the trapezoidal method. Allowing k = 0, . . . ,N − 1, this discretisation generates N·nx differential defect constraints.
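The defect formula above can be evaluated directly. A minimal Python sketch for a scalar ODE (the names f, xs, us, ts are illustrative, not from the slides):

```python
def trapezoidal_defects(f, xs, us, ts):
    """Defects zeta_k = x_{k+1} - x_k - h_k/2 (f_k + f_{k+1}) for xdot = f(x, u, t)."""
    defects = []
    for k in range(len(ts) - 1):
        h = ts[k + 1] - ts[k]
        fk = f(xs[k], us[k], ts[k])
        fk1 = f(xs[k + 1], us[k + 1], ts[k + 1])
        defects.append(xs[k + 1] - xs[k] - 0.5 * h * (fk + fk1))
    return defects

# For xdot = u with u = 1 and an exact linear trajectory, all defects vanish.
ts = [0.0, 0.5, 1.0]
xs = [0.0, 0.5, 1.0]
us = [1.0, 1.0, 1.0]
print(trapezoidal_defects(lambda x, u, t: u, xs, us, ts))
```

In an NLP formulation these defects become equality constraints on the decision variables xs and us.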
Trapezoidal method
Note that the trapezoidal method is a 2-stage Runge-Kutta method with a11 = a12 = 0, a21 = a22 = 1/2, b1 = b2 = 1/2, ρ1 = 0, ρ2 = 1.
Trapezoidal method
Using direct collocation with the compressed trapezoidal discretisation, the decision vector for single-phase problems is given by

y = [vec(U); vec(X); p; t0; tf ]

where vec(U) represents the vectorisation of matrix U, that is, an operator that stacks all columns of the matrix, one below the other, in a single column vector.
Trapezoidal method
The expressions for matrices U and X are as follows:

U = [u(t0) u(t1) . . . u(tN−1)] ∈ R^{nu×N}

X = [x(t0) x(t1) . . . x(tN)] ∈ R^{nx×(N+1)}
Hermite-Simpson method
The Hermite-Simpson method is based on a cubic interpolating polynomial:

x̃(t) = a0(k) + a1(k)(t − tk) + a2(k)(t − tk)² + a3(k)(t − tk)³

Let t̄k = (tk + tk+1)/2, and ū(t̄k) = (u(tk) + u(tk+1))/2.
Hermite-Simpson method
The compressed Hermite-Simpson discretisation is as follows:

x̄(t̄k) = (1/2)[x(tk) + x(tk+1)] + (hk/8)[fk − fk+1]

f̄k = f[x̄(t̄k), ū(t̄k),p, t̄k ]

ζ(tk) = x(tk+1) − x(tk) − (hk/6)[fk + 4 f̄k + fk+1] = 0

where ζ(tk) = 0 is the differential defect constraint at node tk associated with the Hermite-Simpson method.
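A sketch of these defects for a scalar ODE (the names f, xs, us, u_mid, ts are illustrative; the midpoint controls are passed separately):

```python
def hermite_simpson_defects(f, xs, us, u_mid, ts):
    """Compressed Hermite-Simpson defects zeta_k for a scalar ODE xdot = f(x, u, t)."""
    defects = []
    for k in range(len(ts) - 1):
        h = ts[k + 1] - ts[k]
        tm = 0.5 * (ts[k] + ts[k + 1])
        fk = f(xs[k], us[k], ts[k])
        fk1 = f(xs[k + 1], us[k + 1], ts[k + 1])
        # Hermite interpolant evaluated at the interval midpoint:
        xm = 0.5 * (xs[k] + xs[k + 1]) + (h / 8.0) * (fk - fk1)
        fm = f(xm, u_mid[k], tm)
        # Simpson-rule defect:
        defects.append(xs[k + 1] - xs[k] - (h / 6.0) * (fk + 4.0 * fm + fk1))
    return defects

# xdot = t has exact solution x = t^2/2; the defects vanish on any grid.
ts = [0.0, 0.5, 1.0]
xs = [t * t / 2.0 for t in ts]
zeros = hermite_simpson_defects(lambda x, u, t: t, xs, [0.0] * 3, [0.0] * 2, ts)
print(max(abs(z) for z in zeros))  # close to machine precision
```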
Hermite-Simpson method
Allowing k = 0, . . . ,N − 1, the compressed Hermite-Simpson discretisation generates N·nx differential defect constraints.

Using direct collocation with the compressed Hermite-Simpson method, the decision vector for single-phase problems is given by

y = [vec(U); vec(Ū); vec(X); p; t0; tf ]

where

Ū = [ū(t̄0) ū(t̄1) . . . ū(t̄N−1)] ∈ R^{nu×N}
Hermite-Simpson method
Note that the Hermite-Simpson method is a 3-stage Runge-Kutta method with a11 = a12 = a13 = 0, a21 = 5/24, a22 = 1/3, a23 = −1/24, a31 = 1/6, a32 = 2/3, a33 = 1/6, b1 = 1/6, b2 = 2/3, b3 = 1/6, ρ1 = 0, ρ2 = 1/2, ρ3 = 1.
Local convergence
The error in variable zk ≡ z(tk) with respect to a solution z*(t) is of order n if there exist h̄, c > 0 such that

max_k ||zk − z*(tk)|| ≤ c h^n, ∀ h < h̄

It is well known that the convergence of the trapezoidal method for integrating differential equations is of order 2, while the convergence of the Hermite-Simpson method for integrating differential equations is of order 4.

Under certain conditions, these properties carry over to the optimal control case, but a detailed analysis is beyond the scope of these lectures.
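The order-2 claim for the trapezoidal rule can be checked empirically on ẋ = x, x(0) = 1, integrated to t = 1 (a sketch; for this linear dynamics the implicit trapezoidal update has a closed form):

```python
import math

def trapezoid_exp(N):
    """Integrate xdot = x on [0, 1] with N trapezoidal steps.
    The implicit update x_{k+1} = x_k + h/2 (x_k + x_{k+1}) solves in closed form."""
    h = 1.0 / N
    x = 1.0
    for _ in range(N):
        x *= (1.0 + h / 2.0) / (1.0 - h / 2.0)
    return x

e1 = abs(trapezoid_exp(10) - math.e)
e2 = abs(trapezoid_exp(20) - math.e)
print(e2 / e1)   # close to 0.25: halving h divides the error by about 2^2
```

Repeating the experiment with the Hermite-Simpson rule would show the error ratio approaching 1/16, consistent with order 4.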
Global discretisations: Pseudospectral methods
Pseudospectral methods (PS) were originally developed for the solution of partial differential equations that arise in fluid mechanics.

However, PS techniques have also been used as computational methods for solving optimal control problems.

While Runge-Kutta methods approximate the derivatives of a function using local information, PS methods are, in contrast, global.

PS methods combine global polynomials with orthogonally collocated points.
Global discretisations: Pseudospectral methods
A function is approximated as a weighted sum of smooth basis functions, which are often chosen to be Legendre or Chebyshev polynomials.

One of the main appeals of PS methods is their exponential (or spectral) rate of convergence, which is faster than any polynomial rate.
Global discretisations: Pseudospectral methods
Another advantage is that with relatively coarse grids it is possible to achieve good accuracy.

Approximation theory shows that PS methods are well suited for approximating smooth functions, integrations, and differentiations, all of which are relevant to optimal control problems.
Global discretisations: Pseudospectral methods
PS methods have some disadvantages:

The resulting NLP is not as sparse as NLP problems derived from Runge-Kutta discretizations.

This results in less efficient (slower) NLP solutions and possibly difficulties in finding a solution.

They may not work very well if the solution of the optimal control problem is not sufficiently smooth, or if its representation requires many grid points.
Approximating a continuous function using Legendrepolynomials
Let LN(τ) denote the Legendre polynomial of order N, which may be generated from:

LN(τ) = [1/(2^N N!)] d^N/dτ^N (τ² − 1)^N

Examples of Legendre polynomials are:

L0(τ) = 1
L1(τ) = τ
L2(τ) = (1/2)(3τ² − 1)
L3(τ) = (1/2)(5τ³ − 3τ)
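In practice, LN(τ) is evaluated numerically with Bonnet's three-term recurrence rather than through the derivative formula (a minimal sketch):

```python
def legendre(N, tau):
    """Evaluate L_N(tau) via Bonnet's recurrence:
    (n+1) L_{n+1}(tau) = (2n+1) tau L_n(tau) - n L_{n-1}(tau)."""
    if N == 0:
        return 1.0
    p_prev, p = 1.0, tau
    for n in range(1, N):
        p_prev, p = p, ((2 * n + 1) * tau * p - n * p_prev) / (n + 1)
    return p

print(legendre(2, 0.5))   # (1/2)(3*0.25 - 1) = -0.125
print(legendre(3, 0.5))   # (1/2)(5*0.125 - 1.5) = -0.4375
```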
Approximating a continuous function using Legendrepolynomials
The figure illustrates the Legendre polynomials LN(τ) for N = 0, . . . , 3.

[Figure: the polynomials LN(τ), N = 0, . . . , 3, plotted over τ ∈ [−1, 1].]
Approximating a continuous function using Legendrepolynomials
Let τk , k = 0, . . . ,N be the Legendre-Gauss-Lobatto (LGL) nodes, which are defined as τ0 = −1, τN = 1, and τk , k = 1, 2, . . . ,N − 1, the roots of L′N(τ) in the interval (−1, 1).

There are no explicit formulas to compute the roots of L′N(τ), but they can be computed using known numerical algorithms.
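One such algorithm is Newton iteration on L′N(τ), using the Legendre differential equation to supply the second derivative; the Chebyshev-Lobatto initial guesses below are a common choice, assumed here rather than taken from the slides:

```python
import math

def lgl_nodes(N, iters=50):
    """LGL nodes: endpoints -1, 1 plus the interior roots of dL_N/dtau, by Newton."""
    def dL_d2L(tau):
        # Bonnet recurrence gives L_N (p) and L_{N-1} (p_prev):
        p_prev, p = 1.0, tau
        for k in range(1, N):
            p_prev, p = p, ((2 * k + 1) * tau * p - k * p_prev) / (k + 1)
        dp = N * (tau * p - p_prev) / (tau * tau - 1.0)   # (tau^2-1) L' = N (tau L - L_{N-1})
        d2p = (2.0 * tau * dp - N * (N + 1) * p) / (1.0 - tau * tau)  # Legendre ODE
        return dp, d2p
    nodes = [-1.0]
    for j in range(1, N):
        tau = -math.cos(math.pi * j / N)   # Chebyshev-Lobatto initial guess
        for _ in range(iters):
            dp, d2p = dL_d2L(tau)
            tau -= dp / d2p
        nodes.append(tau)
    nodes.append(1.0)
    return nodes

print(lgl_nodes(3))   # interior nodes at -1/sqrt(5) and +1/sqrt(5)
```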
Approximating a continuous function using Legendrepolynomials
For example, the figure shows the LGL nodes τk for N = 20.

[Figure: the LGL nodes for N = 20 on [−1, 1], clustered towards the endpoints.]
Approximating a continuous function using Legendrepolynomials
Given any real-valued function f(τ) : [−1, 1] → R, it can be approximated by the Legendre pseudospectral method:

f(τ) ≈ f̃(τ) = ∑_{k=0}^{N} f(τk) φk(τ)    (1)

where the Lagrange interpolating polynomial φk(τ) is defined by:

φk(τ) = [1/(N(N + 1) LN(τk))] (τ² − 1) L′N(τ)/(τ − τk)    (2)

It should be noted that φk(τj) = 1 if k = j and φk(τj) = 0 if k ≠ j, so that:

f̃(τk) = f(τk)    (3)
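For illustration, the interpolant (1) can be evaluated with the equivalent product form of the Lagrange basis (a sketch; the nodes and samples below are made up):

```python
def lagrange_interpolate(nodes, values, tau):
    """Evaluate f~(tau) = sum_k values[k] * phi_k(tau), with phi_k built from the
    product form of the Lagrange basis (phi_k(tau_j) = delta_kj)."""
    total = 0.0
    for k, tk in enumerate(nodes):
        phi = 1.0
        for j, tj in enumerate(nodes):
            if j != k:
                phi *= (tau - tj) / (tk - tj)
        total += values[k] * phi
    return total

nodes = [-1.0, 0.0, 1.0]     # LGL nodes for N = 2
values = [1.0, 0.0, 1.0]     # samples of f(tau) = tau^2
print(lagrange_interpolate(nodes, values, 0.5))   # 0.25: exact for degree <= N
```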
Approximating a continuous function using Legendrepolynomials
Regarding the accuracy and error estimates of the Legendre pseudospectral approximation, it is well known that for smooth functions f(τ), the rate of convergence of f̃(τ) to f(τ) is faster than any power of 1/N.

The convergence of pseudospectral approximations has been analysed by Canuto et al (2006).
Approximating a continuous function using Legendrepolynomials
The figure shows the degree N interpolation of the function f(τ) = 1/(1 + τ + 15τ²) in (N + 1) equispaced and LGL points for N = 20. With increasing N, the errors increase exponentially in the equispaced case (this is known as the Runge phenomenon), whereas in the LGL case they decrease exponentially.

[Figure, two panels: interpolation at equispaced points (max error = 18.8264) and at LGL points (max error = 0.0089648).]
Differentiation with Legendre polynomials
The derivatives of f̃(τ) in terms of f(τ) at the LGL points τk can be obtained by differentiating Eqn. (1). The result can be expressed as follows:

ḟ(τk) ≈ f̃′(τk) = ∑_{i=0}^{N} Dki f(τi)

where

Dki = LN(τk)/[LN(τi)(τk − τi)]   if k ≠ i
Dki = −N(N + 1)/4               if k = i = 0
Dki = N(N + 1)/4                if k = i = N
Dki = 0                         otherwise    (4)

D is known as the differentiation matrix.
Differentiation with Legendre polynomials
For example, this is the Legendre differentiation matrix for N = 5:

D = [  7.5000  −10.1414    4.0362   −2.2447    1.3499   −0.5000
       1.7864    0.0000   −2.5234    1.1528   −0.6535    0.2378
      −0.4850    1.7213    0.0000   −1.7530    0.7864   −0.2697
       0.2697   −0.7864    1.7530    0.0000   −1.7213    0.4850
      −0.2378    0.6535   −1.1528    2.5234    0.0000   −1.7864
       0.5000   −1.3499    2.2447   −4.0362   10.1414   −7.5000 ]
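A sketch of building D from Eq. (4) with the nodes in ascending order (the signs of the corner entries depend on the node ordering convention, so the entries may appear reflected relative to a printed example):

```python
def legendre_eval(n, tau):
    """L_n(tau) by Bonnet's recurrence."""
    if n == 0:
        return 1.0
    p_prev, p = 1.0, tau
    for k in range(1, n):
        p_prev, p = p, ((2 * k + 1) * tau * p - k * p_prev) / (k + 1)
    return p

def differentiation_matrix(nodes):
    """Legendre differentiation matrix of Eq. (4) on ascending LGL nodes."""
    N = len(nodes) - 1
    D = [[0.0] * (N + 1) for _ in range(N + 1)]
    for k in range(N + 1):
        for i in range(N + 1):
            if k != i:
                D[k][i] = (legendre_eval(N, nodes[k]) / legendre_eval(N, nodes[i])) / (nodes[k] - nodes[i])
    D[0][0] = -N * (N + 1) / 4.0
    D[N][N] = N * (N + 1) / 4.0
    return D

# LGL nodes for N = 2 are {-1, 0, 1}; D differentiates polynomials of degree <= N exactly.
D = differentiation_matrix([-1.0, 0.0, 1.0])
f = [-1.0, 0.0, 1.0]   # samples of f(tau) = tau
print([sum(D[k][i] * f[i] for i in range(3)) for k in range(3)])   # [1.0, 1.0, 1.0]
```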
Differentiation with Legendre polynomials
Legendre differentiation of f(t) = e^t sin(5t) for N = 10 and N = 20. Note the vertical scales.

[Figure, four panels: f(t) and the error in f′(t) for N = 10 (errors of order 10⁻²) and for N = 20 (errors of order 10⁻⁹).]
Numerical integration with Legendre polynomials
Note that if h(τ) is a polynomial of degree ≤ 2N − 1, its integral over τ ∈ [−1, 1] can be computed exactly as follows:

∫_{−1}^{1} h(τ) dτ = ∑_{k=0}^{N} h(τk) wk    (5)

where τk , k = 0, . . . ,N are the LGL nodes and the weights wk are given by:

wk = [2/(N(N + 1))] · 1/[LN(τk)]², k = 0, . . . ,N.    (6)

If L(τ) is a general smooth function, then for a suitable N, its integral over τ ∈ [−1, 1] can be approximated as follows:

∫_{−1}^{1} L(τ) dτ ≈ ∑_{k=0}^{N} L(τk) wk    (7)
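A sketch of Eqs. (5)-(6) for N = 2, where the LGL nodes are {−1, 0, 1} and the weights come out as {1/3, 4/3, 1/3}:

```python
def lgl_weights(nodes):
    """Quadrature weights of Eq. (6): w_k = 2 / (N (N+1) L_N(tau_k)^2)."""
    N = len(nodes) - 1
    def L(n, tau):
        if n == 0:
            return 1.0
        p_prev, p = 1.0, tau
        for k in range(1, n):
            p_prev, p = p, ((2 * k + 1) * tau * p - k * p_prev) / (k + 1)
        return p
    return [2.0 / (N * (N + 1) * L(N, tk) ** 2) for tk in nodes]

# The rule is exact for degree <= 2N - 1 = 3, e.g. the integral of tau^2 over [-1, 1] is 2/3.
nodes = [-1.0, 0.0, 1.0]
w = lgl_weights(nodes)
print(sum(wk * tk ** 2 for wk, tk in zip(w, nodes)))   # 0.666...
```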
Discretising an optimal control problem
Note that PS methods require introducing the transformation:

τ = [2/(tf − t0)] t − (tf + t0)/(tf − t0),

so that it is possible to write Problem 1 using a new independent variable τ in the interval [−1, 1].
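The transformation and its inverse are simple affine maps (function names are illustrative):

```python
def t_to_tau(t, t0, tf):
    """Affine map [t0, tf] -> [-1, 1]: tau = 2 t/(tf - t0) - (tf + t0)/(tf - t0)."""
    return 2.0 * t / (tf - t0) - (tf + t0) / (tf - t0)

def tau_to_t(tau, t0, tf):
    """Inverse map [-1, 1] -> [t0, tf]."""
    return 0.5 * ((tf - t0) * tau + tf + t0)

print(t_to_tau(0.0, 0.0, 10.0), t_to_tau(10.0, 0.0, 10.0))   # -1.0 1.0
print(tau_to_t(0.0, 0.0, 10.0))                              # 5.0 (interval midpoint)
```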
Discretising an optimal control problem
After applying the above transformation, Problem 1 can be expressed as follows.

Problem 3. Find all the free functions and variables (the controls u(τ), τ ∈ [−1, 1], the states x(τ), τ ∈ [−1, 1], the parameter vector p, and the times t0 and tf ) to minimise the following performance index:

J = ϕ[x(−1), t0, x(1), tf ] + ∫_{−1}^{1} L[x(τ),u(τ), t(τ)] dτ

subject to:

ẋ(τ) = [(tf − t0)/2] f[x(τ),u(τ),p, t(τ)], τ ∈ [−1, 1],
Discretising an optimal control problem
Boundary conditions:
ψL ≤ ψ[x(−1),u(−1), x(1),u(1),p, t0, tf ] ≤ ψU .
Inequality path constraints:
hL ≤ h[x(τ),u(τ),p, t(τ)] ≤ hU , τ ∈ [−1, 1].
Direct Collocation methods for optimal control
Bound constraints on controls, states and static parameters:
uL ≤ u(τ) ≤ uU , τ ∈ [−1, 1],
xL ≤ x(τ) ≤ xU , τ ∈ [−1, 1],
pL ≤ p ≤ pU .
And the following constraints:
t0,L ≤ t0 ≤ t0,U ,

tf,L ≤ tf ≤ tf,U ,

tf − t0 ≥ 0.
Discretising an optimal control problem
In the Legendre pseudospectral approximation of Problem 3, the state function x(τ), τ ∈ [−1, 1] is approximated by the N-th order Lagrange polynomial x̃(τ) based on interpolation at the Legendre-Gauss-Lobatto (LGL) quadrature nodes, so that:

x(τ) ≈ x̃(τ) = ∑_{k=0}^{N} x(τk) φk(τ)    (8)
Discretising an optimal control problem
Moreover, the control u(τ), τ ∈ [−1, 1] is similarly approximated using an interpolating polynomial:

u(τ) ≈ ũ(τ) = ∑_{k=0}^{N} u(τk) φk(τ)    (9)

Note that, from (3), x̃(τk) = x(τk) and ũ(τk) = u(τk).
Discretising an optimal control problem
The derivative of the state vector is approximated as follows:

ẋ(τk) ≈ x̃′(τk) = ∑_{i=0}^{N} Dki x̃(τi), k = 0, . . . ,N    (10)

where Dki is an element of D, the (N + 1) × (N + 1) differentiation matrix given by (4).
Discretising an optimal control problem
Define the following nu × (N + 1) matrix to store the trajectories of the controls at the LGL nodes:

U = [ũ(τ0) ũ(τ1) . . . ũ(τN)]    (11)
Discretising an optimal control problem
Define the following nx × (N + 1) matrices to store, respectively, the trajectories of the states and their approximate derivatives at the LGL nodes:

X = [x̃(τ0) x̃(τ1) . . . x̃(τN)]    (12)

X′ = [x̃′(τ0) x̃′(τ1) . . . x̃′(τN)]    (13)
Discretising an optimal control problem
Now, form the following nx × (N + 1) matrix with the right-hand side of the differential constraints evaluated at the LGL nodes:

F = [(tf − t0)/2] [f[x̃(τ0), ũ(τ0),p, τ0] . . . f[x̃(τN), ũ(τN),p, τN ]]    (14)

Define the differential defects at the collocation points as the nx(N + 1) vector:

ζ = vec(X′ − F) = vec(XDᵀ − F)
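Assembling the defects is a single matrix product. A minimal sketch with the N = 2 differentiation matrix hardcoded (and (tf − t0)/2 = 1 for simplicity):

```python
def ps_defects(X, D, F):
    """Defect matrix X D^T - F for the Legendre PS discretisation.
    X and F are nx x (N+1) lists of rows; D is the (N+1) x (N+1) differentiation matrix."""
    nx, Np1 = len(X), len(X[0])
    return [[sum(X[i][j] * D[k][j] for j in range(Np1)) - F[i][k]
             for k in range(Np1)] for i in range(nx)]

# With nodes {-1, 0, 1}, state x(tau) = tau and dynamics xdot = 1
# (so F is all ones), the defects vanish.  D below is the N = 2 matrix of Eq. (4).
D = [[-1.5,  2.0, -0.5],
     [-0.5,  0.0,  0.5],
     [ 0.5, -2.0,  1.5]]
X = [[-1.0, 0.0, 1.0]]
F = [[1.0, 1.0, 1.0]]
print(ps_defects(X, D, F))   # [[0.0, 0.0, 0.0]]
```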
Discretising an optimal control problem
The decision vector y, which has dimension ny = nu(N + 1) + nx(N + 1) + np + 2, is constructed as follows:

y = [vec(U); vec(X); p; t0; tf ]    (15)
Multi-phase problems
Some optimal control problems can be conveniently formulated as having a number of phases.

Phases may be inherent to the problem (for example, a spacecraft drops a section and enters a new phase).

Phases may also be introduced by the analyst to allow for peculiarities in the solution of the problem.

Many real world problems need a multi-phase approach for their solution.
Multi-phase problems
The figure illustrates the notion of multi-phase problems.

[Figure: a timeline divided into Phase 1 to Phase 4, with start and end times t0^(i), tf^(i) for each phase i, where the end of one phase coincides with the start of the next.]
Multi-phase problems
Typically, the formulation of multi-phase problems involves the following elements:

1. a performance index which is the sum of individual performance indices for each phase;
2. differential equations and path constraints for each phase;
3. phase linkage constraints which relate the variables at the boundaries between the phases;
4. event constraints which the initial and final states of each phase need to satisfy.
Scaling
The numerical methods employed by NLP solvers are sensitive to the scaling of variables and constraints.

Scaling may change the convergence rate of an algorithm, the termination tests, and the numerical conditioning of systems of equations that need to be solved.

Modern optimal control codes often perform automatic scaling of variables and constraints, while still allowing the user to manually specify some or all of the scaling factors.
Scaling
A scaled variable or function is equal to the original value times a scaling factor.

Scaling factors for controls, states, static parameters, and time are typically computed on the basis of known bounds for these variables, e.g. for state xj(t):

sx,j = 1/max(|xL,j |, |xU,j |)

For finite bounds, the variables are scaled so that the scaled values are within a given range, for example [−1, 1].

The scaling factor of each differential defect constraint is often taken to be equal to the scaling factor of the corresponding state:

sζ,j,k = sx,j
Scaling
Scaling factors for all other constraints are often computed as follows. The scaling factor for the i-th constraint of the NLP problem is the reciprocal of the 2-norm of the i-th row of the Jacobian of the constraints evaluated at the initial guess:

sH,i = 1/||row(Hy, i)||2

This is done so that the rows of the Jacobian of the constraints have a 2-norm equal to one.

The scaling factor for the objective function can be computed as the reciprocal of the 2-norm of the gradient of the objective function evaluated at the initial guess:

sF = 1/||Fy||2
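The two scaling rules can be sketched as follows (function names are illustrative):

```python
import math

def variable_scale(lo, hi):
    """s = 1/max(|lo|, |hi|), so a variable with finite bounds scales into [-1, 1]."""
    return 1.0 / max(abs(lo), abs(hi))

def constraint_scales(jac_rows):
    """Reciprocal 2-norm of each Jacobian row (evaluated at the initial guess),
    so every scaled row has unit 2-norm."""
    return [1.0 / math.sqrt(sum(v * v for v in row)) for row in jac_rows]

print(variable_scale(-2000.0, 500.0))     # 0.0005
print(constraint_scales([[3.0, 4.0]]))    # [0.2]
```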
Efficient sparse differentiation
Modern optimal control software uses sparse finite differences or sparse automatic differentiation libraries to compute the derivatives needed by the NLP solver.

Sparse finite differences exploit the structure of the Jacobian and Hessian matrices associated with an optimal control problem.

The individual elements of these matrices can be numerically computed very efficiently by perturbing groups of variables, as opposed to perturbing single variables.
Efficient sparse differentiation
Automatic differentiation is based on the application of the chain rule to obtain derivatives of a function given as a numerical computer program.

Automatic differentiation exploits the fact that computer programs execute a sequence of elementary arithmetic operations or elementary functions.

By applying the chain rule of differentiation repeatedly to these operations, derivatives of arbitrary order can be computed automatically and very accurately.

Sparsity can also be exploited with automatic differentiation for increased efficiency.
Efficient sparse differentiation
It is advisable to use, whenever possible, sparse automatic differentiation to compute the required derivatives, as it is more accurate and faster than numerical differentiation.

When computing numerical derivatives, the structure may be exploited further by separating constant and variable parts of the derivative matrices.

The achievable sparsity depends on the discretisation method employed and on the problem itself.
Measures of accuracy of the discretisation
It is important for optimal control codes implementing direct collocation methods to estimate the discretisation error.

This information can be reported back to the user and employed as the basis of a mesh refinement scheme.
Measures of accuracy of the discretisation
Define the error in the differential equation as a function of time:

ε(t) = x̃′(t) − f[x̃(t), ũ(t),p, t]

where x̃ is an interpolated value of the state vector given the grid-point values of the state vector, x̃′ is an estimate of the derivative of the state vector given the state vector interpolant, and ũ is an interpolated value of the control vector given the grid-point values of the control vector.
The type of interpolation used depends on the collocation method employed. The absolute local error corresponding to state i on a particular interval t ∈ [t_k, t_{k+1}] is defined as follows:

η_{i,k} = ∫_{t_k}^{t_{k+1}} |ε_i(t)| dt

where the integral is computed using an accurate quadrature method.
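A sketch of this defect-based error estimate on a toy scalar problem follows. The test ODE x' = −x, the cubic Hermite interpolant for x̃, and the dense trapezoid rule standing in for "an accurate quadrature method" are all assumptions for illustration; the actual interpolation depends on the collocation scheme used.

```python
import numpy as np
from scipy.interpolate import CubicHermiteSpline, interp1d

# Toy problem (assumed for illustration): x' = f(x) = -x, whose exact
# solution exp(-t) is sampled on a coarse grid to stand in for an NLP solution.
f = lambda x, u, t: -x

t_grid = np.linspace(0.0, 2.0, 6)       # coarse mesh
x_grid = np.exp(-t_grid)                # grid-point states
u_grid = np.zeros_like(t_grid)          # no control in this toy case

# Hermite interpolant x_tilde of the state, linear interpolant u_tilde of the control
x_tilde = CubicHermiteSpline(t_grid, x_grid, f(x_grid, u_grid, t_grid))
xdot_tilde = x_tilde.derivative()
u_tilde = interp1d(t_grid, u_grid)

def eps(t):
    """ODE defect: eps(t) = d/dt x_tilde(t) - f[x_tilde(t), u_tilde(t), t]."""
    return xdot_tilde(t) - f(x_tilde(t), u_tilde(t), t)

def eta_local(tk, tk1, n=201):
    """eta_k: integral of |eps| over [tk, tk1] by a dense trapezoid rule."""
    ts = np.linspace(tk, tk1, n)
    a = np.abs(eps(ts))
    return float(np.sum((a[:-1] + a[1:]) * 0.5 * (ts[1] - ts[0])))

eta = [eta_local(a, b) for a, b in zip(t_grid[:-1], t_grid[1:])]
print(eta)
```

The defect vanishes at the grid points by construction, so the interesting contribution comes from the interior of each interval.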
The local relative ODE error is defined as:

ε_k = max_i [ η_{i,k} / (w_i + 1) ]

where

w_i = max_{k=1,…,N} [ |x_{i,k}|, |ẋ_{i,k}| ]

The error sequence ε_k, or a global measure of the size of the error (such as the maximum of the sequence ε_k), can be analysed by the user to assess the quality of the discretisation. This information may also be useful to aid the mesh refinement process.
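Given arrays of grid values and per-interval absolute errors η, these weights and the relative error sequence can be computed in a few vectorised lines. The array shapes and the random placeholder data are assumptions for illustration.

```python
import numpy as np

# Illustrative arrays (assumed shapes): nx states on an N-point grid,
# with eta[i, k] the absolute local error of state i on interval k.
rng = np.random.default_rng(0)
nx, N = 2, 10
x = rng.normal(size=(nx, N))                      # grid-point states x_{i,k}
xdot = rng.normal(size=(nx, N))                   # grid-point derivatives
eta = np.abs(rng.normal(scale=1e-4, size=(nx, N - 1)))

# w_i = max over all grid points of |x_{i,k}| and |xdot_{i,k}|
w = np.maximum(np.abs(x), np.abs(xdot)).max(axis=1)

# local relative ODE error: eps_k = max_i eta_{i,k} / (w_i + 1)
eps_k = (eta / (w[:, None] + 1.0)).max(axis=0)
eps_max = eps_k.max()     # global measure, usable as a stopping criterion
```

The +1 in the denominator guards against division by near-zero weights when a state stays close to zero over the whole grid.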
Mesh refinement
The solution of optimal control problems may involve periods of time where the state and control variables are changing rapidly.
It makes sense to use shorter discretisation intervals in such periods, while longer intervals are often sufficient when the variables are not changing much.
This is the main idea behind automatic mesh refinement methods.
Mesh refinement methods detect when a solution needs shorter intervals in regions of the domain by monitoring the accuracy of the discretisation.
The solution grid is then refined, aiming to improve the discretisation accuracy in regions of the domain that require it.
The NLP problem is then solved again over the new mesh, using the previous solution as an initial guess (after interpolation), and the result is re-evaluated.
Typically, several mesh refinement iterations are performed until a given overall accuracy requirement is satisfied.
With local discretisations, the mesh is typically refined by adding a limited number of nodes to intervals that require more accuracy, effectively sub-dividing existing intervals. A well known local method was proposed by Betts.
For pseudospectral methods, techniques have been proposed for segmenting the problem (adding phases) and automatically calculating the number of nodes in each segment.
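A schematic version of this subdivide-and-resolve idea is sketched below. The function and variable names are illustrative; real local refinement schemes such as the Betts method also decide how many nodes to add per interval rather than always bisecting.

```python
def refine_mesh(mesh, errors, tol):
    """Subdivide every interval whose local error exceeds tol
    (simplified to one bisection per offending interval)."""
    new_mesh = [mesh[0]]
    for left, right, err in zip(mesh[:-1], mesh[1:], errors):
        if err > tol:
            new_mesh.append(0.5 * (left + right))   # insert midpoint node
        new_mesh.append(right)
    return new_mesh

# Toy demonstration: pretend only the first interval is inaccurate.
mesh = [0.0, 0.25, 0.5, 0.75, 1.0]
errors = [1e-3, 1e-7, 1e-7, 1e-7]
print(refine_mesh(mesh, errors, tol=1e-5))
# [0.0, 0.125, 0.25, 0.5, 0.75, 1.0]
```

In a full solver this call would sit inside a loop: solve the NLP, estimate the per-interval errors, refine, and repeat until the global error measure falls below the tolerance.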
The figure illustrates the mesh refinement process.
Example: vehicle launch problem
This problem consists of the launch of a space vehicle (Benson, 2004).
The flight of the vehicle can be divided into four phases, with dry masses ejected from the vehicle at the end of phases 1, 2 and 3.
The final times of phases 1, 2 and 3 are fixed, while the final time of phase 4 is free.
It is desired to find the control u(t) and the final time to maximise the mass at the end of phase 4.
Image courtesy of nasaimages.org.
The dynamics are given by:

ṙ = v
v̇ = −(μ/‖r‖³) r + (T/m) u + D/m
ṁ = −T/(g0 Isp)

where r(t) = [x(t) y(t) z(t)]^T is the position, v(t) = [vx(t) vy(t) vz(t)]^T is the Cartesian ECI velocity, T is the vacuum thrust, m is the mass, g0 is the acceleration of gravity at sea level, Isp is the specific impulse of the engine, u = [ux uy uz]^T is the thrust direction, and D = [Dx Dy Dz]^T is the drag force.
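These equations transcribe directly into a right-hand-side function suitable for a collocation transcription. This is a sketch: the units (km, s, kg), the default parameter values, and the `drag_fn` interface are assumptions, not the actual model used in the slides.

```python
import numpy as np

def launch_rhs(t, state, u, T, Isp, drag_fn,
               mu=3.986004418e5,     # Earth gravitational parameter, km^3/s^2
               g0=9.80665e-3):       # sea-level gravity, km/s^2
    """RHS of the launch dynamics: state = [r (km), v (km/s), m (kg)].
    drag_fn(r, v) is a user-supplied drag force model (assumed interface)."""
    r, v, m = state[0:3], state[3:6], state[6]
    D = drag_fn(r, v)
    rdot = v                                               # position rate
    vdot = -mu / np.linalg.norm(r) ** 3 * r \
           + (T / m) * u + D / m                           # gravity, thrust, drag
    mdot = -T / (g0 * Isp)                                 # propellant burn rate
    return np.concatenate([rdot, vdot, [mdot]])
```

Each phase of the multi-phase problem would use this same right-hand side with its own thrust and mass parameters.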
The vehicle starts on the ground at rest relative to the Earth at time t0, so that the initial conditions are

r(t0) = r0 = [5605.2  0  3043.4]^T km
v(t0) = v0 = [0  0.4076  0]^T km/s
m(t0) = m0 = 301454 kg
The terminal constraints define the target transfer orbit, which is specified in orbital elements as

a_f = 24361.14 km, e_f = 0.7308, i_f = 28.5 deg, Ω_f = 269.8 deg, ω_f = 130.5 deg
The following linkage constraints force the position and velocity to be continuous, and also model the discontinuity in the mass due to the ejections at the end of phases 1, 2 and 3:

r^(p)(tf) − r^(p+1)(t0) = 0
v^(p)(tf) − v^(p+1)(t0) = 0
m^(p)(tf) − m_dry^(p) − m^(p+1)(t0) = 0,  (p = 1, …, 3)

where the superscript (p) represents the phase number. There is also a path constraint associated with this problem:

‖u‖² = ux² + uy² + uz² = 1
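A transcription might stack these boundary conditions into a single residual vector that the NLP drives to zero. The phase data layout and the helper name below are hypothetical, purely to illustrate the structure of the constraints.

```python
import numpy as np

def linkage_residuals(phases, m_dry):
    """Residuals that must vanish at the three phase boundaries:
    continuity of r and v, and a mass discontinuity of m_dry[p].
    Each phase is a dict holding its boundary values (assumed layout)."""
    res = []
    for p in range(3):                    # boundaries 1-2, 2-3, 3-4
        a, b = phases[p], phases[p + 1]
        res.append(a["r_f"] - b["r_0"])                            # position continuity
        res.append(a["v_f"] - b["v_0"])                            # velocity continuity
        res.append(np.atleast_1d(a["m_f"] - m_dry[p] - b["m_0"]))  # mass drop
    return np.concatenate(res)

# Consistent toy data: each residual should be exactly zero.
m_dry = [10.0, 5.0, 5.0]
masses = [(120.0, 100.0), (90.0, 80.0), (75.0, 70.0), (65.0, 60.0)]
phases = [{"r_0": np.ones(3), "r_f": np.ones(3),
           "v_0": np.zeros(3), "v_f": np.zeros(3),
           "m_0": m0, "m_f": mf} for (m0, mf) in masses]
print(linkage_residuals(phases, m_dry))   # 21 zeros
```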
This problem has been solved using the author's open source optimal control software PSOPT (see http://www.psopt.org).
Parameter values can be found in the code.
The problem was solved in eight mesh refinement iterations, until the maximum relative ODE error ε_max was below 1×10⁻⁵, starting from a coarse uniform grid and resulting in a finer non-uniform grid.
Figure: Altitude (km) vs. time (s) for the multiphase vehicle launch problem
Figure: Speed (m/s) vs. time (s) for the multiphase vehicle launch problem
Figure: Control components u1, u2, u3 (dimensionless) vs. time (s) for the multiphase vehicle launch problem
Example: Two burn orbit transfer
The goal of this problem is to compute a trajectory for a spacecraft to go from a standard space shuttle park orbit to a geosynchronous final orbit (Betts, 2010).
The engines operate over two short periods during the mission.
It is desired to compute the timing and duration of the burn periods, as well as the instantaneous direction of the thrust during these two periods.
The objective is to maximise the final weight of the spacecraft.
The mission then involves four phases: coast, burn, coast and burn.
The problem is formulated as follows. Find u(t) = [θ(t), φ(t)]^T for t ∈ [t_f^(1), t_f^(2)] and t ∈ [t_f^(3), t_f^(4)], and the instants t_f^(1), t_f^(2), t_f^(3), t_f^(4), such that the following objective function is minimised:

J = −w(t_f)

subject to the dynamic constraints for phases 1 and 3:

ẏ = A(y)Δ_g + b

the following dynamic constraints for phases 2 and 4:

ẏ = A(y)Δ + b
ẇ = −T/Isp
and the following linkages between phases:

y(t_f^(1)) = y(t_0^(2))
y(t_f^(2)) = y(t_0^(3))
y(t_f^(3)) = y(t_0^(4))
t_f^(1) = t_0^(2)
t_f^(2) = t_0^(3)
t_f^(3) = t_0^(4)
w(t_f^(2)) = w(t_0^(4))

where y = [p, f, g, h, k, L, w]^T is the vector of modified equinoctial elements, w is the spacecraft weight, Isp is the specific impulse of the engine, and T is the maximum thrust.
Expressions for A(y) and b are given in (Betts, 2010). The disturbing acceleration is Δ = Δ_g + Δ_T, where Δ_g is the gravitational disturbing acceleration due to the oblateness of the Earth, and Δ_T is the thrust acceleration:

Δ_T = Q_r Q_v [Ta cos θ cos φ,  Ta cos θ sin φ,  Ta sin θ]^T

where Ta(t) = g0 T/w(t), g0 is a constant, θ is the pitch angle, and φ is the yaw angle of the thrust.
Matrix Q_v is given by:

Q_v = [ v/‖v‖,  (v×r)/‖v×r‖,  (v/‖v‖) × ((v×r)/‖v×r‖) ]

Matrix Q_r is given by:

Q_r = [i_r  i_θ  i_h] = [ r/‖r‖,  ((r×v)×r)/(‖r×v‖ ‖r‖),  (r×v)/‖r×v‖ ]

The results shown were obtained with PSOPT, using local discretisations and 5 mesh refinement iterations, until the maximum relative ODE error ε_max was below 1×10⁻⁵.
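These column definitions translate directly into code. This is a sketch under the expressions quoted above; since each matrix has orthonormal columns, the resulting thrust acceleration has magnitude Ta.

```python
import numpy as np

def unit(a):
    return a / np.linalg.norm(a)

def Qv_matrix(r, v):
    """Columns: v/|v|, (v x r)/|v x r|, and their cross product."""
    iv = unit(v)
    ic = unit(np.cross(v, r))
    return np.column_stack([iv, ic, np.cross(iv, ic)])

def Qr_matrix(r, v):
    """Columns i_r, i_theta, i_h of the rotating radial frame."""
    ir = unit(r)
    ih = unit(np.cross(r, v))
    itheta = np.cross(ih, ir)   # equals ((r x v) x r)/(|r x v| |r|)
    return np.column_stack([ir, itheta, ih])

def thrust_accel(r, v, theta, phi, T, w, g0=9.80665):
    """Delta_T = Qr Qv [Ta cos(th)cos(ph), Ta cos(th)sin(ph), Ta sin(th)]^T,
    with Ta = g0*T/w, following the expressions quoted from Betts (2010)."""
    Ta = g0 * T / w
    body = Ta * np.array([np.cos(theta) * np.cos(phi),
                          np.cos(theta) * np.sin(phi),
                          np.sin(theta)])
    return Qr_matrix(r, v) @ Qv_matrix(r, v) @ body
```

A quick sanity check is that both matrices are orthogonal, so ‖Δ_T‖ = Ta for any pitch and yaw angles.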
Figure: Two burn transfer trajectory (x, y, z in km)
Mesh refinement statistics
Further reading
V.M. Becerra (2010). PSOPT Optimal Control Solver User Manual. http://www.psopt.org.
D.A. Benson (2004). A Gauss Pseudospectral Transcription for Optimal Control. Ph.D. thesis, MIT, Department of Aeronautics and Astronautics, Cambridge, Mass.
J.T. Betts (2010). Practical Methods for Optimal Control and Estimation Using Nonlinear Programming. SIAM.
C. Canuto, M.Y. Hussaini, A. Quarteroni, and T.A. Zang (2006). Spectral Methods: Fundamentals in Single Domains. Springer-Verlag, Berlin.
G. Elnagar, M.A. Kazemi, and M. Razzaghi (1995). "The Pseudospectral Legendre Method for Discretizing Optimal Control Problems". IEEE Transactions on Automatic Control 40, 1793-1796.