motivating applications

40
Decentralized Optimization, with application to Multiple Aircraft Coordination Decision Making Under Uncertainty MURI Review, July 2002 Gökhan Inalhan, Dusan Stipanovic, Claire Tomlin Hybrid Systems Laboratory Department of Aeronautics and Astronautics Stanford University

Upload: kamran

Post on 12-Jan-2016

42 views

Category:

Documents


0 download

DESCRIPTION

Motivating applications. (Source: Boeing X45-A). (Source: Northrop Grumman-X47A). (Source: NASA Ames). Hybrid systems. Continuous systems controlled by a discrete logic: embedded systems (autopilot logic) - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Motivating applications

Decentralized Optimization, with application to

Multiple Aircraft Coordination

Decision Making Under Uncertainty MURI Review,

July 2002

Gökhan Inalhan, Dusan Stipanovic, Claire Tomlin

Hybrid Systems LaboratoryDepartment of Aeronautics and Astronautics

Stanford University

Page 2: Motivating applications

Motivating applications

(Source: Boeing X45-A)

(Source: Northrop Grumman-X47A)

(Source: NASA Ames)

Page 3: Motivating applications

Hybrid systems

• Continuous systems controlled by a discrete logic: embedded systems (autopilot logic)

• Coordinating processes: multi-vehicle systems interfacing continuous control with coordination protocols

• Continuous systems with a phased operation: (biological cell growth and division)

xç= f(x;u;d)

continuous systems(control)

discrete systems(computer science)

Page 4: Motivating applications

Verification: a mathematical proof that the system satisfies a property

Controller synthesis: the design of control laws to guarantee that the system satisfies the property

• Methods give definitive answers, unlike simulation• Often give surprising answers, trajectories which one

might not think to simulate• Reduces development time, cost of certification

Verification and Controller Synthesis

unsafe

initial

Page 5: Motivating applications

Verification: a mathematical proof that the system satisfies a property

Controller synthesis: the design of control laws to guarantee that the system satisfies the property

• Methods give definitive answers, unlike simulation• Often give surprising answers, trajectories which one

might not think to simulate• Reduces development time, cost of certification

Verification and Controller Synthesis

unsafe

initial

Page 6: Motivating applications

Verification: a mathematical proof that the system satisfies a property

Controller synthesis: the design of control laws to guarantee that the system satisfies the property

• Methods give definitive answers, unlike simulation• Often give surprising answers, trajectories which one

might not think to simulate• Reduces development time, cost of certification

Verification and Controller Synthesis

unsafe

initial

Page 7: Motivating applications

Verification: a mathematical proof that the system satisfies a property

Controller synthesis: the design of control laws to guarantee that the system satisfies the property

Safety Property can be encoded as a condition on the system’s reachable set of states

Verification and Controller Synthesis

unsafeunsafe initialization

initial

unsafe

safe, under appropriate control

Page 8: Motivating applications

Example: Aircraft Collision Avoidance

Two identical aircraft at fixed altitude & speed:

ud

uxv

uyvv

duy

x

dt

d

sin

cos

),,(xf

‘evader’ (control) ‘pursuer’ (disturbance)

x

y

uv

d

v

Page 9: Motivating applications

Continuous Reachable Set

x

ySolve:

Display: G(t) = fx : J (x;t) < 0g

à @t@J (x;t) =minf0;maxumindH(x; @x

@J (x;t);u;d)g

Page 10: Motivating applications

Collision Avoidance Filter

Simple demonstration– Pursuer: turn to head toward evader– Evader: turn to head right

pursuer

safety filter’s input modification

pursuer’s inputevader’s desired input

evader

evader’s actual input

unsafe setcollision set

Movies …

Page 11: Motivating applications

Blunder Zones for Closely Spaced Approaches

evader

EEM Maneuver 1: accelerateEEM Maneuver 2: turn 45 deg, accelerate

EEM Maneuver 3: turn 60 deg

Page 12: Motivating applications

Blunder Zone is shown by the yellow contour

Red Zone in the green tunnel is the intersection of the BZ with approach path.

The Red Zone corresponds to an assumed 2 second pilot delay. The Yellow Zone corresponds to an 8 second pilot delay

Implementation: Display design courtesy of

Chad Jennings, Andy Barrows, David Powell

Page 13: Motivating applications

Map View showing a blunder

The BZ calculations are performed in real time (40Hz) so that the contour is updated with each video frame.

Page 14: Motivating applications

Verified Mode Switching in Autopilots

Page 15: Motivating applications

Use in Cockpit Interface Verification• Controllable flight envelopes for landing and Take Off / Go

Around (TOGA) maneuvers may not be the same• Pilot’s cockpit display may not contain sufficient information to

distinguish whether TOGA can be initiated

flareflaps extendedminimum thrust

rolloutflaps extendedreverse thrust

slow TOGAflaps extended

maximum thrust

TOGAflaps retracted

maximum thrust

flareflaps extendedminimum thrust

rolloutflaps extendedreverse thrust

TOGAflaps retracted

maximum thrust

revised interface

existing interface

controllable flare envelope

controllable TOGA envelope

intersection

Page 16: Motivating applications

V1

V2

V3 V4

Communication Zone

Safety Assurance Zone

V1

V2

V3 V4

A More General Problem Structure

Page 17: Motivating applications

Neighborhood of ith vehicle

(Decomposed) Centralized Optimization

Page 18: Motivating applications

Fixed time horizon – complete global map

t à 2ts t à ts t t + T à 2ts t + T à ts t + T

Bargainingstart

fixed time horizon

Page 19: Motivating applications

Flight Plans published by aircraft 1

Page 20: Motivating applications

Another Example

Page 21: Motivating applications

Flight Plans published by aircraft 1

Page 22: Motivating applications

Receding horizon – incomplete global map

t à 2ts t à ts t t + T à 2ts t + T à ts t + T

moving time horizon

tsBargainingstart

Page 23: Motivating applications

• Constraints embed:local dynamics: coordinated turn and straight flight [hdi

]

input constraints: limited turn rate and velocity [gei]

global coordination constraints: minimum safety assurance [gsij] for all j within

neighborhood of i

Local Optimization with Constraints

Page 24: Motivating applications

Centralized Optimization

Decomposed Centralized Optimization

Decomposition I

Pareto optimalityNash equilibrium

Page 25: Motivating applications

Define Hamiltonian for each subsystem:

is a Nash equilibrium for the centralized optimization problem if:

where

Thus, none of the subsystems can improve its solution, with all other subsystems’ solutions remaining fixed.

Nash Equilibrium for Centralized Problem

xã= [xã1; :::;x

ãi ; :::x

ãm]

H ci (x;õ;ö) = fi(xi) +õTh(x) +öTg(x)

H ci (x

ã;õ;ö) ô H ci (x

ãi;õ;ö)

xãi = [xã1; :::;xi; :::xã

m]

Page 26: Motivating applications

Decomposed Centralized Optimization

Decentralized Optimization

Decomposition II

Nash equilibriumLocal optimal solutions

Page 27: Motivating applications

Define Hamiltonian for each subsystem:

is a Nash equilibrium for the decentralized optimization problem if:

Optimal solutions by each of the subsystems

Nash Equilibrium for Decentralized Problem

xã= [xã1; :::;x

ãi ; :::x

ãm]

H dci (xi;õi;öijf xjgi) = fi(xi) +õT

i hi(xijf xjgi) +öTi gi(xijf xjgi)

H dci (x

ãi ;õi;öijf x

ãjg) ô H dc

i (xi;õi;öijf xãjg)

Proposition: is a Nash equilibrium of the centralized problem if and only if it is a Nash equilibrium of the

decentralized problem

Page 28: Motivating applications

Example: Nash Equilibrium at (0,0)

Page 29: Motivating applications

• Global contraction function from the local optimization structures

• For a particular solution, local optimization of the ith

vehicle only affects the portion of F tied to its own local optimization

Using Penalty function methods

Page 30: Motivating applications

• Eliminates cases in which a subsystem is artificially acting against a constraint dictated by another group

• Eliminates cases in which two subsystems act against each other with non-identical constraints

Cooperation Assumptions

Page 31: Motivating applications

Multiple solutions, or “threads”, exist within the system:

Vehicle #1 Vehicle #2 Vehicle #3 Vehicle #4Iteration #1

Iteration #2

Iteration #3

Iteration #4

Iteration #4

Nash Bargaining with Multiple Threads

Page 32: Motivating applications

Convergence Results

1. Global convergence to a (not necessarily feasible) Nash ‘solution’

2. If the gradients of the constraint functions are linearly independent (Linear Constraint Qualification Condition, LICQ), then global convergence to a feasible Nash solution

3. Pareto optimality for convex problems

[Inalhan, Stipanovic, Tomlin. Decentralized Optimization, with Application to Multiple Aircraft Coordination. CDC 2002, Submitted to JOTA]

Page 33: Motivating applications

V1

V2

V3 V4

4-Vehicle Example

Page 34: Motivating applications

PF i(xijf xjgi) = PF j(f xjgi;xi) 8j 2 N i

4-Vehicle Example

Page 35: Motivating applications

Flight Plans published by aircraft 1

Page 36: Motivating applications
Page 37: Motivating applications

Decentralized Initialization Procedure Heuristics– Multiple-Depots (Vehicles), Time-windows for access, Priority on

objectives and the vehicles– Iterative selection process carried at each vehicle– Best solution in the fleet is then selected from each vehicle’s

solution set

Applied to other problems of interest…

Page 38: Motivating applications

Non-cooperativeFull information

CooperativeFull information

Cooperativeincomplete information

Non-cooperativeNo information

Lack of information Bounded Irrationality

Spectrum of Approaches

Page 39: Motivating applications

Research Goals

• Design of provably correct and safe decentralized control protocols– Adapt to coordination– Allow for dynamic reconfiguration

• Treatment of information– Multi-scale provisioning of data based on inputs from various

sensing modalities– Urgency of the need for sensed data– Available bandwidth

• Verification algorithms– Used during design phase, to reduce the time spent during

the validation phase

Page 40: Motivating applications

Stanford DragonFly UAV

10 ft wingspan

12 ft wingspan

[Jang, Teo, Inalhan, and Tomlin, DASC 2001], [Jang and Tomlin, AIAA GNC 2002]