robotics coe 584 deliberative & hybrid control

46
ROBOTICS COE 584 Deliberative & Hybrid Control

Upload: zulema

Post on 05-Jan-2016

38 views

Category:

Documents


0 download

DESCRIPTION

ROBOTICS COE 584 Deliberative & Hybrid Control. Lecture Outline. Deliberative control Hybrid control Types of layer organization selection advising adaptation postponement Examples of hybrid control AuRA, Atlantis SSS, PRS. Deliberative Systems. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: ROBOTICS  COE 584 Deliberative & Hybrid Control

ROBOTICS

COE 584

Deliberative & Hybrid Control

Page 2: ROBOTICS  COE 584 Deliberative & Hybrid Control

Lecture Outline Lecture Outline Deliberative control Hybrid control Types of layer organization

selection advising adaptation postponement

Examples of hybrid control AuRA, Atlantis SSS, PRS

Page 3: ROBOTICS  COE 584 Deliberative & Hybrid Control

Deliberative Systems Deliberative Systems

Purely deliberative systems are considered the classical control architecture, since they were the first to be tried

In AI, classical deliberative, planner-based architectures were used for reasoning about actions in various non-physical domains, such as chess

As a result, the same architectures were applied to robotics as well

Page 4: ROBOTICS  COE 584 Deliberative & Hybrid Control

In the 1960’s: Shakey In the 1960’s: Shakey

In the late 1960's, the state-of-the-art in machine vision was used to process visual information on a robot called Shakey, the forerunner of many AI-inspired robotics projects.

Shakey used a classical planner as the underlying structure to decide what to do.

What is planning?

Page 5: ROBOTICS  COE 584 Deliberative & Hybrid Control

Planning as Search Planning as Search Planning is looking ahead, searching The goal is a state The robot's entire state space is enumerated, and

searched, from the current state to the goal state Different paths are tried until one is found that

reaches the goal

If the optimal path is desired, then all possible paths must be considered in order to find the best one

Page 6: ROBOTICS  COE 584 Deliberative & Hybrid Control

SPA = Planner-based SPA = Planner-based Planner-based (deliberative) architectures

typically involve three generic sequential steps or functional modules: 1) sensing (S) 2) planning (P) 3) acting (A), executing the plan

Thus, they are called SPA architectures SPA has serious drawbacks What are they?

Page 7: ROBOTICS  COE 584 Deliberative & Hybrid Control

Problem 1: Time Problem 1: Time Complex state spaces:

very slow plan generation

Dynamic worlds: out of date plans (latency)

Page 8: ROBOTICS  COE 584 Deliberative & Hybrid Control

Problem 2: Space Problem 2: Space Representation of state space may

be very large Search tree (intermediate plan data)

may be very large Modern machines have virtual

memory (page to disk), but swapping is very slow

Page 9: ROBOTICS  COE 584 Deliberative & Hybrid Control

Problem 3: Representation Problem 3: Representation

Representation for planning has two parts: Knowing the state of the world Predicting the outcome of actions

State representation assumed to be: complete accurate current predictable

Page 10: ROBOTICS  COE 584 Deliberative & Hybrid Control

Problem 3: Representation Problem 3: Representation

Sensors have: noise inaccuracies aliasing (partial observability)

Effectors are: unpredictable unreliable

None of the assumptions are valid!

Page 11: ROBOTICS  COE 584 Deliberative & Hybrid Control

Problem 4: Execution Problem 4: Execution Execution is assumed to be:

sequential reliable unique (one actor)

But: blind execution of long sequences of

unreliable actions will fail E.g., p(success | 1 action) = 0.90 => p(success | 10 actions) = 0.35

Page 12: ROBOTICS  COE 584 Deliberative & Hybrid Control

Deliberative Summary Deliberative Summary In short, deliberative (SPA)

approaches: require search (slow) require representations (hard) encourage open-loop execution

(dangerous)

Page 13: ROBOTICS  COE 584 Deliberative & Hybrid Control

Opposition to SPA Opposition to SPA As a consequence, much opposition

from real robot practitioners mounted against SPA architectures

In the early/mid 1980's alternatives were proposed reactive systems hybrid systems

What happened to purely deliberative systems?

Page 14: ROBOTICS  COE 584 Deliberative & Hybrid Control

Role of Pure Deliberation Role of Pure Deliberation

Pure deliberation is alive and well in other domains, like game playing (chess, go, etc.) and other static worlds with plenty of time to plan

Page 15: ROBOTICS  COE 584 Deliberative & Hybrid Control

Planners Live On in Robotics Planners Live On in Robotics

The SPA approach has not been abandoned, it has been expanded

Given the two fundamental problems with purely deliberative approaches, we can augment them: search/planning is slow, so save/cache

important and/or urgent decisions; open-loop plan execution is bad, use

closed-loop feedback, and be ready to respond or re-plan when the plan fails.

Page 16: ROBOTICS  COE 584 Deliberative & Hybrid Control

Reusing Plans Reusing Plans Some frequently useful planned

decisions may need to be reused, so to avoid planning, an intermediate layer may cache and look those up

These can be intermediate-level actions (ILAs) macro operators: plans compiled into

more general operators for future use

Page 17: ROBOTICS  COE 584 Deliberative & Hybrid Control

Universal Plans Universal Plans Suppose for a given problem, all

possible plans are generated for all possible situations in advance, and stored

If for each situation a robot has a pre-existing optimal plan, it can react optimally, be reactive and optimal

It has a universal plan (These are complete reactive

mappings)

Page 18: ROBOTICS  COE 584 Deliberative & Hybrid Control

Viability of Universal Plans Viability of Universal Plans

A system with a universal plan is reactive; the planning is done at compile-time, not at run-time

Universal plans are not viable in most domains, because they require that: the world must be deterministic the world must not change the goals must not change

The world is too complex (state space is too large)

Page 19: ROBOTICS  COE 584 Deliberative & Hybrid Control

Situated Automata Situated Automata A formal notion of finite state machines

whose inputs are connected to sensors and whose outputs are connected to effectors are called situated automata.

Situated means existing in and interacting with a complex world, and automata is the formal name for FSMs (formally: finite state automata).

Situated automata are used to create reactive principled control systems.

Page 20: ROBOTICS  COE 584 Deliberative & Hybrid Control

Control w/ Situated AutomataControl w/ Situated Automata

Situated automata can be constructed in two basic ways: By hand (i.e., the designer puts FSMs

together), as in the Subsumption Architecture). By pre-compiling a complete plan (similar to

Universal Plans, but reduced down to circuits of FSMs). This requires the use of a special programming language that implements the right semantics and compiles down into FSM circuitry, as Rex and Gapps.

Page 21: ROBOTICS  COE 584 Deliberative & Hybrid Control

Domain Knowledge Domain Knowledge A key advantage of pre-compiled systems

is that domain knowledge, i.e., information that the designer has about the environment, the robot, and the task, can be embedded into the system in a principled way

Then, the system is compiled into a reactive circuit, so the knowledge does not have to be reasoned about (or planned with) explicitly, in real-time

Page 22: ROBOTICS  COE 584 Deliberative & Hybrid Control

Disadvantages Disadvantages A key disadvantage of pre-compiled

systems is that it quickly becomes prohibitively large to enumerate the state space of a real robot, and thus pre-compiling generally does not scale up to complex systems

Another disadvantage is common to compiled or hard-wired systems: the result is not flexible in the presence of changing environments, tasks or goals

Page 23: ROBOTICS  COE 584 Deliberative & Hybrid Control

Inventing Hybrid ControlInventing Hybrid Control The basic idea is simple: we want the

best of both worlds (if possible) The goal is to combine closed-loop

and open-loop execution That means to combine reactive and

deliberative control This implies combining the different

time-scales and representations This mix is called hybrid control

Page 24: ROBOTICS  COE 584 Deliberative & Hybrid Control

Organizing Hybrid Systems Organizing Hybrid Systems

A hybrid system typically consists of three components: a reactive layer a planner a layer that puts the two together

Hybrid architectures are often called three-layer architectures (TLA)

The planner and the reactive system are both standard, as we have covered them so far

Page 25: ROBOTICS  COE 584 Deliberative & Hybrid Control

The Magic Middle The Magic Middle The middle layer has a hard job:

1) compensate for the limitations of both the planner and the reactive system

2) reconcile their different time-scales 3) deal with their different

representations 4) reconcile any contradictory

commands between the two This is the challenge of hybrid

systems

Page 26: ROBOTICS  COE 584 Deliberative & Hybrid Control

Interaction of Layers Interaction of Layers

Hierarchical integrationPlanning guides reaction

Coupled planning & reacting

Page 27: ROBOTICS  COE 584 Deliberative & Hybrid Control

Dynamic Re-planning Dynamic Re-planning

Reaction can influence planning Any "important" changes discovered

by the low-level controller are passed back to the planner in a way that the planner can use to re-plan

The planner is interrupted when even a partial answer is needed in real-time

The reactive controller (and thus the robot) is stopped if it must wait for the planner to tell it where to go.

Page 28: ROBOTICS  COE 584 Deliberative & Hybrid Control

Planner-Driven Reaction Planner-Driven Reaction

Planning can influence reaction Any "important" optimizations the

planner discovers are passed down to the reactive controller

The planner’s suggestions are used if they are possible and safe

Who has priority, planner or reactor?

Page 29: ROBOTICS  COE 584 Deliberative & Hybrid Control

Types of Interaction Types of Interaction

Selection: Planning is viewed as configuration

Advising: Planning is viewed as advice giving

Adaptation: Planning is viewed as adaptation of controller

Postponing: Planning is viewed as a least commitment process

Page 30: ROBOTICS  COE 584 Deliberative & Hybrid Control

Selection Example: AuRA Selection Example: AuRA

R. Arkin (1986)

Planning is viewed as configuration Initial A* planner integrated with

schema-based controller Provides modularity, flexibility, and

adaptability

Page 31: ROBOTICS  COE 584 Deliberative & Hybrid Control

AuRA Schematic AuRA Schematic

Page 32: ROBOTICS  COE 584 Deliberative & Hybrid Control

Advising Example: Atlantis Advising Example: Atlantis

E. Gat (1991) (JPL) Three layers: controller, sequencer, deliberator Asynchronous, heterogeneous: reactivity and

deliberation Implemented in ALFA (A Language for Action) Planning as advice giving, not decree Notion of cognizant failure Tested on NASA rovers

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

Rocky 4

Page 33: ROBOTICS  COE 584 Deliberative & Hybrid Control

Atlantis Schematic Atlantis Schematic

Page 34: ROBOTICS  COE 584 Deliberative & Hybrid Control

Adaptation Example: Planner-ReactorAdaptation Example: Planner-ReactorD. Lyons (1992)Continuous modification of a reactive control

systemPlanning is a form of reactor adaptationAdaptation is on-line rather than off-line

deliberationPlanning is used to remove performance

errors when they occurUses a particular underlying mathematical

model called a process algebraTested in both assembly cell and grasp

planning

Page 35: ROBOTICS  COE 584 Deliberative & Hybrid Control

Planner-Reactor ArchitecturePlanner-Reactor Architecture

REACTOR

PLANNER WORLDPERCEPTIONS

REACTIONS

ADAPTION ACTION

SENSINGPERCEPTION

GOALS

Page 36: ROBOTICS  COE 584 Deliberative & Hybrid Control

Postponing Example: PRSPostponing Example: PRS PRS = Procedural Reasoning System Georgeff and A. Lansky (1987)

Least commitment via plan elaboration postponement

Tested on SRI Flakey

Page 37: ROBOTICS  COE 584 Deliberative & Hybrid Control

Flakey the robot Flakey the robot

QuickTime™ and aYUV420 codec decompressor

are needed to see this picture.

Page 38: ROBOTICS  COE 584 Deliberative & Hybrid Control

PRS Schematic PRS Schematic

Page 39: ROBOTICS  COE 584 Deliberative & Hybrid Control

Another Example: SSS Another Example: SSS J. Connell (1992)

SSS = Servo Subsumption Symbolic 3 layers: servo, subsumption, symbolic World models are a convenience, not a

necessity Symbolic: where-to-next (discrete time) Subsumption: where-to-go-now Servo: making it go (continuous time)

Tested on TJ

Page 40: ROBOTICS  COE 584 Deliberative & Hybrid Control

SSS Implementation: T J SSS Implementation: T J

Page 41: ROBOTICS  COE 584 Deliberative & Hybrid Control

More Examples More ExamplesSOMASS hybrid assembly system

C. Malcolm and T. Smithers (Edinburgh U.)

cognitive/subcognitive components planning as configuration

Agent architecture B. Hayes-Roth (Stanford) physical and cognitive levels functional boundary blurry

Multi-valued logic Saffiotti, Konolige, Ruspini (SRI)

Page 42: ROBOTICS  COE 584 Deliberative & Hybrid Control

Even More Examples Even More Examples

Supervenience L. Spector (1992, U. of Maryland)

Multiple levels of abstraction

Teleo-reactive agent architecture Benson and N. Nilsson (1995, Stanford)

Planning yields TR operator tree

Reactive Deliberation M. Sahota (1993, U. of British Columbia)

Robosoccer

Page 43: ROBOTICS  COE 584 Deliberative & Hybrid Control

Still More Examples Still More Examples Theoagent

T. Mitchell (CMU, 1990)Reacts when it can plans when it mustEmphasis on learning

Generic Robot ArchitectureNoreils and Chatila (1995, France)3 levels: planning, control system, functional

Dynamical Systems ApproachSchoner and Dose (1992)Planning is selecting and parameterizing

behavioral fields

Behaviors use vector summation

Page 44: ROBOTICS  COE 584 Deliberative & Hybrid Control

And Still More Examples And Still More Examples

Integrated path planning and dynamic steering control Krogh and C. Thorpe (1986, CMU)

Relaxation over grid-based model with potential fields controller

Planner generated waypoints for controller

Many others (including several for UUVs)

Page 45: ROBOTICS  COE 584 Deliberative & Hybrid Control

Hybrids Everywhere? Hybrids Everywhere?

Hybrid systems are the most popular alternative for single-robot control

Behavior-based systems are not used by quite as many researchers, but have more specialized niches (e.g., multi-robot systems) and more practical applications

Page 46: ROBOTICS  COE 584 Deliberative & Hybrid Control

Textbook Readings Textbook Readings MM 13, 15