information sharing for distributed planning

36
Information Sharing for Distributed Planning Prasanna Velagapudi AAMAS 2010 - Doctoral Symposium 1

Upload: kennan

Post on 24-Feb-2016

35 views

Category:

Documents


0 download

DESCRIPTION

Information Sharing for Distributed Planning. Prasanna Velagapudi. Large Heterogeneous Teams. 100s to 1000s of robots, agents, people Complex, collaborative tasks Dynamic, uncertain environment Joint planning intractable. Scaling Team Planning. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 1

Information Sharing forDistributed Planning

Prasanna Velagapudi

Page 2: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 2

Large Heterogeneous Teams

• 100s to 1000s of robots, agents, people

• Complex, collaborative tasks

• Dynamic, uncertain environment

• Joint planning intractable

Page 3: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 3

Scaling Team Planning

• Independent planners: can’t account for teammates• Existing work: needs specific structure or doesn’t

scale to these sizes– DPC, Prioritized Planning– JESP, Factored MDP, ND-POMDP

Page 4: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 4

Iterated Distributed Planning

1. Factor the problem, enumerate interactions2. Compute independent plans & potential interactions3. Exchange messages about interactions4. Use exchanged information, improve local model

Page 5: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 5

Iterated Distributed Planning

1. Factor the problem, enumerate interactions2. Compute independent plans & potential interactions3. Exchange messages about interactions4. Use exchanged information, improve local model

?

Page 6: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 6

Iterated Distributed Planning

1. Factor the problem, enumerate interactions2. Compute independent plans & potential interactions3. Exchange messages about interactions4. Use exchanged information, improve local model

?

Page 7: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 7

Iterated Distributed Planning

1. Factor the problem, enumerate interactions2. Compute independent plans & potential interactions3. Exchange messages about interactions4. Use exchanged information, improve local model

Page 8: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 8

A Tale of Two Distributed Planners

Distributed Prioritized Planning (DPP) L-TREMOR

Page 9: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 9

Distributed Prioritized Planning

Page 10: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 10

Multiagent Path Planning

Start

Goal

Page 11: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 11

Multiagent Path Planning

Page 12: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 12

Prioritized Planning

• Assign priorities to agents based on path length

[van den Berg, et al 2005]

Page 13: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 13

Prioritized Planning

• Plan from highest priority to lowest priority• Use previous agents as dynamic obstacles

[van den Berg, et al 2005]

Page 14: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 14

Distributed Prioritized Planning

Parallelizable& Equivalent

Page 15: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 15

Large-Scale Path Solutions

Page 16: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 16

Large-Scale Path Solutions

Page 17: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 17

DPP Results

Fewer Sequential Plans

Page 18: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 18

DPP Results

Longer Planning TimeFewer Sequential Plans

Page 19: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 19

• Prioritized Planning

• DPP

Why does this happen?

ABCD

ABCD

Longest planning agents might replan multiple times

Individual agent planning times varied by >2 orders of magnitude

Solution 2: Incremental Planning

Solution 1: Prioritize by plan time?

Page 20: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 20

Summary of DPP

• Observable, certain world• Only one type of interaction: collision

• Far fewer sequential planning iterations• Incremental planning may reduce execution time

Page 21: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 21

L-TREMOR

Page 22: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 22

A Simple Rescue Domain

Rescue Agent

Cleaner Agent

Narrow Corridor

Victim

Unsafe Cell

Clearable Debris

Page 23: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 23

A Simple (Large) Rescue Domain

Page 24: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 24

Distributed POMDP with Coordination Locales (DPCL)

• Often, interactions between agents are sparse

Only fits one agent Passable if

cleaned

[Varakantham, et al 2009]

Page 25: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 25

Distributed POMDP with Coordination Locales (DPCL)

• Define coordination locales (CLs) where POMDP model functions are not independent:

[Varakantham, et al 2009]

<S, A, Ω, P, R, O> (states) (actions) (obs.) (transition)(reward)(obs. fn)

Page 26: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 26

Distributed POMDP with Coordination Locales (DPCL)

• Define coordination locales (CLs) where POMDP model functions are not independent:

[Varakantham, et al 2009]

S1, A1 S2, A2

SglobalR1, P1, O1 R2, P2, O2

Outside CL:(typical)

Page 27: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 27

Distributed POMDP with Coordination Locales (DPCL)

• Define coordination locales (CLs) where POMDP model functions are not independent:

[Varakantham, et al 2009]

S1, A1 S2, A2

Sglobal

R12, P12, O12

Inside CL:(interaction)

Page 28: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 28

TREMOR

Role Allocation Policy Solution Interaction Detection Coordination

TREMOR

Branch & Bound MDP

Independent EVA[3] solvers

Joint policy evaluation

Reward shapingof independent

models

[Varakantham, et al 2009]

Page 29: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 29

L-TREMOR

Role Allocation Policy Solution Interaction Detection Coordination

TREMOR

Branch & Bound MDP

Independent EVA[3] solvers

Joint policy evaluation Reward shaping

of independentmodels

L-TREMOR

DecentralizedAuction

Sampling & message passing

Distributed & Parallelizable

Page 30: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 30

Preliminary Results – Joint Utility

N = 6 N = 10N = 100

(structurally similar to N=10)

Page 31: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 31

Preliminary Results – Timing

Page 32: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 32

Preliminary Results – Model Accuracy

R = 0.804

Page 33: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 33

Current Issues

• Oscillations in solutions

• Discovery of relevant locales

?

Page 34: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 34

Summary of L-TREMOR

• Partially-observable, uncertain world• Multiple types of interactions• Role-allocation of tasks

• Improvement over independent planning• Handles large problems• Next steps: improving convergence

Page 35: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 35

Conclusions

• Two approaches to distributed planning– DPP: approaching centralized performance– L-TREMOR: exceeding joint tractability

• Analogous strategies for distributing planning– Both iterate independent planners– Both exchange messages about states, actions

Page 36: Information Sharing for Distributed Planning

AAMAS 2010 - Doctoral Symposium 36

Future Work

• Generalized framework for distributed planning through iterative message exchange

• Reduce necessary communication• Better search over task allocations• Scaling to larger team sizes