over-subscription planning with numeric goals j. benton computer sci. & eng. dept. arizona state...

20
Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center (PARC) Palo Alto, CA Subbarao Kambhampati Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ

Post on 20-Dec-2015

217 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center

Over-subscription Planning with Numeric Goals

J. BentonComputer Sci. & Eng.

Dept.Arizona State University

Tempe, AZ

Minh DoPalo Alto Research Center

(PARC)Palo Alto, CA

Subbarao KambhampatiComputer Sci. & Eng.

Dept.Arizona State University

Tempe, AZ

Page 2: Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center

Over-subscription Planning

Goals optional & have utility

Actions have cost Maximize utility-cost

“Benefit”

cost = 200

cost = 500

cost = 300

Util = 500

Util = 200

B

CA

Initial: At A

Goals: Soil_Sample @ B & C

[“The Mystery Talk”, Smith 2003]

-100

300

200

Rovers Example

300

Page 3: Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center

Motivation

Numeric goals also have utility More soil gives better instrument

reading More packages give more profit

Cost for achieving varying values differs More soil requires more weight More packages require more

deliveries

Page 4: Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center

Objective

Want more/less G = soil-sample ∈ [2,4]

U(G) = (* (soil-sample) 2)

Challenge – A measurable level of numeric goal achievement: degree of satisfaction

Collect Cost=1Collect Cost=2

1 gram

1 gram

cost=3

soil collected

util=2*2=4 Collect Cost=3

1 gram

action cost

cost=6util=3*2=6Benefit=4-

3=1Benefit=6-6=0

Satisfy numeric goals at different values to give varying utility

Benefit

v a l u e

best benefit

Page 5: Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center

Modeling Numeric Goal Over-subscription

Achieve with a given utility

Specify a goal range

U(G) = (* (soil-sample) 2)

G = soil-sample ∈ [2,4]

4

2

8

1 2 3 40

6

Sample

Utility

1. Fixed utility forsatisfying level

2. Linear

3. Hard bounds

Infinity onrange OK

4. Model as aseparate goal

Page 6: Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center

SapaMps Architecture

Over-subscribed PlanningPlanning Problem

Input Initial State

Select state with bestf-value

Queue ofTime-Stamped

States Better benefit plan?

Yes OutputPlan

Generate States by Applying Actions

Build RTPGPropagate Cost

Find Utility

No

Anytime A* Search

Based on SapaPS

Page 7: Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center

Challenge – Heuristic Support

Heuristic needs to… Estimate cost of achieving variable

values Find the utility of the values

Extend current state-of-the-art techniques Planning graph structure

Reachability estimation Cost propagation

Page 8: Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center

Challenge – Find Goal Achievement Cost

Propagate reachable values with cost

Sample_Soil

Communicate

0 1 2 2.5

Move(Waypoint1)

Sample_Soil

cost( ): 0 1 2

Cost of achievingeach value bound

v1: [0,0] [0,1] [0,2]

A range of possible values

Page 9: Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center

Cost Propagation on Variable Bounds

Bound cost dependent upon action cost previous bound cost

- current bound cost adds to the next Cost of all bounds in

expressions

Sample_Soil

Cost(v1=2)

Sample_Soil

C(Sample_Soil)+Cost(v1=1)

v1: [0,0] [0,1] [0,2]

Sample_Soil

Cost(v1=6)

Sample_Soil

C(Sample_Soil)+Cost(v2=3)+Cost(v1=3)

v1: [0,0] [0,3] [0,6]

v2: [0,3]

Sample_SoilEffect: v1+=1

Sample_SoilEffect: v1+=v2

Page 10: Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center

Extracting Relaxed Plan with Numeric Info

Start with best benefit bounds Relaxed plan includes

Actions Supporting bounds

Benefit

v a l u e

best benefit

Page 11: Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center

Sample_Soil 1 (Sa1)

Dur = 1

Cost: 1 (at end)V1 += 1

Sample_Soil 2 (Sa2)

Dur = 1.25

Cost: 2 (at end)V1 += 2

Communicate (Com)

Dur = 1.5

Cost: 3(at start) V1 ≥ 1

Sa1

t0 1 1.25 2 2.5 3 3.75

C:1 Sa1 C:1 Sa1 C:1

Sa2 C:2 Sa2 C:2 Sa2 C:2

Com C:4 Com C:4

4

Goal: v2 ∈ [5,∞], U(v2 ∈ [5,∞]) = v2 * 3

(at start)V2 := V1

v1

value

cost

value

costv2

upper bound@ time point

v1 – soil sample in rover’s store

v2 – soil sample communicated

Page 12: Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center

Sample_Soil 1 (Sa1)

Dur = 1

Cost: 1 (at end)V1 += 1

Sample_Soil 2 (Sa2)

Dur = 1.25

Cost: 2 (at end)V1 += 2

Communicate (Com)

Dur = 1.5

Cost: 3 (at start)V2 := V1

(at start) V1 ≥ 1

Sa1

t0 1 1.25 2 2.5 3 3.75

C:1 Sa1 C:1 Sa1 C:1

Sa2 C:2 Sa2 C:2 Sa2 C:2

Com C:4

4

v1

value

cost

value

cost

Com C:4

satisfies goal

h(S) = U(G) - (cost of actions + cost of bounds)

v2

Page 13: Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center

Results – Modified Rovers

Added numeric variables: Soil and rock sample amount in rover store More communicated soil/rock - greater utility

Page 14: Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center

Average improvement: 3.06

Results – Modified Rovers

Page 15: Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center

Anytime A* Search Behavior

Page 16: Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center

Results – Modified Logistics

Added numeric variables: Number of packages at location More packages - greater utility

Page 17: Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center

Results – Modified Logistics

Average improvement: 2.88

Page 18: Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center

Summary

Over-subscription planning in the presence of Numeric goals Durative actions

Propagating cost over numeric values

Page 19: Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center

Future Work

Delayed satisfaction of goals

Goal utility dependency

late

-10

late

-10

Page 20: Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center

Questions.