david m. bond niels a. widger xiaoxun sun wheeler …ruml/papers/rtds-socs10-talk.pdfdavid m. bond...

David M. BondNiels A. WidgerWheeler Ruml

Xiaoxun Sun

Special thanks to Sven Koenig, Nathan Sturtevant, Brad Larsen,

NSF grant IIS-0812141 and the DARPA CSSG program

1David Bond (UNH)Real Time Search in Dynamic Worlds

Real Time Search in Dynamic Worlds

• Real Time Search

– Search that interleaves planning and execution of the plan

– Hard constraint on the amount of planning that can be done each time step

• Dynamic Worlds

– Worlds in which changes occur

– Increases or decreases in edge cost(s)

David Bond (UNH) 2Real Time Search in Dynamic Worlds

• Video Games

• Robotics

• Agents must returnan action within abounded time

• Doors, Bridges, Walls

David Bond (UNH) 3

Problem

• Solutions to Dynamism in Real Time Search

– Run a real time search algorithm repetitively

• No real time algorithm designed to work in dynamic worlds

• Do not account for edge cost decreases

Outline

• Problem

• Previous Work

– LSS-LRTA* – real time search

• agent centered search

– D* Lite – non-real time, dynamic search

• Solution - Real Time D* Lite (RTD*)

• Empirical Evaluation

Previous Work: LSS-LRTA*

• Repetitive agent centered search

• Uses A* lookahead for agent centered search

• Moves towards most promising node on the fringe of the agent centered search

• Updates cached h values of all states in local search space with Dijkstra’s algorithm

• Avoids local minima through these cached values

LSS-LRTA* Video

Previous Work: D* Lite

• Dynamic non-real time search algorithm

• Search backwards from goal to agent

• When edge cost(s) change reuses previous search

D* Lite Video

Solution: Real Time D* Lite

• Real Time• Dynamic• Do D* Lite

– Stop before computation limit reached– Do some agent centered search– Resume

• Bidirectional Search– Use D* Lite to search from goal to agent

• Global search

– Use LSS-LRTA* to search from agent to goal• Agent centered search

Global Search

• Combine a real time search algorithm with a dynamic search algorithm

11David Bond (UNH)

Agent Centered Search

• Persist global search through planning phase

• Move based on agent centered search when the global search does not have a complete path

• Move based on global search once complete path from goal to agent is found

• Computation limit divided between global and local search based on ratio parameter

Real Time D* Lite Video

Properties of Real Time D* Lite

• Complete

• Time complexity

– Single step constant time

• Space complexity

– Inherits from underlying algorithms

Empirical Evaluation

• Measure to evaluate a solution?– Time steps taken to reach goal

– Equivalent to trajectory/path length for LSS-LRTA*, RTD*

• What algorithms used for comparison?

• LSS-LRTA*– Not designed to handle edge cost decreases

– Still functions in worlds with decreases

– May not notice new paths

• Quality Bounds?• Two variants of D* Lite

– Non-Real Time D* Lite• Gives us the trajectory length of RTD* as the

computation limit approaches infinity

– Naïve real time version, RTD* NoOp• Agent returns do not move action after each planning

phase• Cost of solution increases by one each time if just

waiting there• We need to do better than this, otherwise just run D*

Lite without the real time constraint

• Two domains

– Initially unknown, static world pathfinding

• Warcraft maps

– Fully observable, dynamic world pathfinding

• Rooms generated with doors opening and closing

• Path to goal always guaranteed

Initially Unknown, Static World Pathfinding

Initially Unknown, Static World Pathfinding Video

1 10 100

Expansion Limit

NoOp RTD*

Non-RTD*

1 10 100

Expansion Limit

NoOp RTD*

LSS-LRTA*

Non-RTD*

Fully Observable, Dynamic World Pathfinding

0 100 200 300 400 500

Expansion Limit

Non-RTD*

0 100 200 300 400 500

Expansion Limit

LSS-LRTA*

Non-RTD*

Conclusion

• The first algorithm designed for dynamic real time worlds

• Modular and expandable

• Visit http://mokon.net/mai.aspx for source code and detailed results

David Bond (UNH) 25

david m. bond niels a. widger xiaoxun sun wheeler …ruml/papers/rtds-socs10-talk.pdfdavid m. bond...

Documents

z8ynek petracek-dran nacgh wjesti? / 1^/ jan ruml-iimit...

dylan o’ceallaigh and wheeler ruml department of computer...

produced by david widger (illustrated html version) a

zia widger, forrester research

todd aagaard villanova university charles widger school of...

ansi common lisp - 7chan of india private limited, ... ansi...

jordan thayer and wheeler ruml and jeff kreis

refleksion skriftserie nr. 10/2008 - ruml · og diskussion....

taxes for revenue are obsolete ruml 1946

joint commonsense and relation reasoning for …...joint...

volume xxix, issue 10 may, 2014 five elgin public students...

suboptimal variants of the conflict-based search algorithm...

is term project: unsignedmotion tom accuosti joe dooley...

zpátky ke hvězdám...obsazení thomas jerome newton igor...

industryracing.comindustryracing.com/downloads/031018program.pdf ·...

produced by david widger - bouquineux.comproduced by david...

cfpe inh. edith ruml ritterstraße 3 d-78661 dietingen www...

a survey of suboptimal search algorithms - computer...

faster than weighted a*: an optimistic approach to bounded...

university of chicago library guide to the beardsley ruml...