spatial reasoning in a multi-modal user guide for a complex machine

24
Spatial reasoning in a multi-modal user guide for a complex machine Nadejda Soudzilovskaia, Rafael Bidarra, Frederik W. Jansen Delft University of Technology, Faculty of Information Technology and Systems

Upload: kyrene

Post on 21-Jan-2016

34 views

Category:

Documents


0 download

DESCRIPTION

Delft University of Technology, Faculty of Information Technology and Systems. Spatial reasoning in a multi-modal user guide for a complex machine. Nadejda Soudzilovskaia, Rafael Bidarra, Frederik W. Jansen. Contents. Problem introduction Application Geometry based reasoning Techniques - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Spatial reasoning in a multi-modal user guide for a complex machine

Spatial reasoning in a multi-modal user guide for a complex machine

Nadejda Soudzilovskaia, Rafael Bidarra, Frederik W. Jansen

Delft University of Technology,Faculty of Information Technology and Systems

Page 2: Spatial reasoning in a multi-modal user guide for a complex machine

Contents

Problem introduction Application Geometry based reasoning Techniques Examples Questions

Page 3: Spatial reasoning in a multi-modal user guide for a complex machine

Problem introduction Multimodal interaction

– Multimodal = speech + graphics + text• Computer speech needs feedback of text and graphics• Speech introduces geometric references,

(e.g.the object in front of the building)

• Solving of these references needs combination of geometric and functional reasoning

Page 4: Spatial reasoning in a multi-modal user guide for a complex machine

?

Application: Interactive fax user guide Speech & Reasoning & Graphics Task: develop dialogue technologies

Application

Page 5: Spatial reasoning in a multi-modal user guide for a complex machine

Single questions & answers about the fax Questions: Speech Answers: Graphics & Speech

How can I make a copy of a document?

• Put the document into the paper source tray

• Press copy button

Fax encyclopedia

Page 6: Spatial reasoning in a multi-modal user guide for a complex machine

Pre-programmed answers– develop set of possible user questions– develop answer + “movie” for every question

Possible solutions

Not an intelligent system. Only pre-cooked question and answers are possible

Page 7: Spatial reasoning in a multi-modal user guide for a complex machine

Possible solutions

Answers derived by reasoning

– knowledge based system containing:

• data about fax functionality

• data about fax construction (3D model)

– Reasoning system based on:

• semantic

• geometry

Answers derived by reasoning

– knowledge based system containing:

• data about fax functionality

• data about fax construction (3D model)

– Reasoning system based on:

• semantics

• geometry

Page 8: Spatial reasoning in a multi-modal user guide for a complex machine

System architecture

Dialogue manager

Speech recognizer

Parser

Geometrybased

reasoning

Semanticbased

reasoning

Graphical answer

generation

Textual answer

generation

?

Page 9: Spatial reasoning in a multi-modal user guide for a complex machine

Graphics

– Solve geometric references

– Derive descriptions of a component location

– Show components

Tasks:

Geometry based reasoning

Tasks:

– Navigation though the virtual world

Page 10: Spatial reasoning in a multi-modal user guide for a complex machine

How can I make a copy of a document?

• Put the document into the paper source tray

• Press copy button

Geometry based reasoning Show components

Page 11: Spatial reasoning in a multi-modal user guide for a complex machine

Geometry based reasoning

– choose appropriate scale– choose appropriate view point

Show components

Page 12: Spatial reasoning in a multi-modal user guide for a complex machine

Where is the power slot?

Power slot is on theback side of the fax

Geometry based reasoning Deriving location descriptions

Page 13: Spatial reasoning in a multi-modal user guide for a complex machine

What is the button left of the start button?

This is a copy button

Geometry based reasoning Solve geometric references

Page 14: Spatial reasoning in a multi-modal user guide for a complex machine

Data about fax construction

– hierarchy graph

– feature graph

– geometry graph

Geometry based reasoning

Top face

Rib

Bottom face

CornerCorner

vertexedgeedge

face face

Telefax

Corpus Telephone Powerplug

Telephonereceiver

Operational panel

LCD screen

Start button

Keypad

....

....

Page 15: Spatial reasoning in a multi-modal user guide for a complex machine

Problem: search for a suitable reference object Proposal: use of hierarchy Result: reduction of reasoning complexity

Techniques

Telefax

Corpus Telephone Powerplug

Telephonereceiver

Operational panel

LCD screen

Start button

Keypad

....

....

Page 16: Spatial reasoning in a multi-modal user guide for a complex machine

Analyze of dimensionality in view of children location

Techniques

Objects that do not have children nodes:• Leaf objects

Objects that have children nodes:

• 3D objects: fax corpus, telephone

• 2D objects: operational panel, slot panel

• 1D objects: cables

• Set objects: group of slots, group of switches

Page 17: Spatial reasoning in a multi-modal user guide for a complex machine

Use of applicability zone in for a geometric reference

• when reason about a position on a surface of another object the spatial reference differ for 3D object and 2D object

• use of fuzzy logic to process zones of applicability

Techniques

Page 18: Spatial reasoning in a multi-modal user guide for a complex machine

Where is the power slot?

Power slot is on theback side of the fax

Examples Deriving location descriptions

Page 19: Spatial reasoning in a multi-modal user guide for a complex machine

1 priority Location on the surface

1.1 on the corner 1.2 on the right / left side 1.3 on the top / bottom face 1.4 on the front / back face

• Search for the object parent

• Select meaningful descriptions

2 priority Location relatively to neighbor objects

2.1 between (two other objects) 2.2 in front of , under, above 2.3 to the left of 2.4 to the left of 2.5 behind, under 2.6. near

Developed answer : The printer slot is on the back face of the fax, to the left of the printer slot

Deriving location descriptions

Page 20: Spatial reasoning in a multi-modal user guide for a complex machine

What is the button left of the start button?

This is a copy button

Examples Solve geometric references

Page 21: Spatial reasoning in a multi-modal user guide for a complex machine

1. Search through the same hierarchy level

1.1 Search for a parent group of operational buttons

1.2 Analyze location left of the start button

1.3. Choose suitable objects

2. Go one level higher in the hierarchy

2.1 Search for a parent of the parentoperational panel

2.2 Analyze location left of the start button

2.3 Select objects

3. Go one level higher……...

Solve geometric references

Page 22: Spatial reasoning in a multi-modal user guide for a complex machine

Summary

Role of spatial reasoning techniques in the application of multi-modal user guides

A fax machine user guide system Untilisng 3D model of a fax Use of hierarchy Use of dimentionality Use of areference applicability zone

Page 23: Spatial reasoning in a multi-modal user guide for a complex machine

Conclusions

Geometry based reasoning is a new functionality for such a dialogue system

3D model of the fax-machine is used as a source of data for reasoning

Several techniques for the geometry-based reasoning were developed and implemented using Java3D and OpenGL languages

Page 24: Spatial reasoning in a multi-modal user guide for a complex machine

Questions

?