wrapping up

22
Wrapping Up Ling575 Spoken Dialog Systems June 5, 2013

Upload: alida

Post on 24-Feb-2016

45 views

Category:

Documents


0 download

DESCRIPTION

Wrapping Up. Ling575 Spoken Dialog Systems June 5, 2013. Roadmap. Overview Distinctive factors in dialog: Human-human Human-computer Dialog components & dialog management Specialized topics: Detailed analysis of: Distinctive factors Techniques and applications Discussion: - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Wrapping Up

Wrapping UpLing575

Spoken Dialog SystemsJune 5, 2013

Page 2: Wrapping Up

RoadmapOverview

Distinctive factors in dialog:Human-human Human-computer

Dialog components & dialog management Specialized topics:

Detailed analysis of: Distinctive factors Techniques and applications

Discussion:Trends, techniques, interrelations

Page 3: Wrapping Up

Characteristics of DialogHuman-human:

Multi-party interaction:Flexible turn-taking, mixed initiative

Speech acts:Actions via speech, levels of interpretation

Implicature:Grice’s maxims

Cooperativity & closure:Grounding and levels of display

Corrections, repairs, and confirmations

Page 4: Wrapping Up

Characteristics of DialogHuman-computer – most deployed systems

Multi-party interaction:

Page 5: Wrapping Up

Characteristics of DialogHuman-computer – most deployed systems

Multi-party interaction:Rigid silence-based turn-taking, system or “mixed”

initiativeSpeech acts:

Page 6: Wrapping Up

Characteristics of DialogHuman-computer – most deployed systems

Multi-party interaction:Rigid silence-based turn-taking, system or “mixed”

initiativeSpeech acts:

Actions via speech: dialog acts, NLU Implicature:

Page 7: Wrapping Up

Characteristics of DialogHuman-computer – most deployed systems

Multi-party interaction:Rigid silence-based turn-taking, system or “mixed”

initiativeSpeech acts:

Actions via speech: dialog acts, NLU Implicature:

Um… depends on dialog management, NLU Grounding:

Page 8: Wrapping Up

Characteristics of DialogHuman-computer – most deployed systems

Multi-party interaction:Rigid silence-based turn-taking, system or “mixed” initiative

Speech acts:Actions via speech: dialog acts, NLU

Implicature:Um… depends on dialog management, NLU

Grounding:Confirmation: implicit/explicit: learned?Corrections, repairs: problematic

Why?

Page 9: Wrapping Up

Characteristics of DialogHuman-computer – most deployed systems

Multi-party interaction:Rigid silence-based turn-taking, system or “mixed” initiative

Speech acts:Actions via speech: dialog acts, NLU

Implicature:Um… depends on dialog management, NLU

Grounding:Confirmation: implicit/explicit: learned?Corrections, repairs: problematic

Constrained by complexity, processing, speed, etc

Page 10: Wrapping Up

Dialog System Components

HMM-based ASR modelsNLU: call-routing, semantic grammarsDialog acts and recognitionDialog management:

Finite-state Frame-based

VoiceXML Information state Statistical dialog management

Lots of examples!

Page 11: Wrapping Up

TopicsIn-depth discussions:

Computational approaches to make human-computer interaction more like human-human interactionMany issues raised in characterizing dialog:

Multi-party

Page 12: Wrapping Up

TopicsIn-depth discussions:

Computational approaches to make human-computer interaction more like human-human interactionMany issues raised in characterizing dialog:

Multi-party: multi-party interaction, turn-taking, initiative Grounding

Page 13: Wrapping Up

TopicsIn-depth discussions:

Computational approaches to make human-computer interaction more like human-human interactionMany issues raised in characterizing dialog:

Multi-party: multi-party interaction, turn-taking, initiative Grounding: Miscommunication & repair, incremental

processing Interpretation:

Page 14: Wrapping Up

TopicsIn-depth discussions:

Computational approaches to make human-computer interaction more like human-human interactionMany issues raised in characterizing dialog:

Multi-party: multi-party interaction, turn-taking, initiative Grounding: Miscommunication & repair, incremental processing Interpretation: Reference, affect, subjectivity, personification,

information structure, prosody Multi-modality

Applications and issues:Tutoring, machine translation, information-seekingNon-native speech

Page 15: Wrapping Up

Interconnections

Sentiment

Reference

Persona

Turn-taking

Apps: MT

Multi-party

Prosody

Tutoring Non-native

Multi-modality

Miscommunication

Info. Struct

Increment

Affect

Initiative

Page 16: Wrapping Up

Interconnections

Sentiment

Reference

Persona

Turn-taking

Apps: MT

Multi-party

Prosody

Tutoring Non-native

Multi-modality

Miscommunication

Info. Struct

Increment

Affect

Initiative

Page 17: Wrapping Up

Techniques & Sources of Information

Range of techniques:

Page 18: Wrapping Up

Techniques & Sources of Information

Range of techniques:Deep processing, shallow processing, manual rules

Machine learning:

Page 19: Wrapping Up

Techniques & Sources of Information

Range of techniques:Deep processing, shallow processing, manual rules

Machine learning:Anything from decision trees to POMDPs

Information sources:

Page 20: Wrapping Up

Techniques & Sources of Information

Range of techniques:Deep processing, shallow processing, manual rules

Machine learning:Anything from decision trees to POMDPs

Information sources:Acoustic, lexical, prosodic, timing, syntactic,

semantic, pragmatic, etcMultimodal: gaze, gesture, etc

Integration

Page 21: Wrapping Up

Techniques & Sources of Information

Range of techniques: Deep processing, shallow processing, manual rules

Machine learning: Anything from decision trees to POMDPs

Information sources: Acoustic, lexical, prosodic, timing, syntactic, semantic,

pragmatic, etcMultimodal: gaze, gesture, etc Integration: Complex and varied

Huge feature vectors, tandem models, blackboards, learned

Substantial strides, but huge remaining challenges

Page 22: Wrapping Up

Questions?Favorite topic?

Most surprising result?

Most obvious result?

Most surprising gap?