speech processing 1 introduction waldemar skoberla phone: +49 731 3994 110 fax: +49 731 3994 250...

19
Speech Processing 1 Introduction Waldemar Skoberla e-mail : [email protected] phone : +49 731 3994 110 fax : +49 731 3994 250 WWW : http://www.starrec.com

Upload: meredith-thomas

Post on 18-Jan-2018

218 views

Category:

Documents


0 download

DESCRIPTION

Speech Processing 3 Communication Saying is not listening Listening is not understanding Understanding is not accepting Introduction

TRANSCRIPT

Page 1: Speech Processing 1 Introduction Waldemar Skoberla   phone: +49 731 3994 110 fax: +49 731 3994 250 WWW:

Speech Processing1

Introduction

Waldemar Skoberlae-mail : [email protected] : +49 731 3994 110fax : +49 731 3994 250WWW : http://www.starrec.com

Page 2: Speech Processing 1 Introduction Waldemar Skoberla   phone: +49 731 3994 110 fax: +49 731 3994 250 WWW:

Speech Processing2

Contents

1. Introduction

2. The Purpose of Voice Portals

3. The Expectations of the Performance

4. The Reality

5. Challenges and Problems

6. The Solution

7. Summary

Page 3: Speech Processing 1 Introduction Waldemar Skoberla   phone: +49 731 3994 110 fax: +49 731 3994 250 WWW:

Speech Processing3

Communication

Saying is not listening

Listening is not understanding

Understanding is not accepting

Introduction

Page 4: Speech Processing 1 Introduction Waldemar Skoberla   phone: +49 731 3994 110 fax: +49 731 3994 250 WWW:

Speech Processing4

The purpose of a dialogue is to ensure that the user gets what he wants

Introduction

There is a long way between an utterance and its acceptance.

Page 5: Speech Processing 1 Introduction Waldemar Skoberla   phone: +49 731 3994 110 fax: +49 731 3994 250 WWW:

Speech Processing5

The Purpose of Voice Portals

e.g. Unified Messaging

e.g. Time Information

e.g. Ticket Reservation

Voice PortalInstant access to services like news, traffic, weather, stocks,

Sports, etc. over the phone

User Profile:e.g. personal address book or calendar

Access through unique phone number

Page 6: Speech Processing 1 Introduction Waldemar Skoberla   phone: +49 731 3994 110 fax: +49 731 3994 250 WWW:

Speech Processing6

•Comfortable phone access to any kind of information and services (Simple search and find strategy).

•Quick access to preferred services and information (personalized services).

•No restrictions to the user’s input (natural language understanding).

Voice Portals should open the access to information and services and not prevent it.

The Purpose of Voice Portals

Page 7: Speech Processing 1 Introduction Waldemar Skoberla   phone: +49 731 3994 110 fax: +49 731 3994 250 WWW:

Speech Processing7

Does the user get what he wants?

The Expectation

Page 8: Speech Processing 1 Introduction Waldemar Skoberla   phone: +49 731 3994 110 fax: +49 731 3994 250 WWW:

Speech Processing8

The Expectation

•Easy to use (intuitive dialogues, natural language understanding).

•Flat menu structures with cross connections to all available services.

•Guidance through the dialogue and context sensitive online help.

•Easy to maintain and to extend (new services)

•Easy to find one’s way through the dialogue structure (homogenous user interface in all available services).

Possible? Practicable?

Page 9: Speech Processing 1 Introduction Waldemar Skoberla   phone: +49 731 3994 110 fax: +49 731 3994 250 WWW:

Speech Processing9

Voice Portal

Service 1

Service 2

Service N

Phone Access

The Reality

VP Application(ASR)

S1 Application(ASR)

S1 Application(ASR & DTMF)

S2 Application(DTMF)

Page 10: Speech Processing 1 Introduction Waldemar Skoberla   phone: +49 731 3994 110 fax: +49 731 3994 250 WWW:

Speech Processing10

The Reality

•Different approach for Dialogue Design within available services (system driven call flow, mixed initiative approach, with/without Barge In)

•The voice in System Prompts changes from one service to another (sometimes demanded but usually disturbing).

•Usage of different technologies (speech recognition, DTMF, Barge In)

Confusion for the user

Page 11: Speech Processing 1 Introduction Waldemar Skoberla   phone: +49 731 3994 110 fax: +49 731 3994 250 WWW:

Speech Processing11

The Challenge

•Vocabularies become bigger and grammars more complex (due to new services, natural language understanding, …)

•Continuous modification of available services and adding of new services

•Growing number of cross connections among all services

•Implementation of Text-To-Speech more and more necessary

fast growing complexity of the Voice Portal application

Page 12: Speech Processing 1 Introduction Waldemar Skoberla   phone: +49 731 3994 110 fax: +49 731 3994 250 WWW:

Speech Processing12

The Problems

•Big vocabularies may cause recognition confusions, speech recognizer do not offer100% accuracy (higher possibility of similar sounding words)

•Adding new applications cause increasing maintenance efforts (pre-recording of system prompts, establishment of cross connections, updating of online help, adding new words for the recognizer)

•Increased misunderstandings due to involved Text-To-Speech engines. (the state of the art quality of TTS is not as good as pre-recorded prompts)

nothing and nobody is perfect

Page 13: Speech Processing 1 Introduction Waldemar Skoberla   phone: +49 731 3994 110 fax: +49 731 3994 250 WWW:

Speech Processing13

The Problems

•Does the user always know what he can say and do? (clear structure and prompting)

•People sometimes do not say what they mean.

•Is the user always able to handle new and sophisticated features? (natural language understanding)

sometimes less is more (simple applications increase the acceptance)

Page 14: Speech Processing 1 Introduction Waldemar Skoberla   phone: +49 731 3994 110 fax: +49 731 3994 250 WWW:

Speech Processing14

Voice Portal

Service 1

Service 2

Service N

Phone Access

The Solution?

VP Application(ASR)

S1 Application(ASR)

S1 Application(ASR)

S2 Application(ASR)

Application Data Exchange

Page 15: Speech Processing 1 Introduction Waldemar Skoberla   phone: +49 731 3994 110 fax: +49 731 3994 250 WWW:

Speech Processing15

The Solution

•User adapted guidance and online support (tracing of user behavior and storing of the profile )

•Support through natural language understanding (no limits to what the user can say)

•Hidden user guidance (intelligent prompting)

•Design to Error

virtually (for the user ) no limitation to the flexibility

Page 16: Speech Processing 1 Introduction Waldemar Skoberla   phone: +49 731 3994 110 fax: +49 731 3994 250 WWW:

Speech Processing16

The Solution

•Easier dialogue design through predefined and approved dialogue modules for standardized tasks (input of phone number, requesting time and date, etc.)

•Support through sophisticated dialogue development tools (no limits to what the user can say)

intelligent tools and predefined dialogue modules reduce development efforts but do not replace human creativity

Page 17: Speech Processing 1 Introduction Waldemar Skoberla   phone: +49 731 3994 110 fax: +49 731 3994 250 WWW:

Speech Processing17

The Future

•Fast growing number of speech enabled services.

•Users become more and more familiar with new technologies.

•Consideration of Multiple Modes of operation. (e.g. speech input combined with graphical output)

•Automatic creation of new applications. (research status)

Page 18: Speech Processing 1 Introduction Waldemar Skoberla   phone: +49 731 3994 110 fax: +49 731 3994 250 WWW:

Speech Processing18

The Summary

•The complexity of voice portal applications will grow very fast in future.

•The quality of the services will be improved (sophisticated methods and ASR).

•Short time to market through dialogue tools and modules support

Page 19: Speech Processing 1 Introduction Waldemar Skoberla   phone: +49 731 3994 110 fax: +49 731 3994 250 WWW:

Speech Processing19

Since Dialogues are human …

… they cannot be completely designed by machines.