Distant Speech Recognition in Smart Homes Initiated by Hand Clapping within Noisy Environments. Florian Bacher & Christophe Sourisse [623.400] Seminar in Interactive Systems

Post on 25-Jan-2016

Page 1: Distant Speech Recognition in Smart Homes Initiated by Hand Clapping  within  Noisy Environments

Distant Speech Recognition in Smart Homes Initiated by Hand Clapping within Noisy Environments.

Florian Bacher & Christophe Sourisse

[623.400] Seminar in Interactive Systems

Agenda

Introduction

Methodology

Experiment Description

Implementation

Results

Conclusion

I. Introduction

Introduction

Smart homes have become a major field of research in information and communication technologies.

Possible way of interaction: Voice commands.

Goal of our experiment: evaluate the possibility of recognizing voice commands initiated by hand claps in a noisy environment.

Gather a set of voice commands uttered by various speakers.

II. Methodology

Methodology

Main method: Lecouteux et al. [1]
◦ Deals with speech recognition within distress situations.
◦ Problem: no background noise was considered.

Chosen methodology: adapt the protocol of Lecouteux et al., considering:
◦ Noisy settings.
◦ Initiating recognition using hand claps.

Methodological issues

Choice of the room setting
◦ Lecouteux et al. [1]: a whole flat.
◦ Vovos et al. [7]: one-room microphone array.
◦ Choice: one room with 2 microphones.

Choice of background noises
◦ Hirsch and Pearce [8]: NOISEX-92 database.
◦ Moncrieff et al. [5]: “Background noise is defined as consisting of typical regularly occurring sounds.”
◦ Choice: background noises of daily home life.

III. Experiment Description

Experiment Settings

Performed in a 3 m x 3 m room.

Sounds were captured by two microphones which were hidden in the room.

Experimental Protocol

20 participants (10 men, 10 women, 25.5 ± 11 years) took part in a two-phase experiment.

1st phase: recognize a word (“Jeeves”) as a command
◦ The system’s attention is caught by double clapping.
◦ 4 scenarios.
◦ Background noises tested: footsteps, opening doors, moving chairs, radio show.

2nd phase: gather a set of voice commands
◦ List of 15 command words.
◦ Reference recording for pronunciation issues.
◦ Each word is uttered 10 times.

IV. Implementation

Implementation

Used technologies:
◦ C# library System.Speech.Recognition: interface to the speech recognition engine used by Windows.
◦ Microphones: two dynamic microphones with cardioid polar pattern (Sennheiser BF812/e8155).
◦ Line6 UX1 Audio Interface
◦ Line6 Pod Farm 2.5

Implementation

The signal is captured in real time.

If there are exactly two signal peaks within a certain timeframe, the software classifies them as a double clap.

After a double clap has been detected, the actual speech recognition engine is activated (i.e. the software is waiting for commands).
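
The double-clap rule can be illustrated with a minimal sketch. This is not the authors' C# implementation: the function names, the amplitude threshold, and the refractory period are all assumptions chosen for illustration, and the length of the analysed window stands in for the "certain timeframe" mentioned on the slide.

```python
from typing import List, Sequence


def find_peaks(samples: Sequence[float], sample_rate: int,
               threshold: float = 0.6, refractory_s: float = 0.05) -> List[float]:
    """Return the times (in seconds) at which |amplitude| reaches the threshold.

    After each detected peak, samples are ignored for `refractory_s` seconds so
    that a single clap spanning several loud samples is counted only once.
    Both default values are illustrative assumptions.
    """
    peaks: List[float] = []
    skip_until = 0
    refractory = int(refractory_s * sample_rate)
    for i, sample in enumerate(samples):
        if i < skip_until:
            continue
        if abs(sample) >= threshold:
            peaks.append(i / sample_rate)
            skip_until = i + refractory
    return peaks


def is_double_clap(window: Sequence[float], sample_rate: int,
                   threshold: float = 0.6, refractory_s: float = 0.05) -> bool:
    """True only if the window contains exactly two signal peaks."""
    return len(find_peaks(window, sample_rate, threshold, refractory_s)) == 2


# Usage: one second of silence at 1 kHz with two short, loud impulses.
rate = 1000
window = [0.0] * rate
window[100] = 0.9
window[400] = -0.8
if is_double_clap(window, rate):
    # In the real system, this is where the speech recognition engine
    # would be activated to wait for commands.
    print("double clap detected")
```

A plain amplitude threshold like this would also fire on other impulsive sounds (a slammed door, for example), which is one reason the experiment tests the approach against background noise.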

V. Results

Results’ Classification

[Figure: decision diagram that classifies each trial as true positive, true negative, false positive, or false negative, based on three questions: Attempt of the participant? Performance? System recognized something?]
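
The slides do not say which summary figures were derived from these four classes. As a hedged illustration, the sketch below computes the usual confusion-matrix measures (precision, recall, accuracy) from such counts. The `TrialCounts` type and the numbers in the usage example are hypothetical and are not results from the experiment.

```python
from dataclasses import dataclass


@dataclass
class TrialCounts:
    """Number of trials in each class (hypothetical container)."""
    tp: int  # true positives
    fp: int  # false positives
    tn: int  # true negatives
    fn: int  # false negatives


def precision(c: TrialCounts) -> float:
    """Share of system detections that were correct."""
    detected = c.tp + c.fp
    return c.tp / detected if detected else 0.0


def recall(c: TrialCounts) -> float:
    """Share of genuine attempts that the system detected."""
    attempts = c.tp + c.fn
    return c.tp / attempts if attempts else 0.0


def accuracy(c: TrialCounts) -> float:
    """Share of all trials that were classified correctly."""
    total = c.tp + c.fp + c.tn + c.fn
    return (c.tp + c.tn) / total if total else 0.0


# Usage with made-up counts (NOT data from the study).
example = TrialCounts(tp=8, fp=2, tn=7, fn=3)
print(f"precision={precision(example):.2f} "
      f"recall={recall(example):.2f} "
      f"accuracy={accuracy(example):.2f}")
```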

General Results

Detailed Results

VI. Conclusion

Conclusion

A new idea of how to initiate speech recognition in human-computer interaction.

An evaluation of the potential influence of a noisy environment.

Results: encouraging, but not yet satisfactory.

Next step: perform this experiment in a real smart-home context.

References

[1] B. Lecouteux, M. Vacher and F. Portet. Distant speech recognition in a smart home: comparison of several multisource ASRs in realistic conditions. Interspeech, 2011.

[2] A. Fleury, N. Noury, M. Vacher, H. Glasson and J.-F. Serignat. Sound and speech detection and classification in a health smart home. 30th Annual International IEEE EMBS Conference, Vancouver, British Columbia, Canada, August 2008.

[3] M. Vacher, N. Guirand, J.-F. Serignat and A. Fleury. Speech recognition in a smart home: Some experiments for telemonitoring. Proceedings of the 5th Conference on Speech Technology and Human-Computer Dialogue, pages 1–10, June 2009.

[4] J. Rouillard and J.-C. Tarby. How to communicate smartly with your house? Int. J. Ad Hoc and Ubiquitous Computing, 7(3), 2011.

[5] S. Moncrieff, S. Venkatesh, G. West and S. Greenhill. Incorporating contextual audio for an actively anxious smart home. Proceedings of the 2005 International Conference on Intelligent Sensors, Sensor Networks and Information Processing, pages 373–378, Dec. 2005.

[6] M. Vacher, D. Istrate, F. Portet, T. Joubert, T. Chevalier, S. Smidtas, B. Meillon, B. Lecouteux, M. Sehili, P. Chahuara and S. Méniard. The sweet-home project: Audio technology in smart homes to improve well-being and reliance. 33rd Annual International IEEE EMBS Conference, Boston, Massachusetts, USA, 2011.

[7] A. Vovos, B. Kladis and N. Fakotakis. Speech operated smart-home control system for users with special needs. Proc. Interspeech 2005, pages 193–196, 2005.

[8] H.-G. Hirsch and D. Pearce. The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In ASR-2000, pages 181–188.

Thank you for your attention!

Questions