Open Source Speech Recognition
FOSS Speech Recognition
Dictation
Virtual Assistant
Automatic Translation
This is a test
Windows Vista: User
Windows Vista: Live Demo
Windows Vista: Perl
FOSS Dictation: Demo
FOSS Dictation: 13.30% WER (Prototype in Optimal Conditions)
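For readers unfamiliar with the metric: word error rate (WER) is conventionally the number of word substitutions, deletions, and insertions needed to turn the recognizer output into the reference text, divided by the number of reference words. The sketch below is only an illustration of that standard definition. The function name `word_error_rate` and the sample sentences are assumptions for this example, and the slides do not say which tool or test set produced the 13.30% figure.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Illustrative helper (hypothetical name), not the scoring tool used in the talk.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # matching i reference words against nothing costs i deletions
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference word missing from output
                curr[j - 1] + 1,     # insertion: extra word in output
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# One dropped word out of four reference words.
print(word_error_rate("this is a test", "this is test"))  # 0.25
```

Under this definition, a WER of 13.30% corresponds to roughly 13 word errors per 100 reference words.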
What's next?
80:20 Rule
Language and Acoustic Models
Audio Books
Submissions to Voxforge
Server-side collection: Games?
Recordings of talks?
Text donation project for e.g. Blogs?
etc.
Dialog Manager
Improving dictation: Corrections, performance, etc.
Virtual Assistant based on Nepomuk?
Plasma Active?
$mobileOS?
etc.
Our own Speech Model: Flexibility++
Incredibly interesting,
Incredibly little interest
Image Credits
"Jumping" by Javier Morales
"Gamer 542" by c ps
"Language barrier might widen gaps in learning" from the World Bank Photo Collection
"Robot" by Andrea Vallejos