Windows Speech Platform and Cortana Extension


Upload: vanphuc

Post on 07-Feb-2018


MICROSOFT CONFIDENTIAL – for discussion purposes only. © 2015 Microsoft Corporation. All rights reserved.


• Two test criteria defined, with focus on Standard at Win10 RTM

Standard – Device meets basic functional guidelines for working with Speech Recognition.

Premium – Device meets recommended functional guidelines for working great with Speech Recognition, such as working well in less optimal environments (e.g. background noise, distance).

• Cortana and the Microsoft Speech Platform features do not have an associated Windows Certification program


• Speech Recognition Quiet test

The quiet tests represent an ideal environment with minimal ambient noise (noise floor < 35 dBA SPL).

• Speech Recognition Ambient test

The ambient noise tests represent various levels and types of noisy environments, e.g. Café and Pub.

• Speech Recognition Echo test

The echo noise tests represent various levels and types of render playback scenarios (e.g. media playing).


Cortana test requires an ETSI test room: ETSI EG 202 396-1 (Section 6)

• Test room size should be between 2.7 m x 3.7 m and 3.5 m x 4.4 m; height should be between 2.2 m and 2.5 m.

• Noise floor must be < 35 dB SPL(A), target 28 dB SPL(A)

• Reverberation time (RT) of the room should be 0.4 s < RT < 0.7 s in the frequency range between 100 Hz and 8 kHz
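The room limits above can be checked mechanically before a test session. The following is a minimal sketch, not part of the Microsoft tooling: the `TestRoom` type and `check_room` function are hypothetical names, and the size range is read as "shorter side 2.7 to 3.5 m, longer side 3.7 to 4.4 m", which is an assumption about how the slide's range is meant.

```python
from dataclasses import dataclass


@dataclass
class TestRoom:
    """Measured room properties (hypothetical container for this sketch)."""
    width_m: float
    length_m: float
    height_m: float
    noise_floor_dba: float   # A-weighted noise floor, dB SPL(A)
    reverb_time_s: float     # RT measured within 100 Hz to 8 kHz


def check_room(room):
    """Return a list of violated limits; an empty list means the room passes."""
    problems = []
    # Assumption: the shorter side must lie in 2.7..3.5 m, the longer in 3.7..4.4 m.
    short_side, long_side = sorted((room.width_m, room.length_m))
    if not 2.7 <= short_side <= 3.5:
        problems.append("short side outside 2.7 m to 3.5 m")
    if not 3.7 <= long_side <= 4.4:
        problems.append("long side outside 3.7 m to 4.4 m")
    if not 2.2 <= room.height_m <= 2.5:
        problems.append("height outside 2.2 m to 2.5 m")
    if not room.noise_floor_dba < 35.0:
        problems.append("noise floor not below 35 dB SPL(A)")
    if not 0.4 < room.reverb_time_s < 0.7:
        problems.append("reverberation time outside 0.4 s to 0.7 s")
    return problems


if __name__ == "__main__":
    # Example values are illustrative, not measurements from the source.
    print(check_room(TestRoom(3.0, 4.0, 2.4, 28.0, 0.5)))
```

A room that meets every limit yields an empty list; each failed limit adds one message, so the result can be logged alongside the test report.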


Cortana Test Room (cont'd)


DUT Test Position


DUT Test Position (cont'd)


Test Equipment

Item | Recommendations | Example
Head and Torso Simulator (HATS) | Compliant with ITU-T P.58 | B&K 4128C
and/or Mouth Simulator | Compliant with ITU-T P.51 | G.R.A.S. 44AB, B&K 4227 or NTi TalkBox
Amplifier (if HATS used) | Compatible with HATS used | B&K Nexus or B&K 2716C
Audio Generator/Analyzer | Fulfil test signal requirements | AP 585
Room loudspeakers & Stands | - | JBL LSR2325P & SMS-6000
Reference Free Field Microphone or Sound Level Meter | 100 Hz – 12 kHz, < 1% THD 94 dB SPL @ MRP, ± 2 dB on axis response | DPA 4007; B&K 2240 or NTi XL2 with M4260
PC Audio Interface | - | RME 9632 or Roland Octa-Capture


Test Signal, Files, Software

Stimulus | Length | Level | Frequencies | Other Parameters
Input Sweep | 3 s | 94 dB SPL @ MRP | 100 – 12000 Hz | Continuous, Log Sweep
Speech Input | ~ 20 min | 89 dB SPL @ MRP | 100 – 12000 Hz | LongCleanTalk-CortanaSubset_48k_24bit.wav
Music File | - | >= 70 dBA SPL @ LRP | - | Local playback for echo test
Ambient Noise | - | >= 57 dBA SPL @ DUT | - | Café, Pub from ETSI ES 202 396-1
Calibration (mouth) | - | - | 100 – 12000 Hz | LongCleanTalk-Calibration.wav

Software:
OEMDriverVerifytool: Check Audio Driver Configuration
Recording Tool: Record Audio Speech
Score Tool: Score
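To illustrate the "Input Sweep" stimulus (3 s, 100 to 12000 Hz, continuous log sweep), here is a minimal sketch of a logarithmic sine sweep generator. The 48 kHz sample rate is an assumption borrowed from the speech file name, and the function names are hypothetical. The 94 dB SPL @ MRP level is not set in code; it comes from calibrating the playback chain.

```python
import math


def log_sweep(f_start=100.0, f_end=12000.0, duration_s=3.0, sample_rate=48000):
    """Return samples in [-1, 1] of a continuous logarithmic sine sweep.

    Phase: phi(t) = 2*pi*f_start*T/k * (exp(k*t/T) - 1), with k = ln(f_end/f_start),
    so the instantaneous frequency rises exponentially from f_start to f_end.
    """
    n = int(duration_s * sample_rate)
    k = math.log(f_end / f_start)
    scale = 2.0 * math.pi * f_start * duration_s / k
    return [
        math.sin(scale * (math.exp(k * (i / sample_rate) / duration_s) - 1.0))
        for i in range(n)
    ]


def instantaneous_frequency(t, f_start=100.0, f_end=12000.0, duration_s=3.0):
    """Frequency in Hz of the sweep at time t (seconds)."""
    return f_start * (f_end / f_start) ** (t / duration_s)


if __name__ == "__main__":
    samples = log_sweep()
    print(len(samples), instantaneous_frequency(1.5))
```

The samples can be written to a WAV file with the standard `wave` module or any audio library before playback through the mouth simulator.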


• Prepare the normal ETSI room with any objects needed for the test scenarios

• Calibrate the reference microphone sensitivity according to the manufacturer guidelines, if a reference microphone is used

• Calibrate the HATS (or mouth simulator) to a flat magnitude response according to the manufacturer guidelines


Two speech files are provided:

• LongCleanTalk-CortanaSubset_48k_24bit.wav is used in the Cortana test

• LongCleanTalk-Calibration.wav is about 8 minutes long and is used to calibrate the talker level

[Figure: Long Clean Talk, showing the calibration file (LongCleanTalk-Calibration.wav) and the clean talk speech input file (LongCleanTalk-CortanaSubset_48k_24bit.wav)]


• Ambient Noise Calibration

Play the background noise file and increase the averaging time on the reference mic (or sound level meter) until the reading is steady.

Change the level of the background noise to be >= 57 dB SPL.

• Echo Noise Calibration

Play the music file and increase the averaging time on the sound level meter (or the reference mic) until the reading is steady.

Change the playback level on the DUT until >= 70 dB SPL @ LRP is reached (or the DUT max playback level is reached).
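The level adjustments described above come down to simple decibel arithmetic. The sketch below shows how far a playback level has to move to reach a target, and the stop condition for the echo calibration. All function names are hypothetical, and the example readings are illustrative values, not measurements from the source.

```python
def gain_to_target_db(measured_db, target_db):
    """Gain change in dB needed to move a steady reading to the target level."""
    return target_db - measured_db


def db_to_linear(gain_db):
    """Convert a gain in dB to a linear amplitude factor."""
    return 10.0 ** (gain_db / 20.0)


def echo_level_reached(measured_db_at_lrp, dut_at_max_playback, target_db=70.0):
    """Echo calibration stops at >= 70 dB SPL @ LRP or at the DUT's maximum playback level."""
    return measured_db_at_lrp >= target_db or dut_at_max_playback


if __name__ == "__main__":
    # Ambient noise reads 52 dB SPL at the DUT; the target is 57 dB SPL.
    delta = gain_to_target_db(52.0, 57.0)
    print(delta, db_to_linear(delta))
```

Raising the noise by the computed gain gives the minimum acceptable level; since the requirement is ">=", a slightly higher setting also satisfies it.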


Driver Configuration Verification


OEMVerification: Version 1.4
Target Device: Microphone Array (Realtek High Definition Audio)
Driver Mode Support:
  Raw: supported
  Speech: supported
Number of Channels: 2
Array Type: Linear
Beam Frequency Range: 120 ~ 7500hz
Beam Target H. Range: 30 ~ -30deg
Beam Target V. Range: 90 ~ -90deg
Microphone Count: 2
  Mic0: Type=Omni Location: (0, -23, 0) mm H. Angle: 0 deg V. Angle 0 deg
  Mic1: Type=Omni Location: (0, 22, 0) mm H. Angle: 0 deg V. Angle 0 deg
Distance between Mic0 and Mic1 is 4.50 cm
Effects: AcousticEchoCancellation NoiseSuppression BeamForming
Feedback:
Pipeline Indication: OEM provided expected to be used
Default Mic Gain (from registry): 64.01% (0x1901)
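Two values in the tool output above can be cross-checked by hand. This sketch recomputes the microphone spacing from the reported locations and decodes the registry gain value. Both helper names are hypothetical, and reading 0x1901 as hundredths of a percent is an assumption inferred from the printed pair "64.01% (0x1901)", not a documented registry format.

```python
import math


def mic_distance_mm(a, b):
    """Euclidean distance between two microphone locations given in millimetres."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def registry_gain_percent(raw_value):
    """Assumption: the raw registry value counts hundredths of a percent."""
    return raw_value / 100.0


if __name__ == "__main__":
    mic0 = (0, -23, 0)   # Mic0 location from the tool output, in mm
    mic1 = (0, 22, 0)    # Mic1 location from the tool output, in mm
    print(mic_distance_mm(mic0, mic1) / 10.0, "cm")
    print(registry_gain_percent(0x1901), "%")
```

The spacing works out to 45 mm, which matches the 4.50 cm the tool reports, so a mismatch here would point to incorrect geometry in the driver configuration.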


Get Score Result

Visual C++ Redistributable Packages for Visual Studio 2013


Transcription: Any news about mortgage
Recognition: ortona how far away is
Transcription: Can you play Billy Joel Piano Man
Transcription: Cortana how far away is the moon
Recognition: porn
Transcription: Average water bill in Belmont California
Recognition: nine zero five eight one nine five
Transcription: Change the current date and time
Recognition: zero zero zero
Transcription: nine zero five eight one nine five zero zero zero
Recognition: weather clear up on the big island
Transcription: Did the weather clear up on the big island
Recognition: a note to myself
Transcription: august ninth
Recognition: remind me to get butter
Transcription: A note to myself
Recognition: what time
Transcription: Remind me to get butter at Safeway
Recognition: one six two five four
Transcription: Cortana play some blues
Recognition: five six five zero
Transcription: one six two five four five six five zero

Transcription: Any news about mortgage
Recognition: any news about mortgage

Transcription: Can you play Billy Joel Piano Man
Recognition: can you play billy joel piano man

Transcription: Cortana how far away is the moon
Recognition: cortana how far away is the moon

Transcription: Average water bill in Belmont California
Recognition: average water bill in belmont california

Transcription: Change the current date and time
Recognition: change the current date and time

Transcription: nine zero five eight one nine five zero zero zero
Recognition: nine zero five eight one nine five zero zero zero

Transcription: Did the weather clear up on the big island
Recognition: did the weather clear up on the big island

Transcription: august ninth
Recognition: august night

Transcription: A note to myself
Recognition: a note to myself

Transcription: Remind me to get butter at Safeway
Recognition: remind me to get butter at safeway

Transcription: Cortana play some blues
Recognition: time to play some blues
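One common way to quantify the gap between a transcription and its recognition result is word error rate (WER). The slides do not say which metric the Score Tool uses, so the sketch below is an illustration under that assumption, not a description of the tool. The function name is hypothetical, and comparison is case-insensitive because the recognition output is lowercased.

```python
def word_error_rate(transcription, recognition):
    """Word-level edit distance divided by the number of reference words."""
    ref = transcription.lower().split()
    hyp = recognition.lower().split()
    if not ref:
        raise ValueError("transcription must contain at least one word")
    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    print(word_error_rate("august ninth", "august night"))
    print(word_error_rate("Cortana play some blues", "time to play some blues"))
```

Pairs that differ only in capitalization score 0.0 under this measure, so only genuine word substitutions, insertions, and deletions contribute.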


Result Analysis (cont'd)


(c) 2015 Microsoft Corporation. All rights reserved. This document is provided "as-is." Information and views expressed in this document, including URL and other Internet Web site references, may change without notice. You bear the risk of using it. This document does not provide you with any legal rights to any intellectual property in any Microsoft product. You may copy and use this document for your internal, reference purposes.

Some information relates to pre-released product which may be substantially modified before it’s commercially released. Microsoft makes no warranties, express or implied, with respect to the information provided here.