speech coding using lpc. what is speech coding speech coding is the procedure of transforming...

24
Speech Coding Using LPC

Upload: monica-young

Post on 12-Jan-2016

240 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

Speech CodingUsing LPC

Page 2: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

What is Speech Coding

Speech coding is the procedure of transforming speech signal into more compact form for

Transmission Available Bandwidth

Encryption

Page 3: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

Uncompressed Speech signal

Analog speech is a bandpassed signal between 200 and 3400 Hz.

Uncompressed digital speech is a bit stream at 64kB/s.

Transmission technology must transmit the signals from point A to point B:

with minimum degradationusing minimum bandwidth

Page 4: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

Speech coding

By coding we mean an efficient representation of the signal

– COMPRESSION

The main approaches: waveform coding transform coding Parametric / hybrid coding

} smart quantizers

Page 5: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

{

How each of these works:

Waveform coders: try to find an efficient representation of the waveform, directly.

Transform coders: try to find an efficient representation in the frequency domain.

Parametric coders: try to find a small set of parameters that are an efficient representation of the signal.

FFT, etc.

)(Hexc. speech

Page 6: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

Comparison of Comparison of speech coders

Page 7: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

LPC (Linear Predictive coding)

LPC is a model for signal production: it is based on the assumption that the speech signal is produced by a very specific model.

Page 8: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

Speech Production in HumaSpeech Production in Humans

The speech signal is created by: A pressure source (lungs),

exciting ... A Filter (Vocal tract:

pharynx - mouth [soft palate, tongue] - nasal cavity)

Page 9: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

For DSP Engineer For DSP Engineer

An excitation source A time varying filter

H(t, )

filter:Excitation speech

Page 10: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

The model and its representationThe model and its representation

The LPC model looks at speech as: Excitation:

periodic (voiced) - originating in the larynx

noise (unvoiced) - fricative, produced in the mouth

An all-pole filter representing the vocal tract

H()

all polefilter:.. ..

Page 11: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

Block Diagram

Page 12: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

Why the name “Linear Predictive Coding”

It is assumed that the new sample is the weighted linear combination of previous samples

p

inGeins

ians

1)()()(

Page 13: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

Z-Plane RepresentationZ-Plane Representation

In the z-plane we can write the model as a transfer function:

H zG

a zii

i

p( )

1

1

• Clearly this transfer function has only poles - which is why it represents an all pole filter.

Page 14: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

Mathematical analysisMathematical analysis

Reminder: our problem is to find the LPC parameters, for a given speech signal. This is called the Inverse Problem.

How do we find the set of parameters that gives the best match to the signal?

Page 15: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

What are these Parameters

The Coefficients of the All Pole Filter Pitch of the speech

Page 16: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

How do we find the Coefficients: least squares

Formulation: Given a signal s(n); Defining an error as:

Find the set of that will minize the mean square error:

p

ii insansne

1)()()(

ai

E e nn

2( )

Page 17: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

Solution:Solution:

Simply equate the derivative of E to zero:

E

ai p

i

0 1, ...

• Which gives us the Normal Equations:

piinsnsinsknsan

p

k nk ...1,)()()()(

1

• These are no more than p linear equations in p unknowns...

Page 18: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

Or in matricial form:

n

n

n

p

nnn

nnn

nnn

nspns

nsns

nsns

a

a

a

pnspnsnspnsnspns

pnsnsnsnsnsns

pnsnsnsnsnsns

)()(

)()2(

)()1(

)()()2()()1()(

)()2()2()2()1()2(

)()1()2()1()1()1(

2

1

Page 19: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

A correlation; in other words: take the signal, multiply it by a shifted version, and sum.

Since our signal is long and time varying- we did it on short windows

Two variants: autocorrelation method covariance method

?)()( n

inskns

What is each element of the form-

Page 20: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

Solving the Matrix

Found the Coefficients a(i) by Using the

Levinson-Durbin recursion method

Page 21: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

Second Parameter

Pitch was found by the finding the correlation of the signal window with itself

Then these parameters were transmitted

Page 22: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

Predictor coefficients 18 * 8 = 144

Gain 5

Pitch period 6

Voiced/unvoiced switch

1

Total 156

Overall bit rate50 * 156 = 7800

bits / second

Bit rate for plain LPC vocoder

Page 23: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

Predictor coefficients

18 * 8 = 144

Gain 5

DCT coefficients

40 * 4 = 160

Total 309

Overall bit rate

50 * 309 = 15450 bits /

second

Bit rate for voice-excited LPC vocoder with DCT

Page 24: Speech Coding Using LPC. What is Speech Coding  Speech coding is the procedure of transforming speech signal into more compact form for Transmission

Conclusion Sound produced through LPC method is

not exactly the real sound but it sounds intelligibly understandable

LPC can be used in Speech recognition systems

LPC was widely used in Military because of low bit rate in transmission

There are many variants over the basic scheme: LPC-10, CELP, MELP, RELP, VSELP, ASELP, LD-CELP...