the psychoacoustics of reverberation - · pdf filethe psychoacoustics of reverberation ......

47
The psychoacoustics of reverberation Steven van de Par [email protected] July 19, 2016 Thanks to Julian Grosse and Andreas Häußler 2016 AES International Conference on Sound Field Control

Upload: hoangdien

Post on 06-Feb-2018

232 views

Category:

Documents


4 download

TRANSCRIPT

Page 1: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

The psychoacoustics of reverberation

Steven van de Par [email protected]

July 19, 2016

Thanks to

Julian Grosse and

Andreas Häußler

2016 AES International Conference on Sound Field Control

Page 2: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Introduction

The psychoacoustics of reverberation, what is this talk about?

• Reverberation is nearly always present in our daily life

• It creates large distortions of the physical waveform

• Yet it mostly has only a small effect on (speech) perception

Page 3: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Introduction

The psychoacoustics of reverberation, what is this talk about?

• Reverberation is nearly always present in our daily life

• It creates large distortions of the physical waveform

• Yet it mostly has only a small effect on (speech) perception

T60 = 250 ms

0 1 2 3 4 5-1

-0.5

0

0.5

1

Time(s)

Am

plit

ude

Clean speech

Reverberated speech

2.95 3 3.05 3.1 3.15 3.2

-0.3

-0.2

-0.1

0

0.1

0.2

0.3

0.4

Time(s)

Am

plit

ude

Clean speech

Reverberated speech

Page 4: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Introduction

The psychoacoustics of reverberation, what is this talk about?

• Reverberation is nearly always present in our daily life

• It creates large distortions of the physical waveform

• Yet it mostly has only a small effect on (speech) perception

Outline:

• Principles and mechanisms in perception that help beating reverberation

• Some ideas about controlling sound fields in a perceptually motivated manner

Page 5: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

The Peripheral Auditory System

Page 6: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Cochlea:

• Mechanical energy (oval window) is converted into a neural signal (auditory nerve)

• Performs a time-frequency analysis

The inner ear

Page 7: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

1. cochlear duct 2. scala vestibuli 3. scala tympani 4. spiral ganglion 5. auditory nerve fibres

The Cochlea

• The red arrow is from the oval window • The blue arrow points to the round window • The cochlea is about 2 mm in diameter

Page 8: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Inner Ear: the Basilar Membrane

Frequency-to-place transformation:

Each point on BM acts as a band-pass filter

Page 9: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Cochleagram

Simulates basilar-membrane filtering, and represents magnitudes in dBs.

Brain captures a relatively coarse spectro-temporal representation

Page 10: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Auditory signal representation

Cochleagram is a reasonable first order approximation of perception (loudness, timbre)

Additional perceptual cues (‘texture cues’):

- Timing Information for binaural processing

- ITDs, 20 s JND (source direction)

- Interaural cross-correlation (source width, listener envelopment)

- Temporal pitch cues

- Modulation cues (e.g. roughness of a sound)

Included in advanced models by e.g. Patterson, Meddis and colleagues and Dau et al. (1996, 1997)

Page 11: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Another function within the auditory system

Source segregation:

• Often multiple sources are present simultaneously

• We can focus on one source

• Cocktail party processing:

Listen to one speaker only

Spatial separation helps

How does the brain do it?

Page 12: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Complex acoustical scenes

Acoustic mixtures are often spectro-temporally sparse: For each time-frequency interval one source dominates in level Grouping of signal components is essential to make sense of the speech signal

Time (sec)F

requency (

Hz)

0.5 1 1.5 2 2.5

80

127

201

318

503

796

1260

1995

3159

5000

Azim

uth

(deg)

-50

-40

-30

-20

-10

0

10

20

30

40

50

Time (sec)

Fre

quency (

Hz)

0.5 1 1.5 2 2.5

500

1000

1500

2000

2500

3000

3500

4000

4500

5000

Energ

y

Cochleagram of a mix of two speakers binary mask indicating source dominance

Page 13: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Auditory grouping / segregation Bregman 1990: Auditory Scene Analysis

Primitive grouping cues:

• Common onset

• Common pitch

• Common AM/FM modulation

• Common location

All have to do with the physics of sound generation

See also: http://webpages.mcgill.ca/staff/Group2/abregm1/web/

Page 14: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Common frequency modulation is a grouping cue (Bregman,

http://webpages.mcgill.ca/staff/Group2/abregm1/web/index.htm)

Auditory grouping / segregation Fusion by common frequency change

Page 15: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Visual grouping / occlusion Apparent continuity

Difficult to see what we are dealing with

Page 16: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Without providing extra parts of the letters we can now see the letter “B” We added information about where the letters are cut The overlay is a physically plausible cause for not seeing part of the letters

Visual grouping / occlusion Apparent continuity

Page 17: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Is the auditory equivalent a two speaker situation?

Time

Fre

quency

Female Speaker

0.5 1 1.5 2 2.5

x 104

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

Time

Fre

quency

Male Speaker

0.5 1 1.5 2 2.5 3 3.5

x 104

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

Time

Fre

quency

Two speakers

0.5 1 1.5 2 2.5 3 3.5

x 104

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1Female speaker

Male speaker

2 speakers

Page 18: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Dominating voice only

Female mask

Male mask

Linear Sum

30 ms frames 1 critical band

Is the auditory equivalent a two speaker situation?

Female speaker 2 speakers

Male speaker Mask

Page 19: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Role of low-SNR speech glimpses?

Schoenmaker and van de Par (Advances in experimental medicine and biology, 2016)

Remove speech target tiles

with and SNR below a

criterion value

Speech intelligibility impaired

only beyond about 0 dB SNR

Only positive SNR parts of speech contribute to intelligibility

Page 20: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Reverberated

Female mask

Male mask

Linear Sum

30 ms frames 1 critical band T60 = 750 ms

What about reverberation?

Female speaker 2 speakers

Male speaker Mask

Page 21: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Reverberation and the Auditory Representation

Reverberation will temporally smear the auditory signal

Multiple delayed reflections will add to the direct sound

Often reverberant field is stronger than direct sound (critical radius)

Speech phonemes will start to overlap (Speech rate 10 Syllables/sec)

Music is slower (Allegro 150 bpm 3 notes/sec)

Segregation will become more difficult

Remember the primitive grouping cues:

• Common onset (largely preserved)

• Common pitch (pitch unaffected, changes will be smeared)

• Common AM/FM modulation (high rates changed and converted)

• Common location (much reduced reliability)

Page 22: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Measure distribution of binaural cues

– Target at 10º

Page 23: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Measure distribution of binaural cues

– Target at 10º

– Reverberation

Page 24: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Precedence effect:

- The first arriving wave front determines perceived direction

- Allows spatial cues to contribute to segregation in reverberant conditions

Sound localization: Precedence effect (Haas effect)

Page 25: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Intermediate summary

How does perception cope with reverberation:

• Reverberation is not represented well in the brain due to coarse spectro-temporal resolution of the auditory system

• Important perceptual segregation cues are robust against reverb

Common onset

Pitch

Common low-rate AM/FM

Spatial cues (due to precedence effect)

Page 26: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

How to use this knowledge for sound field control

The auditory principles that cope with reverberation are implemented in the ‘transformed’ auditory domain.

It is not possible to apply these processing principles directly on acoustical signals.

Two examples will be given that use perceptual processing knowledge for sound field control

Page 27: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Recording room Playback room

Authentic Audio reproduction

Page 28: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Authentic Audio reproduction

Approach to authentic reproduction: - Optimizing spatial parameters on a coarse spectro-temporal

scale is enough: - Direct sound for directional information - Reverberant sound for ASW and LEV (IACC)

- ‘Texture’ cues are represented in microphone signals

- Consider the (reverberant) acoustics at the reproduction side

Grosse and van de Par (IEEE J. OF SELECTED TOPICS IN SIGNAL PROCESSING, 2015)

Page 29: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Room-in-room reproduction

Page 30: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Room-in-room reproduction

Only direct sound can be reproduced optimally No control over reverberant sound field

Page 31: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Perceptual approach

Perceptual Optimization

Page 32: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Perceptual approach

Perceptual Optimization

Optimization targeting perceptually relevant statistical properties of reverberant sound field

Page 33: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Perceptual approach

Perceptual Optimization

Optimization targeting perceptually relevant statistical properties of reverberant sound field

The acoustics of the playback room is an integral part of the optimization

Page 34: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Optimization

Optimize perceptually relevant statistical parameters:

• Auditory Transfer Function

– Direct sound (front loudspeakers)

– Reverberant sound (dipole loudspeakers)

• Interaural Cross Correlation (frequency dependent)

– Cross-talk dipole loudspeakers

• T60

– Direct-to-reverberant ratio

Page 35: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Perceptual approach

Page 36: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Evaluation

All reproduction methods simulated with Room Impulse Responses over headphones convolved with dry instrument recordings

Compare objective parameters Recording: Seminar room & Church 699 (ms) 3040 (ms) Playback (PBR): Small Lab & Seminar Room 371(ms) 697 (ms)

Page 37: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Objective parameters

RinR = Conventional reproduction without optimization

RinR,Opt = Our proposed optimization

mCH = Multi-channel reproduction with surround speakers

Ref = Recording room

- Coloration can be reduced compared to RinR

- Spatial properties (IACC) better conserved

100 1000 10000

-10

-5

0

5

10

Frequency (Hz)

E

(d

B)

ERinR

ERinR,Opt

EmCH

100 1000 10000-1

-0.5

0

0.5

1

Frequency (Hz)

IAC

C

IACCref

IACCRinR

IACCRinR-Opt

IACCmCH

0 100 200 300 400 500 600

-60

-40

-20

0

t (ms)

L (

dB

)

edcref

edcRinR

edcRinR-Opt

edcmCH

Page 38: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Objective parameters

RinR = Conventional reproduction without optimization

RinR,Opt = Our proposed optimization

mCH = Multi-channel reproduction with surround speakers

Ref = Recording room

- Coloration can be reduced compared to RinR

- Spatial properties (IACC) better conserved

100 1000 10000

-10

-5

0

5

10

Frequency (Hz)

E

(d

B)

ERinR

ERinR,Opt

EmCH

100 1000 10000-1

-0.5

0

0.5

1

Frequency (Hz)

IAC

C

IACCref

IACCRinR

IACCRinR-Opt

IACCmCH

0 100 200 300 400 500 600

-60

-40

-20

0

t (ms)

L (

dB

)

edcref

edcRinR

edcRinR-Opt

edcmCH

Page 39: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Listening test

All reproduction methods simulated with Room Impulse Responses over headphones convolved with dry instrument recordings

MUSHRA test Ref = Recording room Recording: Seminar room & Church 699 (ms) 3040 (ms) Playback (PBR): Small Lab & Seminar Room 371(ms) 697 (ms)

Page 40: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Results and Conclusions Simple loudspeaker set-

up allows:

• Perceptual authentic reproduction

• Individualization by considering playback acoustics

Grosse and van de Par (2015) IEEE Journal of Selected Topics in Signal Processing

Seminar room

Church

Page 41: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Perceptual dereverberation

Scenario:

- Speech reproduction in a reverberant room

- Preprocessing of the speech signal to enhance speech intelligibility

Preprocessing

Speech signal

Reverberant room

Page 42: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Perceptual dereverberation

Main Idea:

- Conserve spectro-temporal pattern

- Use time-variant filtering (Hodoshima et al., 2006)

0 0.01 0.02 0.03 0.04 0.05

-1

-0.5

0

0.5

1

1.5

2

Time(s)

Am

plit

ude

Reverberated sine

Clean sine

Page 43: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Perceptual dereverberation Approach:

Preprocessing of Loudspeaker inputs

Adapt current frame based on past

Optimize algorithm parameters

with perceptual model.

(Jørgensen et al.

2013)

Page 44: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Perceptual dereverberation

Listening test:

- Reverberated (pre-processed) speech with reverberated noise

- Measure Speech Reception Threshold

Page 45: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Perceptual dereverberation

Listening test:

- Robustness for position

- Measure Speech Reception Threshold

Page 46: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Summary

The auditory system:

• Uses low-resolution spectro-temporal representation

• Extracts some special ‘texture’ cues

• Uses robust cues for segregation/grouping

Two examples for sound field control were shown

• Authentic audio reproduction in a reverberant playback room

• Perceptual dereverberation

Page 47: The psychoacoustics of reverberation - · PDF fileThe psychoacoustics of reverberation ... control The auditory ... -0.5 0 0.5 1 Frequency (Hz) C IACC ref IACC RinR IACC RinR-Opt IACC

Thank you for your attention Questions …