nguyen thi huong lien - hirota lab 1/10 echo hiding using exponential time-spread echo kernel...

10
Nguyen Thi Huong Lien - Hiro ta Lab 1/10 Echo Hiding using Exponential Time-spread Echo Kernel (ETEK-EH) and Its Applications to Audio Digital Watermarking and Speaker Recognition Proposed ETEK - EH Digital Watermarking for Music Two channel DW based on ETEK-EH & LSB Speaker Recognition System Trapezoid Fuzzy Similarity Personal Identification Voice ETEK-EH Overview Application 1 Application 2 (A data hiding method)

Upload: august-garrison

Post on 31-Dec-2015

214 views

Category:

Documents


0 download

TRANSCRIPT

Nguyen Thi Huong Lien - Hirota Lab 1/10

Echo Hiding using Exponential Time-spread Echo Kernel (ETEK-EH) and Its Applications to Audio Digital Watermarking and Speaker Recognition

ProposedETEK - EH

Digital Watermarkingfor Music

Two channel DW based on ETEK-EH & LSB

Speaker Recognition System

•Trapezoid Fuzzy Similarity•Personal Identification Voice•ETEK-EH

Overview

Application 1

Application 2

(A data hiding method)

Nguyen Thi Huong Lien - Hirota Lab 2/10

=

input output

original signal echo kernel echoed signal

(a) embedding process

original signal

echo

2( )lh

+

t

t

t

originalsignal

echo signal

1d

(b) 1 kernel

2( )2

lx

K

originalsignal

echo signal

(c) 0 kernel0d

2( )2

lx

K

2i

EH [D. Gruhl et al, 1996]

Conventional Echo Hiding

Nguyen Thi Huong Lien - Hirota Lab 3/10

Echo Hiding using Exponential Time-spread Echo Kernel (ETEK-EH)

i

originalsignal

echoesp

1KpK

Exponentialtime-spreadecho kernel

delay

Single Echo Method[D. Gruhl et al, 1996]

Room Reverberation

originalsignal

echo signal

(c) 0 kernel0d

2( )2

lx

K

2i

Single echo kernel

Nguyen Thi Huong Lien - Hirota Lab 4/10

ETEK-EH: Experimental ResultsRetrieval rate and Sound quality

Music Data: 10 clips (44.1 kHz, 30s~90s)

Embedded Information: 26x26 image

Accuracy Rate

0

20

40

60

80

100

1 2 3 4 5 6 7 8 9 10

Music clip

Per

cent

age

(%)

従来法提案法

Conventionalmethod

Proposedmethod

Single echo

ETEK

Retrieval rate comparison of the proposed method and conventional single echo method on 10 music clips

Retrieval Rate

Music Clip

Per

cen

tage

(%

)

1

2

3

4

5The proposed method=4.57

The conventional method=3.86

(7 young peopleSONY, MDR-CD900ST)

Sound evaluation

Nguyen Thi Huong Lien - Hirota Lab 5/10

Application to Digital Audio Watermarking:Two Channel Digital Watermarking for Music based on ETEK

Easy copy

Copyright Problems Robustness

Capacity

Digital Watermarking: Trade-off

Embedding Process2channelseparation

Segment-tation

Segment-tation

Method 1

Method 2based on

ETEK

Encoding

Re-segment

Re-segment

2channelunification

originalmusic signal watermarked

music signal

watermark

watermarklength

watermarkbits

Extracted watermarks: (a) channel 1 (b) channel 2

Imperceptibility

Nguyen Thi Huong Lien - Hirota Lab 6/10

ExperimentsComparison Methods & Measurement

1. Least Significant Bit [Yeh, 2004]2. Single EH [Gruhl, 1996]3. Multiple EH [Oh, 2001]4. Negative EH [Foo, 2001]5. Backward and Forward EH [Kim, 2003]6. ETEK-2 Channel Method

30 clips (8 genres, 44.1 kHz, 16bit)

Watermarking using 6 methods

Applying SMBA 42 attacks

Sound EvaluationObjective Difference Grade ( ODG)

[ITU-R BS.1387, 2001]EAQUAL Software

Robustness against AttacksStirMark Benchmark for Audio (SMBA)

[Andreas, 2005]42 attacks

Measurements for comparing different methods

Nguyen Thi Huong Lien - Hirota Lab 7/10

Two Channel Digital Watermarking for Music based on ETEK:Experiment Results

LSB [Yeh, 2004] Single EH [Gruhl, 2001] Multiple EH [Oh, 2001]

Negative EH [Foo, 2001] Backward & Forward EH [Kim, 2003]

Proposed ETEK-2channel

Nguyen Thi Huong Lien - Hirota Lab 8/10

Application to Speaker RecognitionSpeaker Recognition based on PIV & TFS

8/16

Nearest Centroid Neighbor

TFS=Trapezoid Fuzzy Similarity

Personal Identification Voice (PIV)=

Record 3 times/person8 voice features

Proposal

Application

Voice from 24 users/ 3 times

Similarity

mean

½ range

½ range

0

1

,i jA,i jB

,i jD,i jC,i jE jx

Increase recognition accuracy

Nguyen Thi Huong Lien - Hirota Lab 9/10

Application to Speaker Recognition Security Enhancement for SR by ETEK-EH

9/16

Watermark exists

Fail

Proposal

Nguyen Thi Huong Lien - Hirota Lab 10/10

Conclusion

Two channel DW based on ETEK & LSB

for Music

ETEK for EHSpeaker Recognition System

Improve security of SRby ETEK

Digital Watermarking: EH with ETEK: robust against

attacks, natural sound quality Copyright Protection for Music

Speaker Identification: New features, trapezoid fuzzy

similarity: better performance ETEK to improve security for SR

A data hiding method