japanese-english code-switching speech data ….../20 code-switching code-switching (cs): a speaker...

31
/20 Japanese-English Code-switching Speech Data Construction Sahoko Nakayama, Takatomo Kano, Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura Nara Institute of Science and Technology Japan 1 Copyright © Nara Institute of Science Technology All rights Reserved

Upload: others

Post on 28-Jun-2020

27 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Japanese-English Code-switching Speech Data Construction

〇Sahoko Nakayama, Takatomo Kano, Quoc Truong Do, Sakriani Sakti, Satoshi Nakamura

Nara Institute of Science and Technology

Japan

1Copyright © Nara Institute of Science Technology All rights Reserved

Page 2: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Code-switching

Code-switching (CS):

A speaker switches languages within a conversation

[Inter-sentential code-switching]:

ああ、そうだってね。 On the honeymoon, they bought this.

(Oh,year, your’re right. On their honeymoon, they bought this)

[Intra-sentential word-level code-switching]:

確率を与えたので、その確率を使ってrecountingします。

(Since we gave a probability, we recalculate using that probability.)

[Intra-sentential phrase-level code-switching]:

わたしね、that’s not the pointとか言われたもんね。

(I was told that that’s not the point.)

Loanword insertion:

中間言語を使った時のメリットに何があるか?

(What is the merit of using an interlingua?)2

[Fujimura+2013]

[From NAIST lecture ]

[Nakamura+2005]

[From Naist lecture]

Copyright © Nara Institute of Science Technology All rights Reserved

Page 3: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Bilinguals and Code-switching

3

Bilingual speakers have increased in Japan.

[McSwan+2000]

Copyright © Nara Institute of Science Technology All rights Reserved

0

5000

10000

15000

20000

25000

1990 2016

Children

with a foreign parent

0

2000

4000

6000

8000

10000

12000

14000

1977 2015

21,180

13,686

0

10000

20000

30000

40000

50000

60000

70000

2006 2016

5,900

12,527 60,643

23,633

Children

who have lived abroad

Students

who study abroad

Code-switching plays a vital role in bilingualism.

(years)

(people) (people) (people)

(years) (years)

Data : MHLW Data : MEXT Data : JASSO

Page 4: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Reports of Code-switching in Japanese-English Case

4

A Japanese child who lives in the US

59 code-switching in half an hour[Nakamura+2005]

Bilingual children living in Japan, who have foreign parents

153 code-switching in 4 hours[Fujimura+2013]

1.

2.

Copyright © Nara Institute of Science Technology All rights Reserved

By aturukus

Page 5: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

えっと、日本語で”coincidence”って何ていうんだっけ?(Um, what do you say “coincidence”

in Japanese?)

Speech Recognition

5

Realize to recognize every conversation

翻訳された文章をtarget sentence

といいます。(The translated sentence is called

target sentence)

1. Lecture, Meeting 2. Dialogue with a robot

Copyright © Nara Institute of Science Technology All rights Reserved

Requirement

<Usage scenes >

[From NAIST lecture ]

Page 6: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Previous Research in ASR

Tried to train speech recognition to recognize code-switching

code-switching (GMM-HMM) [Vu+2012]

code-switching (DNN-HMM) [Yilmaz+2016]

→ case has received scant research

6

Frisian

English

Chinese

Dutch

English

Japanese

We focus on Japanese-English code-switching.

Copyright © Nara Institute of Science Technology All rights Reserved

Page 7: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Data Construction

Japanese-English code-switching ASR must be developed, but:

Problems

7

So we constructed

Japanese-English code-switching speech database by utilizing TTS

Copyright © Nara Institute of Science Technology All rights Reserved

Previous collected

code-switching speech data

Mainly from conversation

1. Mandarin-English

2. Frisian-Dutch

3. Spanish-English

4. Turkish-German

5. Mandarin-Taiwanese

[Lyu+2015]

[Heuvel+2016]

No large-scale data exists

for training model ①

②Time-consuming and expensive for collecting

etc…

[Solorio+2008]

[Herkenrath+2012]

[Lyu+2008]

Page 8: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

The Overview of Code-switching Data Construction

8

1

Japanese

text

2

Select

word/phrase

3

Translate into

English

7

Synthesize

English part

5

Synthesize &

get time information

6

Find time to switch

& trimming

Text

Speech

4

Code-switching

text

8

Code-switching

speech

Copyright © Nara Institute of Science Technology All rights Reserved

Page 9: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Data Resources in Text Data Construction

Copyright © Nara Institute of Science Technology All rights Reserved 9

Available data resources:

Monolingual BTEC text data

The ATR Basic Travel Expression Corpus(BTEC)

Basic conversations in travel domains

We used Japanese/English BTEC text data called BTEC1,2,3, and 4.

1

BTEC

Japanese text

2

Select

word/phrase

3

Translate into

English

4

Code-switching

text

Text

Page 10: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Data Resourcesin Speech Data Construction

Copyright © Nara Institute of Science Technology All rights Reserved 10

Available data resources:

Monolingual Japanese and English TTS system Use HMM-based Speech Synthesis System(HTS) Generated based on a bilingual BTEC speech data

Bilingual BTEC speech data

Originally constructed to emphasize a speech translation study

Use the data from 1 bilingual speaker

7

Synthesize

English part

5

Synthesize &

get time information

6

Find time to switch

& trimming

8

Code-switching

speech

BTEC

Japanese

text

Code-switching

text

Join

Speech

Page 11: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Word-level in Text Data Construction

11

Intra-sentential word-level code-switching

Copyright © Nara Institute of Science Technology All rights Reserved

観光 バス の パンフレット は あり ます か

(do you have any brochures for the sightseeing bus?)

※Commonly select katakana characters

Page 12: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Word-level in Text Data Construction

12

Intra-sentential word-level code-switching

Copyright © Nara Institute of Science Technology All rights Reserved

観光 バス の パンフレット は あり ます か

(do you have any brochures for the sightseeing bus?)

※Commonly select katakana characters

観光 バス の は あり ます か

(do you have any brochures for the sightseeing bus?)

⇒ pamphlet

by Google Translation

API

パンフレット

Page 13: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Word-level in Text Data Construction

13

Intra-sentential word-level code-switching

Copyright © Nara Institute of Science Technology All rights Reserved

観光 バス の パンフレット は あり ます か

(do you have any brochures for the sightseeing bus?)

※Commonly select katakana characters

観光 バス の は あり ます か

(do you have any brochures for the sightseeing bus?)

⇒ pamphlet

by Google Translation

API

パンフレット

観光 バス の pamphlet は あり ます か

(do you have any brochures for the sightseeing bus?)

Page 14: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Reference

これ、ウルトラマンティガ か, what do you think this is?

(Yes, this (is) Ultraman-Tiga or what do you think this is?)

それ は、 that’s not his arm.

(Speaking of that, that’s not his arm)

Phrase-level in Text Data Construction

14

Intra-sentential phrase-level code-switching

Copyright © Nara Institute of Science Technology All rights Reserved

Trigger is Japanese particles:

wa, ga, wo, ni, he(e), to, ka, kara, yori, ba, temo, keredo,

noni, node, kara, nari, nagara, tari, tsutsu

ガス 入り の 炭酸 水 を 二 本 氷 と 一緒 に 持っ て き て ください(Bring me two bottles of carbonic minerals and some ice please)

[Nakamura+2005]①

Page 15: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Phrase-level in Text Data Construction

15

Intra-sentential phrase-level code-switching

Copyright © Nara Institute of Science Technology All rights Reserved

ガス 入り の 炭酸 水 を 二 本 氷 と 一緒 に 持っ て き て ください(Bring me two bottles of carbonic minerals and some ice please)

ガス 入り の 炭酸 水 を(Bring me two bottles of carbonic minerals and some ice please)

二 本 氷 と 一緒 に 持っ て き て ください⇩

bring me with two ice

by Google Translation API

Page 16: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Phrase-level in Text Data Construction

16

Intra-sentential phrase-level code-switching

Copyright © Nara Institute of Science Technology All rights Reserved

ガス 入り の 炭酸 水 を 二 本 氷 と 一緒 に 持っ て き て ください(Bring me two bottles of carbonic minerals and some ice please)

ガス 入り の 炭酸 水 を(Bring me two bottles of carbonic minerals and some ice please)

二 本 氷 と 一緒 に 持っ て き て ください⇩

bring me with two ice

by Google Translation API

ガス 入り の 炭酸 水 を bring me with two ice.

(Bring me two bottles of carbonic minerals and some ice please)③

Page 17: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Speech Data Construction

17

昼食と 夕食0 3.231.76

1.31

つきですか dinner

Trimming

Japanese sentence English

word/phrase

Copyright © Nara Institute of Science Technology All rights Reserved

(Does that include breakfast and dinner?)

Page 18: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Data Analysis

0%

20%

40%

60%

80%

100%

MonolingualJapanese

Word-level code-switching

Phrase-level code-switching

MonolingualEnglish

Japanese words English words

18

2295K 1147K

161K

632K

749K

8873K

Statistics of Japanese-English

code-switching speech utterances

Copyright © Nara Institute of Science Technology All rights Reserved

※The number in the bar is the number of words.

Page 19: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Conclusion

We constructed

a Japanese-English code-switching corpus

19

Utilizing

Japanese and English

bilingual TTS 146k

(12% English)

Word level

code-switching

146k

(54% English)

Phrase level

code-switching

Copyright © Nara Institute of Science Technology All rights Reserved

No large-scale Japanese-English code-switching data exists

② Time-consuming and expensive for collecting

ProblemFor Japanese-English

code-switching ASR

Page 20: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Future works

20

Investigate the quality by increasing speakers and comparing with natural code-switching speech

Utilize it to enhance our speech recognition system for the bilingual community

Copyright © Nara Institute of Science Technology All rights Reserved

Page 21: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

The End

21Copyright © Nara Institute of Science Technology All rights Reserved

Page 22: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Appendix

22Copyright © Nara Institute of Science Technology All rights Reserved

Page 23: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Reference

M.Nakamura, “Developing codeswitching patterns of a Japanese/English bilingual child,” in Proceedings of the 4th International Symposium on Bilingualism, 2005, pp.1679-1689

K.Fujimura, “Inevitable Language Outcome: The Use of Code- switching and Code-mixing by Japanese People Living in London, England,” The bulletin of Yasuda women university, 2013, 41, 23-32

Jeff McSwan, “The architecture of the bilingual language faculty: Evidence from intrasentential codeswitching,” Bilingualism: Language and Cognition, vol.3, no. 1, pp.37-54, 2000.

23Copyright © Nara Institute of Science Technology All rights Reserved

Page 24: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Reference

Japanese Ministry of Health, Labour and Welfare,“Overviewof the population statistics in 2016 [in Japanese],” http://www.mhlw.go.jp/, 2016.

Japanese Ministry of Education, Culture, Sports, Science, and Technology, “School basic survey in 2015 [in Japanese],” http://www.mext.go.jp/, 2015.

Japan Student Service Organization, “Survey on japanesestudent abroad situation in 2016 [in Japanese],”http://www.jasso.go.jp/, 2016.

24Copyright © Nara Institute of Science Technology All rights Reserved

Page 25: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Reference

N. T. Vu, D. C. Lyu, J. Weiner, D. Telaar, T. Schlippe, F. Blaicher, E. S. Chng, T. Schultz, and H. Li, “A first speech recognition system for mandarin-englishcode-switch conversational speech,” in 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), March 2012, pp. 4889–4892.

Emre Ylmaz, Henk van den Heuvel, and David van

Leeuwen, “Investigating bilingual deep neural net-

works for automatic recognition of code-switching Frisian speech,” Procedia Computer Science, vol. 81, pp. 159 –

Copyright © Nara Institute of Science Technology All rights Reserved 25

Page 26: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Corpus sample

TextThis

study

Google

TTS

Word

level

私 の baggage が 見つかり ません。

(I can’t find my baggage)

これは declaration してませんね?(You haven’t declared this have you?)

Phrase

level

私 の 荷物 を please look for it.

(Please look for my luggage)

申告 する もの が Do you have?

(Do you have anything to declare?)

26Copyright © Nara Institute of Science Technology All rights Reserved

Page 27: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Available Data Resources

1. Monolingual BTEC text data

The ATR Basic Travel Expression Corpus(BTEC)

Basic conversations in travel domains

Collected by bilingual travel experts from Ja/En sentence pairs in travel domain phrasebooks

We used Japanese-English BTEC text data called BTEC1,2,3, and 4:

Copyright © Nara Institute of Science Technology All rights Reserved 27

Page 28: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Available Data Resources

2. Bilingual BTEC speech data

Constructed to emphasize a speech translation study

1015 Japanese/English sentence pairs were selected from 16,000 BTEC sentences

Recording was done with 3 bilingual speakers

WAV audio was recorded with a frequency of 16KHz,16 bits and a single channel

We only use the data from 1 bilingual speaker.

Copyright © Nara Institute of Science Technology All rights Reserved 28

Page 29: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

Available Data Resources

29

Speech

synthesis

(TTS)

EngineHMM-based Speech Synthesis

System(HTS)

Text Japanese-English BTEC

Speaker One bilingual speaker

Training data 960 utterances

Sampling rate 16kHz

Frame size/shift 25ms / 5ms

Feature vector 40-dimensional MGC, log F0

Vocoder WORLD vocoder

Copyright © Nara Institute of Science Technology All rights Reserved

3. English and Japanese HMM-Based speech synthesis

Page 30: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

SEAME(Mandarin-English) corpus

• South-East Asia Mandarin-English corpus

• 156 speakers(36.8% are Malaysian while the rest are Singaporean)

• 192 hours of wav audio records

• Speaking style: conversation and interview

• In total 110,145 segments

• 28,655(English)

• 24,438(Mandarin)

• 57,052(Code-switch)

Copyright © Nara Institute of Science Technology All rights Reserved 30

Page 31: Japanese-English Code-switching Speech Data ….../20 Code-switching Code-switching (CS): A speaker switches languages within a conversation [Inter-sentential code-switching]: ああ、そうだってね。On

/20

FAME!: Frisian radio broadcast database

• The regional public broadcaster of the province Frisian

• The total duration of the manually annotated radio broadcasts sums up to 18 hours extracted from radio programs almost 50 years

• Topic is about culture, history, literature, sports, nature, agriculture, politics, society and languages.

• 309 identified speakers and 233 unidentified speakers

• Total number of code-switching case is 3837

Copyright © Nara Institute of Science Technology All rights Reserved 31