CYBORG SPEECH: DEEP MULTILINGUAL SPEECH SYNTHESIS FOR GENERATING SEGMENTAL FOREIGN ACCENT WITH NATURAL PROSODY

被引：0

作者：

Henter, Gustav Eje ^{[1
]}

Lorenzo-Trueba, Jaime ^{[1
]}

Wang, Xin ^{[1
]}

Kondo, Mariko ^{[2
]}

Yamagishi, Junichi ^{[1
,3
]}

机构：

[1] Natl Inst Informat, Tokyo, Japan

[2] Waseda Univ, Tokyo, Japan

[3] Univ Edinburgh, Edinburgh, Midlothian, Scotland

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2018年

关键词：

Multilingual speech synthesis; phonetic manipulation; foreign accent; DNN; RECURRENT NEURAL-NETWORK; ENGLISH; INTELLIGIBILITY;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

We describe a new application of deep-learning-based speech synthesis, namely multilingual speech synthesis for generating controllable foreign accent. Specifically, we train a DBLSTM-based acoustic model on non-accented multilingual speech recordings from a speaker native in several languages. By copying durations and pitch contours from a pre-recorded utterance of the desired prompt, natural prosody is achieved. We call this paradigm "cyborg speech" as it combines human and machine speech parameters. Segmentally accented speech is produced by interpolating specific quin-phone linguistic features towards phones from the other language that represent non-native mispronunciations. Experiments on synthetic American-English-accented Japanese speech show that subjective synthesis quality matches monolingual synthesis, that natural pitch is maintained, and that naturalistic phone substitutions generate output that is perceived as having an American foreign accent, even though only non-accented training data was used.

引用

页码：4799 / 4803

页数：5

共 50 条

[21] Foreign Accent: The Phenomenon of Non-native Speech
Uhrig, Peter
ZEITSCHRIFT FUR ANGLISTIK UND AMERIKANISTIK, 2013, 61 (03): : 311 - 313
[22] Foreign accent: the phenomenon of non-native speech
Davies, Alan
JOURNAL OF MULTILINGUAL AND MULTICULTURAL DEVELOPMENT, 2015, 36 (05) : 545 - 548
[23] Foreign Accent: The Phenomenon of Non-native Speech
Hayes-Harb, Rachel
JOURNAL OF SOCIOLINGUISTICS, 2014, 18 (03) : 414 - 418
[24] Foreign Accent: The Phenomenon of Non-native Speech
Bergeron, Annie
CANADIAN MODERN LANGUAGE REVIEW-REVUE CANADIENNE DES LANGUES VIVANTES, 2014, 70 (04): : 588 - 590
[25] The role of foreign accent and short-term exposure in speech-in-speech recognition
Brouwer, Susanne
ATTENTION PERCEPTION & PSYCHOPHYSICS, 2019, 81 (06) : 2053 - 2062
[26] Foreign accent: Implications for delivery of speech and language services
Langdon, HW
TOPICS IN LANGUAGE DISORDERS, 1999, 19 (04) : 49 - 65
[27] Foreign accent syndrome as a developmental motor speech disorder
Marien, Peter
Verhoeven, Jo
Wackenier, Peggy
Engelborghs, Sebastiaan
De Deyn, Peter P.
CORTEX, 2009, 45 (07) : 870 - 878
[28] Synthesis of emotional speech by prosody modification of vowel segments of neutral speech
Fahad M.S.
Singh S.
Gupta S.
Deepak A.
Abhinav
Recent Advances in Computer Science and Communications, 2021, 14 (04) : 1226 - 1235
[29] Relative Salience of Speech Rhythm and Speech Rate on Perceived Foreign Accent in a Second Language
Polyanskaya, Leona
Ordin, Mikhail
Busa, Maria Grazia
LANGUAGE AND SPEECH, 2017, 60 (03) : 333 - 355
[30] The role of foreign accent and short-term exposure in speech-in-speech recognition
Susanne Brouwer
Attention, Perception, & Psychophysics, 2019, 81 : 2053 - 2062

← 1 2 3 4 5 →