ACCENT CONVERSION USING PHONETIC POSTERIORGRAMS

Authors
Zhao, Guanlong [1]
Sonsaat, Sinem [2]
Levis, John [2]
Chukharev-Hudilainen, Evgeny [2]
Gutierrez-Osuna, Ricardo [1]
Affiliations
[1] Texas A&M Univ, Dept Comp Sci & Engn, College Stn, TX 77843 USA
[2] Iowa State Univ, Dept English, Ames, IA USA
Funding
National Science Foundation (USA)
Keywords
speech synthesis; accent conversion; frame pairing; posteriorgram; acoustic model; voice conversion; foreign accent; speech
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Accent conversion (AC) aims to transform non-native speech to sound as if the speaker had a native accent. This can be achieved by mapping source spectra from a native speaker into the acoustic space of the non-native speaker. In prior work, we proposed an AC approach that matches frames between the two speakers based on their acoustic similarity after compensating for differences in vocal tract length. In this paper, we propose an approach that matches frames between the two speakers based on their phonetic (rather than acoustic) similarity. Namely, we map frames from the two speakers into a phonetic posteriorgram using speaker-independent acoustic models trained on native speech. We evaluate the proposed algorithm on a corpus containing multiple native and non-native speakers. Compared to the previous AC algorithm, the proposed algorithm improves the ratings of acoustic quality (20% increase in mean opinion score) and native accent (69% preference) while retaining the voice identity of the non-native speaker.
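The frame-pairing idea described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes the phonetic posteriorgrams have already been computed as arrays whose rows are per-frame probability distributions over phonetic classes, and it assumes symmetric KL divergence as the similarity measure, which the abstract does not specify. The function names `symmetric_kl` and `pair_frames` are hypothetical.

```python
import numpy as np


def symmetric_kl(p, q, eps=1e-10):
    """Pairwise symmetric KL divergence between rows of p (n, k) and q (m, k).

    Assumption: each row is a posterior distribution over k phonetic classes.
    Returns an (n, m) matrix D with D[i, j] = KL(p_i || q_j) + KL(q_j || p_i).
    """
    # Clip and renormalize so the logarithms are finite.
    p = np.clip(np.asarray(p, dtype=float), eps, None)
    q = np.clip(np.asarray(q, dtype=float), eps, None)
    p = p / p.sum(axis=1, keepdims=True)
    q = q / q.sum(axis=1, keepdims=True)
    log_p, log_q = np.log(p), np.log(q)

    # sum_k (p_k - q_k) * (log p_k - log q_k), expanded into four terms.
    self_p = (p * log_p).sum(axis=1)[:, None]
    self_q = (q * log_q).sum(axis=1)[None, :]
    cross = p @ log_q.T + log_p @ q.T
    return self_p + self_q - cross


def pair_frames(native_ppg, nonnative_ppg):
    """For each native frame, return the index of the phonetically
    closest non-native frame (hypothetical helper for illustration)."""
    return symmetric_kl(native_ppg, nonnative_ppg).argmin(axis=1)


# Toy posteriorgrams over three phonetic classes (made-up values).
native = np.array([[0.90, 0.05, 0.05],
                   [0.10, 0.80, 0.10]])
nonnative = np.array([[0.10, 0.10, 0.80],
                      [0.85, 0.10, 0.05],
                      [0.05, 0.90, 0.05]])
pairs = pair_frames(native, nonnative)
print(pairs)
```

The resulting index pairs could then serve as training data for a spectral mapping between the two speakers. Because both posteriorgrams come from the same speaker-independent acoustic model, the comparison takes place in a space that is intended to be independent of speaker identity.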
Pages: 5314-5318
Page count: 5