ACCENT CONVERSION USING PHONETIC POSTERIORGRAMS

Times cited: 0
Authors
Zhao, Guanlong [1]
Sonsaat, Sinem [2]
Levis, John [2]
Chukharev-Hudilainen, Evgeny [2]
Gutierrez-Osuna, Ricardo [1]
Affiliations
[1] Texas A&M Univ, Dept Comp Sci & Engn, College Stn, TX 77843 USA
[2] Iowa State Univ, Dept English, Ames, IA USA
Funding
National Science Foundation (USA);
Keywords
speech synthesis; accent conversion; frame pairing; posteriorgram; acoustic model; VOICE CONVERSION; FOREIGN ACCENT; SPEECH;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
Accent conversion (AC) aims to transform non-native speech to sound as if the speaker had a native accent. This can be achieved by mapping source spectra from a native speaker into the acoustic space of the non-native speaker. In prior work, we proposed an AC approach that matches frames between the two speakers based on their acoustic similarity after compensating for differences in vocal tract length. In this paper, we propose an approach that matches frames between the two speakers based on their phonetic (rather than acoustic) similarity. Namely, we map frames from the two speakers into a phonetic posteriorgram using speaker-independent acoustic models trained on native speech. We evaluate the proposed algorithm on a corpus containing multiple native and non-native speakers. Compared to the previous AC algorithm, the proposed algorithm improves the ratings of acoustic quality (20% increase in mean opinion score) and native accent (69% preference) while retaining the voice identity of the non-native speaker.
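The frame-pairing idea in the abstract can be illustrated with a small sketch. It assumes each frame has already been mapped to a phonetic posteriorgram (a probability vector over phonetic classes) by some speaker-independent acoustic model, which is not shown here. The abstract does not say how phonetic similarity is measured, so the symmetric KL divergence used below is an assumption, and the names `symmetric_kl` and `pair_frames` and the toy values are hypothetical.

```python
import math


def symmetric_kl(p, q, eps=1e-10):
    """Symmetric KL divergence KL(p||q) + KL(q||p) between two posterior vectors.

    Assumption: this distance is chosen for illustration only; the abstract
    does not name the similarity measure.
    """
    total = 0.0
    for pi, qi in zip(p, q):
        pi = max(pi, eps)  # floor probabilities to avoid log(0)
        qi = max(qi, eps)
        total += (pi - qi) * (math.log(pi) - math.log(qi))
    return total


def pair_frames(source_ppg, target_ppg):
    """For each source frame, return the index of the phonetically closest target frame."""
    pairs = []
    for p in source_ppg:
        best = min(range(len(target_ppg)),
                   key=lambda j: symmetric_kl(p, target_ppg[j]))
        pairs.append(best)
    return pairs


# Toy posteriorgrams over three hypothetical phonetic classes.
native_ppg = [[0.9, 0.05, 0.05], [0.1, 0.8, 0.1]]
nonnative_ppg = [[0.2, 0.7, 0.1], [0.05, 0.05, 0.9], [0.8, 0.1, 0.1]]

pairs = pair_frames(native_ppg, nonnative_ppg)
print(pairs)  # index of the matched non-native frame for each native frame
```

Because both speakers' frames live in the same phonetic space, the pairing does not depend on speaker-specific acoustics such as vocal tract length, which is the contrast with acoustic matching that the abstract draws.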
Pages: 5314-5318
Page count: 5
Related papers
50 in total
  • [21] Phrasal accent in phonetic, functional and semantic aspects
    Krivnova, Olga F.
    VOPROSY YAZYKOZNANIYA, 2019, (02): : 160 - 168
  • [22] PHONETIC ANALYSIS OF A CASE OF FOREIGN ACCENT SYNDROME
    INGRAM, JCL
    MCCORMACK, PF
    KENNEDY, M
    JOURNAL OF PHONETICS, 1992, 20 (04) : 457 - 474
  • [23] PHONETIC INTERPRETATION OF THE WORD ACCENT CONTRAST IN SWEDISH
    ENGSTRAND, O
    PHONETICA, 1995, 52 (03) : 171 - 179
  • [24] Phonological and phonetic marking of information status in Foreign Accent Syndrome
    Kuschmann, Anja
    Lowit, Anja
    INTERNATIONAL JOURNAL OF LANGUAGE & COMMUNICATION DISORDERS, 2012, 47 (06) : 738 - 749
  • [25] Accent Conversion with Articulatory Representations
    Siriwardena, Yashish M.
    Swedlow, Nathan
    Howard, Audrey
    Gitterman, Evan
    Darcy, Dan
    Espy-Wilson, Carol
    Fanelli, Andrea
    INTERSPEECH 2024, 2024, : 4383 - 4387
  • [26] ACCENT AND PHONETIC CHANGE IN THE ROMANCE LANGUAGES - GERMAN - GEISLER,H
    PFISTER, M
    ZEITSCHRIFT FUR ROMANISCHE PHILOLOGIE, 1995, 111 (01): : 81 - 84
  • [27] Improved Accent Classification Combining Phonetic Vowels with Acoustic Features
    Ge, Zhenhao
    2015 8th International Congress on Image and Signal Processing (CISP), 2015, : 1204 - 1209
  • [28] VOWEL CONVERSION BY PHONETIC SEGMENTATION
    de Obalda, Carlos
    DAFX-15: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON DIGITAL AUDIO EFFECTS, 2015, : 300 - 306
  • [29] Accent Conversion using Pre-trained Model and Synthesized Data from Voice Conversion
    Tuan Nam Nguyen
    Ngoc-Quan Pham
    Waibel, Alexander
    INTERSPEECH 2022, 2022, : 2583 - 2587
  • [30] END-TO-END ACCENT CONVERSION WITHOUT USING NATIVE UTTERANCES
    Liu, Songxiang
    Wang, Disong
    Cao, Yuewen
    Sun, Lifa
    Wu, Xixin
    Kang, Shiyin
    Wu, Zhiyong
    Liu, Xunying
    Su, Dan
    Yu, Dong
    Meng, Helen
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6289 - 6293