Mandarin-Tibetan Cross-Lingual Voice Conversion System Based on Deep Neural Network

被引:1
|
作者
Gan, Zhenye [1 ,2 ]
Xing, Xiaotian [1 ]
Yang, Hongwu [1 ,2 ]
Zhao, Guangying [1 ]
机构
[1] Northwest Normal Univ, Coll Phys & Elect Engn, Lanzhou 730000, Gansu, Peoples R China
[2] Engn Res Ctr Gansu Prov Intelligent Informat Tech, Lanzhou 730000, Gansu, Peoples R China
来源
PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018) | 2018年
基金
中国国家自然科学基金;
关键词
Cross-lingual voice conversion; speech recognition; speech synthesis; DNN;
D O I
10.1145/3297156.3297221
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper realizes a Mandarin-Tibetan cross-lingual voice conversion system to solve the communication problem between the Mandarin speaker and the Tibetan speaker. Mandarin speech recognition and Tibetan speech synthesis techniques based on deep neural network(DNN) are adopted to convert Mandarin to Tibetan. In this way, we can effectively avoid the problem of building large parallel corpus and complex conversion rules. Meanwhile, we modify the converted Tibetan speech features so that it is perceived as a sentence uttered by the Mandarin speaker. The experimental results show that Mean Opinion Score (MOS) is 3.26 points and the degradation mean opinion score (DMOS) of the timbre similarity between the converted Tibetan speech and the Mandarin speech is 3.07 points.
引用
收藏
页码:67 / 71
页数:5
相关论文
共 50 条
  • [41] Continuous vocoder applied in deep neural network based voice conversion
    Al-Radhi, Mohammed Salah
    Csapo, Tamas Gabor
    Nemeth, Geza
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (23) : 33549 - 33572
  • [42] Correction: Enhancing Cross-lingual Biomedical Concept Normalization Using Deep Neural Network Pretrained Language Models
    Ying-Chi Lin
    Phillip Hoffmann
    Erhard Rahm
    SN Computer Science, 3 (6)
  • [43] Enhancing Cross-lingual Semantic Annotations using Deep Network Sentence Embeddings
    Lin, Ying-Chi
    Hoffmann, Phillip
    Rahm, Erhard
    HEALTHINF: PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES - VOL. 5: HEALTHINF, 2021, : 188 - 199
  • [44] Cross-lingual Aspect-level Sentiment Classification with Graph Neural Network
    Bao X.-Y.
    Jiang X.-T.
    Wang Z.-Q.
    Zhou G.-D.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (02): : 676 - 689
  • [45] Bi-Neighborhood Graph Neural Network for cross-lingual entity alignment
    Shi, Xinchen
    Li, Bin
    Chen, Ling
    Yang, Chao
    KNOWLEDGE-BASED SYSTEMS, 2023, 277
  • [46] Cross-lingual Knowledge Graph Alignment via Graph Matching Neural Network
    Xu, Kun
    Wang, Liwei
    Yu, Mo
    Feng, Yansong
    Song, Yan
    Wang, Zhiguo
    Yu, Dong
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3156 - 3161
  • [47] Neural Cross-Lingual Relation Extraction Based on BilingualWord Embedding Mapping
    Ni, Jian
    Florian, Radu
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 399 - 409
  • [48] A Semantic Content Based Recommendation System for Cross-Lingual News
    Ferdous, Syeda Nyma
    Ali, Muhammad Masroor
    2017 IEEE INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR), 2017,
  • [49] Text-To-Speech with cross-lingual Neural Network-based grapheme-to-phoneme models
    Gonzalvo, Xavi
    Podsiadlo, Monika
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 765 - 769
  • [50] DISENTANGLED SPEECH REPRESENTATION LEARNING FOR ONE-SHOT CROSS-LINGUAL VOICE CONVERSION USING β-VAE
    Lu, Hui
    Wang, Disong
    Wu, Xixin
    Wu, Zhiyong
    Liu, Xunying
    Meng, Helen
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 814 - 821