Semi-supervised voice conversion with amortized variational inference

被引:2
|
作者
Stephenson, Cory [1 ]
Keskin, Gokce [1 ]
Thomas, Anil [1 ]
Elibol, Oguz H. [1 ]
机构
[1] Intel AI Lab, Santa Clara, CA 95054 USA
来源
关键词
voice conversion; semi-supervised learning; variational inference; deep learning;
D O I
10.21437/Interspeech.2019-1840
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
In this work we introduce a semi-supervised approach to the voice conversion problem, in which speech from a source speaker is converted into speech of a target speaker. The proposed method makes use of both parallel and non-parallel utterances from the source and target simultaneously during training. This approach can be used to extend existing parallel data voice conversion systems such that they can be trained with semi-supervision. We show that incorporating semi-supervision improves the voice conversion performance compared to fully supervised training when the number of parallel utterances is limited as in many practical applications. Additionally, we find that increasing the number non-parallel utterances used in training continues to improve performance when the amount of parallel training data is held constant.
引用
收藏
页码:729 / 733
页数:5
相关论文
共 50 条
  • [41] Variational quantum semi-supervised classifier based on label propagation
    侯艳艳
    李剑
    陈秀波
    叶崇强
    Chinese Physics B, 2023, 32 (07) : 326 - 336
  • [42] Gaussian Mixture Variational Autoencoder for Semi-Supervised Topic Modeling
    Zhou, Cangqi
    Ban, Hao
    Zhang, Jing
    Li, Qianmu
    Zhang, Yinghua
    IEEE ACCESS, 2020, 8 : 106843 - 106854
  • [43] Hypergraph Variational Autoencoder for Multimodal Semi-supervised Representation Learning
    Liu, Jingquan
    Du, Xiaoyong
    Li, Yuanzhe
    Hu, Weidong
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 395 - 406
  • [44] Interpretable Operational Risk Classification with Semi-Supervised Variational Autoencoder
    Zhou, Fan
    Zhang, Shengming
    Yang, Yi
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 846 - 852
  • [45] Semi-supervised Variational Multi-view Anomaly Detection
    Wang, Shaoshen
    Chen, Ling
    Hussain, Farookh
    Zhang, Chengqi
    WEB AND BIG DATA, APWEB-WAIM 2021, PT I, 2021, 12858 : 125 - 133
  • [46] Semi-Supervised Variational Autoencoders for Out-of-Distribution Generation
    Lavda, Frantzeska
    Kalousis, Alexandros
    ENTROPY, 2023, 25 (12)
  • [47] Disentangled Variational Auto-Encoder for semi-supervised learning
    Li, Yang
    Pan, Quan
    Wang, Suhang
    Peng, Haiyun
    Yang, Tao
    Cambria, Erik
    INFORMATION SCIENCES, 2019, 482 : 73 - 85
  • [48] Graph Regularized Variational Ladder Networks for Semi-Supervised Learning
    Hu, Cong
    Song, Xiao-Ning
    IEEE ACCESS, 2020, 8 : 206280 - 206288
  • [49] Variational quantum semi-supervised classifier based on label propagation
    Hou, Yan-Yan
    Li, Jian
    Chen, Xiu-Bo
    Ye, Chong-Qiang
    CHINESE PHYSICS B, 2023, 32 (07)
  • [50] SEMI-SUPERVISED AND POPULATION BASED TRAINING FOR VOICE COMMANDS RECOGNITION
    Elibol, Oguz H.
    Keskin, Gokce
    Thomas, Anil
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6371 - 6375