Semi-supervised voice conversion with amortized variational inference

Cited by: 2
Authors
Stephenson, Cory [1]
Keskin, Gokce [1]
Thomas, Anil [1]
Elibol, Oguz H. [1]
Affiliations
[1] Intel AI Lab, Santa Clara, CA 95054 USA
Source
Keywords
voice conversion; semi-supervised learning; variational inference; deep learning;
DOI
10.21437/Interspeech.2019-1840
Chinese Library Classification (CLC) number
R36 [Pathology]; R76 [Otorhinolaryngology];
Discipline classification code
100104 ; 100213 ;
Abstract
In this work we introduce a semi-supervised approach to the voice conversion problem, in which speech from a source speaker is converted into speech of a target speaker. The proposed method makes use of both parallel and non-parallel utterances from the source and target simultaneously during training. This approach can be used to extend existing parallel data voice conversion systems such that they can be trained with semi-supervision. We show that incorporating semi-supervision improves the voice conversion performance compared to fully supervised training when the number of parallel utterances is limited, as in many practical applications. Additionally, we find that increasing the number of non-parallel utterances used in training continues to improve performance when the amount of parallel training data is held constant.
Pages: 729-733
Page count: 5
Related papers
50 in total
  • [21] Amortized Variational Inference: A Systematic Review
    Ganguly, Ankush
    Jain, Sanjana
    Watchareeruetai, Ukrit
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2023, 78 : 167 - 215
  • [22] Semi-supervised dimensional sentiment analysis with variational autoencoder
    Wu, Chuhan
    Wu, Fangzhao
    Wu, Sixing
    Yuan, Zhigang
    Liu, Junxin
    Huang, Yongfeng
    KNOWLEDGE-BASED SYSTEMS, 2019, 165 : 30 - 39
  • [23] Semi-supervised Variational Autoencoder for WiFi Indoor Localization
    Chidlovskii, Boris
    Antsfeld, Leonid
    2019 INTERNATIONAL CONFERENCE ON INDOOR POSITIONING AND INDOOR NAVIGATION (IPIN), 2019,
  • [24] Adversarial Variational Embedding for Robust Semi-supervised Learning
    Zhang, Xiang
    Yao, Lina
    Yuan, Feng
    KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 139 - 147
  • [25] Semi-Supervised Channel Equalization Using Variational Autoencoders
    Burshtein, David
    Bery, Eli
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (12) : 19681 - 19695
  • [26] Semi-Supervised Variational Reasoning for Medical Dialogue Generation
    Li, Dongdong
    Ren, Zhaochun
    Ren, Pengjie
    Chen, Zhumin
    Fan, Miao
    Ma, Jun
    de Rijke, Maarten
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 544 - 554
  • [27] Customization of latent space in semi-supervised Variational AutoEncoder
    An, Seunghwan
    Jeon, Jong-June
    PATTERN RECOGNITION LETTERS, 2024, 177 : 54 - 60
  • [28] Truncated Variational EM for Semi-Supervised Neural Simpletrons
    Forster, Dennis
    Luecke, Joerg
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 3769 - 3776
  • [29] ViVA: Semi-supervised Visualization via Variational Autoencoders
    An, Sungtae
    Hong, Shenda
    Sun, Jimeng
    20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020, : 22 - 31
  • [30] Gene Regulatory Network Inference: A Semi-supervised Approach
    Augustine, Jisha
    Jereesh, A. S.
    2017 INTERNATIONAL CONFERENCE OF ELECTRONICS, COMMUNICATION AND AEROSPACE TECHNOLOGY (ICECA), VOL 1, 2017, : 68 - 72