Semi-supervised voice conversion with amortized variational inference

被引:2
|
作者
Stephenson, Cory [1 ]
Keskin, Gokce [1 ]
Thomas, Anil [1 ]
Elibol, Oguz H. [1 ]
机构
[1] Intel AI Lab, Santa Clara, CA 95054 USA
来源
关键词
voice conversion; semi-supervised learning; variational inference; deep learning;
D O I
10.21437/Interspeech.2019-1840
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
In this work we introduce a semi-supervised approach to the voice conversion problem, in which speech from a source speaker is converted into speech of a target speaker. The proposed method makes use of both parallel and non-parallel utterances from the source and target simultaneously during training. This approach can be used to extend existing parallel data voice conversion systems such that they can be trained with semi-supervision. We show that incorporating semi-supervision improves the voice conversion performance compared to fully supervised training when the number of parallel utterances is limited as in many practical applications. Additionally, we find that increasing the number non-parallel utterances used in training continues to improve performance when the amount of parallel training data is held constant.
引用
收藏
页码:729 / 733
页数:5
相关论文
共 50 条
  • [1] Emotional Voice Conversion with Semi-Supervised Generative Modeling
    Zhu, Hai
    Zhan, Huayi
    Cheng, Hong
    Wu, Ying
    INTERSPEECH 2023, 2023, : 2278 - 2282
  • [2] Semi-Supervised Learning with Variational Bayesian Inference and Maximum Uncertainty Regularization
    Do, Kien
    Truyen Tran
    Venkatesh, Svetha
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7236 - 7244
  • [3] Semi-supervised Stochastic Multi-Domain Learning using Variational Inference
    Li, Yitong
    Baldwin, Timothy
    Cohn, Trevor
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 1923 - 1934
  • [4] Semi-Supervised Adversarial Variational Autoencoder
    Zemouri, Ryad
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2020, 2 (03): : 361 - 378
  • [5] Semi-supervised Parsing with Variational Autoencoding Parser
    Zhang, Xiao
    Goldwasser, Dan
    16TH INTERNATIONAL CONFERENCE ON PARSING TECHNOLOGIES AND IWPT 2020 SHARED TASK ON PARSING INTO ENHANCED UNIVERSAL DEPENDENCIES, 2020, : 40 - 47
  • [6] Variational Autoencoder for Semi-Supervised Text Classification
    Xu, Weidi
    Sun, Haoze
    Deng, Chao
    Tan, Ying
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3358 - 3364
  • [7] Variational Sequential Labelers for Semi-Supervised Learning
    Chen, Mingda
    Tang, Qingming
    Livescu, Karen
    Gimpel, Kevin
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 215 - 226
  • [8] Variational Information Bottleneck for Semi-Supervised Classification
    Voloshynovskiy, Slava
    Taran, Olga
    Kondah, Mouad
    Holotyak, Taras
    Rezende, Danilo
    ENTROPY, 2020, 22 (09)
  • [9] Unified Robust Semi-Supervised Variational Autoencoder
    Chen, Xu
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [10] Infinite Variational Autoencoder for Semi-Supervised Learning
    Abbasnejad, M. Ehsan
    Dick, Anthony
    van den Hengel, Anton
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 781 - 790