Duration Controllable Voice Conversion via Phoneme-Based Information Bottleneck

被引:0
|
作者
Lee, Sang-Hoon [1 ]
Noh, Hyeong-Rae [1 ]
Nam, Woo-Jeoung [2 ]
Lee, Seong-Whan [3 ]
机构
[1] Department of Brain and Cognitive Engineering, Korea University, Seoul,02841, Korea, Republic of
[2] Department of Computer and Radio Communications Engineering, Korea University, Seoul,02841, Korea, Republic of
[3] Department of Artificial Intelligence, Korea University, Seoul,02841, Korea, Republic of
关键词
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
页码:1173 / 1183
相关论文
共 36 条
  • [31] Speaker Dependent Approach for Enhancing a Glossectomy Patient's Speech via GMM-based Voice Conversion
    Tanaka, Kei
    Hara, Sunao
    Abe, Masanobu
    Sato, Masaaki
    Minagi, Shogo
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3384 - 3388
  • [32] Deep Learning-Based Forward-Aware Quantization for Satellite-Aided Communications via Information Bottleneck Method
    Hummert, Matthias
    Hassanpour, Shayan
    Wuebben, Dirk
    Dekorsy, Armin
    2024 JOINT EUROPEAN CONFERENCE ON NETWORKS AND COMMUNICATIONS & 6G SUMMIT, EUCNC/6G SUMMIT 2024, 2024, : 634 - 639
  • [33] VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice Conversion
    Wang, Disong
    Deng, Liqun
    Yeung, Yu Ting
    Chen, Xiao
    Liu, Xunying
    Meng, Helen
    INTERSPEECH 2021, 2021, : 1344 - 1348
  • [34] LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with Inference Acceleration via Latent Consistency Distillation
    Chen, Shihao
    Gu, Yu
    Cui, Jianwei
    Zhang, Jie
    Chen, Rilin
    Dai, Lirong
    2024 IEEE 14TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, ISCSLP 2024, 2024, : 309 - 313
  • [35] Speech Chain VC: Linking Linguistic and Acoustic Levels via Latent Distinctive Features for RBM-Based Voice Conversion
    Kishida, Takuya
    Nakashika, Toru
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (11) : 2340 - 2350
  • [36] 12-Lead ECG signal classification for detecting ECG arrhythmia via an information bottleneck-based multi-scale network
    Zhang, Siyuan
    Lian, Cheng
    Xu, Bingrong
    Su, Yixin
    Alhudhaif, Adi
    INFORMATION SCIENCES, 2024, 662