VOICE CONVERSION BASED ON MATRIX VARIATE GAUSSIAN MIXTURE MODEL

被引:0
|
作者
Saito, Daisuke [1 ]
Doi, Hidenobu [1 ]
Minematsu, Nobuaki [1 ]
Hirose, Keikichi [1 ]
机构
[1] Univ Tokyo, Tokyo, Japan
关键词
Voice conversion; Gaussian mixture model; matrix variate distribution; matrix variate normal; matrix variate Gaussian mixture model; SPEECH RECOGNITION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper describes a novel approach to construct a mapping function between a given speaker pair using probability density functions (PDF) of matrix variate. In voice conversion studies, two important functions should be realized: 1) precise modeling of both the source and target feature spaces, and 2) construction of a proper transform function between these spaces. Voice conversion based on Gaussian mixture model (GMM) is widely used because of their flexibility and easiness in handling. In GMM-based approaches, a joint vector space of the source and target is first constructed, and the joint PDF of the two vectors is modeled as GMM in the joint vector space. The joint vector approach mainly focuses on precise modeling of the 'joint' feature space, and does not always construct a proper transform between two feature spaces. In contrast, the proposed method constructs the joint PDF as GMM in a matrix variate space whose row and column respectively correspond to the two functions, and it has potential to precisely model both the characteristics of the feature spaces and the relation between the source and target spaces. Experimental results show that the proposed method contributes to improve the performance of voice conversion.
引用
收藏
页码:567 / 571
页数:5
相关论文
共 50 条
  • [41] Speaker Adaptive Training for One-to-Many Eigenvoice Conversion Based on Gaussian Mixture Model
    Ohtani, Yamato
    Toda, Tomoki
    Saruwatari, Hiroshi
    Shikano, Kiyohiro
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2496 - 2499
  • [42] A constrained matrix-variate Gaussian process for transposable data
    Oluwasanmi Koyejo
    Cheng Lee
    Joydeep Ghosh
    Machine Learning, 2014, 97 : 103 - 127
  • [43] A constrained matrix-variate Gaussian process for transposable data
    Koyejo, Oluwasanmi
    Lee, Cheng
    Ghosh, Joydeep
    MACHINE LEARNING, 2014, 97 (1-2) : 103 - 127
  • [44] A Comparative Study of Gaussian Mixture Model and Radial Basis Function for Voice Recognition
    Anifowose, Fatai Adesina
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2010, 1 (03) : 1 - 9
  • [45] Supervised Gaussian Process Latent Variable Model Based on Gaussian Mixture Model
    Zhang, Jiayuan
    Zhu, Ziqi
    Zou, Jixin
    2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2017, : 124 - 129
  • [46] Background Subtraction Based on Gaussian Mixture Model
    Liu, Defang
    Deng, Ming
    Wang, Daimu
    MANUFACTURING PROCESS AND EQUIPMENT, PTS 1-4, 2013, 694-697 : 2021 - 2026
  • [47] Transfer Clustering Based on Gaussian Mixture Model
    Wang, Rongrong
    Zhou, Jin
    Liu, Xiangdao
    Han, Shiyuan
    Wang, Lin
    Chen, Yuehui
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 2522 - 2526
  • [48] Effect of the covariance matrix type on the CPT based soil stratification utilizing the Gaussian mixture model
    Thajeel, Jawad K.
    Adel, Raghad
    Ali, Haneen Muhammed
    Shakir, Ressol R.
    JOURNAL OF THE MECHANICAL BEHAVIOR OF MATERIALS, 2022, 31 (01) : 492 - 500
  • [49] Improved Gaussian mixture model and Gaussian mixture regression for learning from demonstration based on Gaussian noise scattering
    Feng, Chunhua
    Liu, Zhuang
    Li, Weidong
    Lu, Xin
    Jing, Yanguo
    Ma, Yongsheng
    ADVANCED ENGINEERING INFORMATICS, 2025, 65
  • [50] Multimode Dynamic Process Monitoring Based on Mixture Canonical Variate Analysis Model
    Wen, Qiaojun
    Ge, Zhiqiang
    Song, Zhihuan
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2015, 54 (05) : 1605 - 1614