Multimodal voice conversion based on non-negative matrix factorization

被引:0
|
作者
Kenta Masaka
Ryo Aihara
Tetsuya Takiguchi
Yasuo Ariki
机构
[1] Kobe University,Graduate School of System Informatics
[2] Kobe University,Organization of Advanced Science and Technology
关键词
Voice conversion; Multimodal; Image features; Non-negative matrix factorization; Noise robustness;
D O I
暂无
中图分类号
学科分类号
摘要
A multimodal voice conversion (VC) method for noisy environments is proposed. In our previous non-negative matrix factorization (NMF)-based VC method, source and target exemplars are extracted from parallel training data, in which the same texts are uttered by the source and target speakers. The input source signal is then decomposed into source exemplars, noise exemplars, and their weights. Then, the converted speech is constructed from the target exemplars and the weights related to the source exemplars. In this study, we propose multimodal VC that improves the noise robustness of our NMF-based VC method. Furthermore, we introduce the combination weight between audio and visual features and formulate a new cost function to estimate audio-visual exemplars. Using the joint audio-visual features as source features, VC performance is improved compared with that of a previous audio-input exemplar-based VC method. The effectiveness of the proposed method is confirmed by comparing its effectiveness with that of a conventional audio-input NMF-based method and a Gaussian mixture model-based method.
引用
收藏
相关论文
共 50 条
  • [31] Non-negative Matrix Factorization for EEG
    Jahan, Ibrahim Salem
    Snasel, Vaclav
    2013 INTERNATIONAL CONFERENCE ON TECHNOLOGICAL ADVANCES IN ELECTRICAL, ELECTRONICS AND COMPUTER ENGINEERING (TAEECE), 2013, : 183 - 187
  • [32] Algorithms for non-negative matrix factorization
    Lee, DD
    Seung, HS
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13 : 556 - 562
  • [33] Non-negative matrix factorization with α-divergence
    Cichocki, Andrzej
    Lee, Hyekyoung
    Kim, Yong-Deok
    Choi, Seungjin
    PATTERN RECOGNITION LETTERS, 2008, 29 (09) : 1433 - 1440
  • [34] Dropout non-negative matrix factorization
    He, Zhicheng
    Liu, Jie
    Liu, Caihua
    Wang, Yuan
    Yin, Airu
    Huang, Yalou
    KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 60 (02) : 781 - 806
  • [35] Non-Negative Matrix Factorization with Constraints
    Liu, Haifeng
    Wu, Zhaohui
    PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 506 - 511
  • [36] Stretched non-negative matrix factorization
    Gu, Ran
    Rakita, Yevgeny
    Lan, Ling
    Thatcher, Zach
    Kamm, Gabrielle E.
    O'Nolan, Daniel
    Mcbride, Brennan
    Wustrow, Allison
    Neilson, James R.
    Chapman, Karena W.
    Du, Qiang
    Billinge, Simon J. L.
    NPJ COMPUTATIONAL MATERIALS, 2024, 10 (01)
  • [37] Uniqueness of non-negative matrix factorization
    Laurberg, Hans
    2007 IEEE/SP 14TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2007, : 44 - 48
  • [38] Non-negative Matrix Factorization on Manifold
    Cai, Deng
    He, Xiaofei
    Wu, Xiaoyun
    Han, Jiawei
    ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 63 - +
  • [39] Bayesian Non-negative Matrix Factorization
    Schmidt, Mikkel N.
    Winther, Ole
    Hansen, Lars Kai
    INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2009, 5441 : 540 - +
  • [40] Non-negative Matrix Factorization on GPU
    Platos, Jan
    Gajdos, Petr
    Kroemer, Pavel
    Snasel, Vaclav
    NETWORKED DIGITAL TECHNOLOGIES, PT 1, 2010, 87 : 21 - 30