Voice Conversion in High-order Eigen Space Using Deep Belief Nets

被引:0
|
作者
Nakashika, Toru [1 ]
Takashima, Ryoichi [1 ]
Takiguchi, Tetsuya [2 ]
Ariki, Yasuo [2 ]
机构
[1] Kobe Univ, Grad Sch Syst Informat, 1-1 Rokkodai, Kobe, Hyogo, Japan
[2] Kobe Univ, Org Adv Sci & Technol, Kobe, Hyogo, Japan
关键词
voice conversion; deep learning; deep belief nets; SPEECH RECOGNITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a voice conversion technique using Deep Belief Nets (DBNs) to build high-order eigen spaces of the source/target speakers, where it is easier to convert the source speech to the target speech than in the traditional cepstrum space. DBNs have a deep architecture that automatically discovers abstractions to maximally express the original input features. If we train the DBNs using only the speech of an individual speaker, it can be considered that there is less phonological information and relatively more speaker individuality in the output features at the highest layer. Training the DBNs for a source speaker and a target speaker, we can then connect and convert the speaker individuality abstractions using Neural Networks (NNs). The converted abstraction of the source speaker is then brought back to the cepstrum space using an inverse process of the DBNs of the target speaker. We conducted speaker voice conversion experiments and confirmed the efficacy of our method with respect to subjective and objective criteria, comparing it with the conventional Gaussian Mixture Model -based method.
引用
收藏
页码:369 / 372
页数:4
相关论文
共 50 条
  • [31] High-order disclinations in space-variant polarization
    Khajavi, B.
    Galvez, E. J.
    JOURNAL OF OPTICS, 2016, 18 (08)
  • [32] Evaluating High-Order Predictive Distributions in Deep Learning
    Osband, Ian
    Wen, Zheng
    Asghari, Seyed Mohammad
    Dwaracherla, Vikranth
    Lu, Xiuyuan
    Van Roy, Benjamin
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 1552 - 1560
  • [33] Deep High-order Supervised Hashing for Image Retrieval
    Cheng, Jingdong
    Sun, Qiule
    Zhang, Jianxin
    Wei, Xiaopeng
    Zhang, Qiang
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2693 - 2698
  • [34] Improving Deep Forest by Exploiting High-order Interactions
    Chen, Yi-He
    Lyu, Shen-Huan
    Jiang, Yuan
    2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021), 2021, : 1030 - 1035
  • [35] Learning Sparse Feature Representations Using Probabilistic Quadtrees and Deep Belief Nets
    Basu, Saikat
    Karki, Manohar
    Ganguly, Sangram
    DiBiano, Robert
    Mukhopadhyay, Supratik
    Gayaka, Shreekant
    Kannan, Rajgopal
    Nemani, Ramakrishna
    NEURAL PROCESSING LETTERS, 2017, 45 (03) : 855 - 867
  • [36] Learning Sparse Feature Representations Using Probabilistic Quadtrees and Deep Belief Nets
    Saikat Basu
    Manohar Karki
    Sangram Ganguly
    Robert DiBiano
    Supratik Mukhopadhyay
    Shreekant Gayaka
    Rajgopal Kannan
    Ramakrishna Nemani
    Neural Processing Letters, 2017, 45 : 855 - 867
  • [37] High-Order Mode Conversion Employing a Fiber Bundle Scheme
    Chen, Liang
    Guo, Huiyi
    Shi, Zekun
    Hao, Yundong
    Wang, Zhi
    Liu, Yan-ge
    JOURNAL OF LIGHTWAVE TECHNOLOGY, 2024, 42 (23) : 8399 - 8404
  • [38] Seeded and unseeded high-order parametric down-conversion
    Okoth, Cameron
    Cavanna, Andrea
    Joly, Nicolas Y.
    Chekhova, Maria, V
    PHYSICAL REVIEW A, 2019, 99 (04)
  • [39] THz conversion efficiency in DSTMS limited by high-order nonlinearities
    Li, J.
    Rana, R.
    Zhu, L. G.
    Liu, C. L.
    Schneider, H.
    Pashkin, A.
    2021 46TH INTERNATIONAL CONFERENCE ON INFRARED, MILLIMETER AND TERAHERTZ WAVES (IRMMW-THZ), 2021,
  • [40] Joint High-Order Simulation of Spatially Correlated Variables Using High-Order Spatial Statistics
    Ilnur Minniakhmetov
    Roussos Dimitrakopoulos
    Mathematical Geosciences, 2017, 49 : 39 - 66