Voice Conversion in High-order Eigen Space Using Deep Belief Nets

被引:0
|
作者
Nakashika, Toru [1 ]
Takashima, Ryoichi [1 ]
Takiguchi, Tetsuya [2 ]
Ariki, Yasuo [2 ]
机构
[1] Kobe Univ, Grad Sch Syst Informat, 1-1 Rokkodai, Kobe, Hyogo, Japan
[2] Kobe Univ, Org Adv Sci & Technol, Kobe, Hyogo, Japan
关键词
voice conversion; deep learning; deep belief nets; SPEECH RECOGNITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a voice conversion technique using Deep Belief Nets (DBNs) to build high-order eigen spaces of the source/target speakers, where it is easier to convert the source speech to the target speech than in the traditional cepstrum space. DBNs have a deep architecture that automatically discovers abstractions to maximally express the original input features. If we train the DBNs using only the speech of an individual speaker, it can be considered that there is less phonological information and relatively more speaker individuality in the output features at the highest layer. Training the DBNs for a source speaker and a target speaker, we can then connect and convert the speaker individuality abstractions using Neural Networks (NNs). The converted abstraction of the source speaker is then brought back to the cepstrum space using an inverse process of the DBNs of the target speaker. We conducted speaker voice conversion experiments and confirmed the efficacy of our method with respect to subjective and objective criteria, comparing it with the conventional Gaussian Mixture Model -based method.
引用
收藏
页码:369 / 372
页数:4
相关论文
共 50 条
  • [21] A Characterization of the Parameter Space for High-order Epistasis
    Yang, W.
    Gu, C. C.
    GENETIC EPIDEMIOLOGY, 2008, 32 (07) : 722 - 722
  • [22] AUTOMATIC DISCOVERY OF DESIGN TASK STRUCTURE USING DEEP BELIEF NETS
    Lan, Lijun
    Liu, Ying
    Lu, Wen Feng
    Alghamdi, Awn
    INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2015, VOL 7, 2016,
  • [23] Automatic Discovery of Design Task Structure Using Deep Belief Nets
    Lan, Lijun
    Liu, Ying
    Lu, Wen Feng
    JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2017, 17 (04)
  • [24] High-Order Conversion from Boolean to Arithmetic Masking
    Coron, Jean-Sebastien
    CRYPTOGRAPHIC HARDWARE AND EMBEDDED SYSTEMS - CHES 2017, 2017, 10529 : 93 - 114
  • [25] Feature Space Analysis of Modulation Classification Using Very High-Order Statistics
    Su, Wei
    IEEE COMMUNICATIONS LETTERS, 2013, 17 (09) : 1688 - 1691
  • [26] Transient analysis of a synchronous generator using a high-order state space representation
    Campero-Littlewood, E.
    Espinosa-Perez, G.
    Escarela-Perez, R.
    CERMA2006: ELECTRONICS, ROBOTICS AND AUTOMOTIVE MECHANICS CONFERENCE VOL 2, PROCEEDINGS, 2006, : 258 - +
  • [27] Voice conversion using partitions of spectral feature space
    Verhelst, W
    Mertens, J
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 365 - 368
  • [28] High-order mode based dispersion compensating modules using spatial mode conversion
    School of Electrical Engineering, Tel-Aviv University, Tel-Aviv, 69978, Israel
    不详
    不详
    J. Opt. Fiber Commun. Rep., 2007, 2 (110-172): : 110 - 172
  • [29] Evanescent-to-Propagating Wave Conversion Using Continuous High-Order Dielectric Metasurfaces
    Chelaresi, Hamid Akbari
    Salami, Pooria
    Yousefi, Leila
    2021 29TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2021, : 831 - 835
  • [30] A Deep Learning Method for Pathological Voice Detection using Convolutional Deep Belief Network
    Wu, Huiyi
    Soraghan, John
    Lowit, Anja
    Di Caterina, Gaetano
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 446 - 450