Wavelet Analysis of Speaker Dependent and Independent Prosody for Voice Conversion

Cited by: 0
Authors
Sisman, Berrak [1]
Li, Haizhou [1]
Affiliations
[1] Natl Univ Singapore, Singapore, Singapore
Keywords
Wavelet transform; prosody analysis; voice conversion
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Thus far, voice conversion studies have focused mainly on spectral conversion. However, speaker identity is also characterized by prosody features such as fundamental frequency (F0) and energy contour. We believe that a better understanding of speaker dependent and speaker independent prosody features allows an analytic approach that handles voice conversion more effectively. We consider that speaker dependent features reflect the speaker's individuality, while speaker independent features reflect the expression of linguistic content. Therefore, the former are to be converted, while the latter are to be carried over from source to target during conversion. To this end, we provide an analysis of speaker dependent and speaker independent prosody patterns at different temporal scales using the wavelet transform. The centrepiece of this paper is the understanding that a speech utterance can be characterized by speaker dependent and speaker independent features in its prosodic manifestations. Experiments show that the proposed prosody analysis scheme consistently improves prosody conversion performance under the sparse representation framework.
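The multi-scale analysis mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not name the mother wavelet, the scales, or the preprocessing, so the Mexican hat wavelet, the dyadic scales, the log and z-score normalisation, and the synthetic F0 contour below are all assumptions chosen for illustration. The helper names `mexican_hat`, `cwt`, and `zscore` are hypothetical.

```python
import math


def mexican_hat(t):
    """Mexican hat (Ricker) mother wavelet, unnormalised."""
    return (1.0 - t * t) * math.exp(-0.5 * t * t)


def cwt(signal, scales):
    """Direct-sum continuous wavelet transform.

    Returns one list of coefficients per scale, each as long as the signal.
    """
    n = len(signal)
    rows = []
    for s in scales:
        row = []
        for b in range(n):  # b: position (frame index) of the wavelet
            acc = sum(signal[k] * mexican_hat((k - b) / s) for k in range(n))
            row.append(acc / math.sqrt(s))
        rows.append(row)
    return rows


def zscore(values):
    """Zero-mean, unit-variance normalisation (assumed preprocessing)."""
    mean = sum(values) / len(values)
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / len(values))
    return [(v - mean) / std for v in values]


# Synthetic F0 contour in Hz: a slow phrase-level rise-fall plus faster
# syllable-level wiggles. Both components are made up for illustration.
n_frames = 64
f0 = [
    120.0
    + 20.0 * math.sin(math.pi * i / n_frames)
    + 5.0 * math.sin(2.0 * math.pi * i / 8.0)
    for i in range(n_frames)
]

log_f0 = zscore([math.log(v) for v in f0])
scales = [1, 2, 4, 8, 16]  # dyadic scales in frames (an assumption)
coeffs = cwt(log_f0, scales)  # coeffs[i][b]: scale scales[i], frame b
```

Each row of `coeffs` describes the contour at one temporal scale: small scales respond to fast, local F0 movements and large scales to slow, phrase-level trends. Which scales would count as speaker dependent or speaker independent is not stated in this record, so the sketch stops at the decomposition.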
Pages: 52-56
Page count: 5