PITCH MODIFICATIONS OF SPEECH BASED ON AN ADAPTIVE HARMONIC MODEL

被引:0
|
作者
Kafentzis, George P. [1 ,2 ]
Degottex, Gilles [2 ]
Rosec, Olivier [3 ]
Stylianou, Yannis [2 ]
机构
[1] TECH ACTS MAS, Orange Labs, Lannion, France
[2] Univ Crete, Comp Sci Dept, Multimedia Informat Lab, Rethimnon, Greece
[3] Voxygen SA, Pole Phoenix, Pleumeur, France
关键词
Pitch modification; Speech analysis; Adaptive quasi-harmonic model; Adaptive harmonic model; TRANSFORMATION; DECOMPOSITION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, a simple method for pitch-scale modifications of speech based on a recently suggested model for AM-FM decomposition of speech signals, is presented. This model is referred to as the adaptive Harmonic Model (aHM). The aHM models speech as a sum of harmonically related sinusoids that can adapt to the local characteristics of the signal. It was shown that this model provides high quality reconstruction of speech and thus, it can also provide high quality pitch-scale modifications. For the latter, the amplitude envelope is estimated using the Discrete All-Pole (DAP) method, and the phase envelope estimation is performed by utilizing the concept of relative phase. Formal listening tests on a database of several languages show that the synthetic pitch-scaled waveforms are natural and free of some common artefacts encountered in other state-of-the-art models, such as HNM and STRAIGHT.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] A statistics-based pitch contour model for Mandarin speech
    Chen, SH
    Lai, WH
    Wang, YR
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 117 (02): : 908 - 925
  • [22] Adaptive pitch-based speech detection for hands-free applications
    Abu-El-Quran, AR
    Goubran, RA
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 305 - 308
  • [23] Unified model for voice conversion of speech and singing voice using adaptive pitch constraints
    Fukawa, Shogo
    Nose, Takashi
    Imai, Shuhei
    Ito, Akinori
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2025, 46 (01) : 120 - 123
  • [24] Speech enhancement by harmonic modeling via map pitch tracking
    Tabrikian, J
    Dubnov, S
    Dickalov, Y
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 549 - 552
  • [25] Determination of pitch of noisy speech using dominant harmonic frequency
    Hasan, MK
    Shahnaz, C
    Fatath, A
    PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II: COMMUNICATIONS-MULTIMEDIA SYSTEMS & APPLICATIONS, 2003, : 556 - 559
  • [26] Improving the harmonic structure of speech spectrum for robust pitch estimation
    Chowdhury, Husne Ara
    Rahman, Mohammad Shahidur
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2025, 46 (01) : 34 - 37
  • [27] Adaptive Harmonic Spectral Decomposition for Multiple Pitch Estimation
    Vincent, Emmanuel
    Bertin, Nancy
    Badeau, Roland
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03): : 528 - 537
  • [28] Adaptive model-based speech enhancement
    Logan, B
    Robinson, T
    SPEECH COMMUNICATION, 2001, 34 (04) : 351 - 368
  • [29] Adaptive speech processing based on reference model
    Chen, Zhangwei
    Zhao, Yugang
    2003, Chinese Vibration Engineering Society (22):
  • [30] Analysis and Synthesis of Speech Using an Adaptive Full-Band Harmonic Model
    Degottex, Gilles
    Stylianou, Yannis
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (10): : 2085 - 2095