PITCH MODIFICATIONS OF SPEECH BASED ON AN ADAPTIVE HARMONIC MODEL

Authors
Kafentzis, George P. [1 ,2 ]
Degottex, Gilles [2 ]
Rosec, Olivier [3 ]
Stylianou, Yannis [2 ]
Affiliations
[1] TECH ACTS MAS, Orange Labs, Lannion, France
[2] Univ Crete, Comp Sci Dept, Multimedia Informat Lab, Rethimnon, Greece
[3] Voxygen SA, Pole Phoenix, Pleumeur, France
Keywords
Pitch modification; Speech analysis; Adaptive quasi-harmonic model; Adaptive harmonic model; TRANSFORMATION; DECOMPOSITION;
DOI
Not available
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
This paper presents a simple method for pitch-scale modification of speech, based on a recently proposed model for AM-FM decomposition of speech signals: the adaptive Harmonic Model (aHM). The aHM represents speech as a sum of harmonically related sinusoids that adapt to the local characteristics of the signal. Because this model has been shown to reconstruct speech with high quality, it is also a suitable basis for high-quality pitch-scale modification. For modification, the amplitude envelope is estimated with the Discrete All-Pole (DAP) method, and the phase envelope is estimated using the concept of relative phase. Formal listening tests on a database covering several languages show that the synthetic pitch-scaled waveforms sound natural and are free of some common artefacts encountered with other state-of-the-art models, such as HNM and STRAIGHT.
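The idea of a sum of harmonics that follows a time-varying fundamental, with amplitudes re-read from an envelope after pitch scaling, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy `envelope` (a hypothetical stand-in for a DAP estimate), the constant-per-harmonic amplitudes, and the omission of the relative-phase step are all simplifying assumptions made for illustration.

```python
import math


def envelope(freq_hz):
    """Toy amplitude envelope (hypothetical; stands in for a DAP estimate)."""
    return 1.0 / (1.0 + (freq_hz / 1000.0) ** 2)


def synthesize(f0_track, fs, num_harmonics, pitch_factor=1.0):
    """Sum harmonics whose frequencies follow a per-sample f0 track.

    f0_track      : fundamental frequency in Hz, one value per sample
    fs            : sampling rate in Hz
    num_harmonics : maximum number of harmonics to sum
    pitch_factor  : pitch-scale factor applied to f0 (1.0 = unmodified)
    """
    phase = 0.0  # running integral of the (scaled) fundamental
    out = []
    for f0 in f0_track:
        f0_mod = pitch_factor * f0
        phase += 2.0 * math.pi * f0_mod / fs
        sample = 0.0
        for k in range(1, num_harmonics + 1):
            fk = k * f0_mod
            if fk >= fs / 2.0:  # skip harmonics above Nyquist
                break
            # Amplitude is re-sampled from the envelope at the new
            # harmonic frequency, so the spectral shape stays in place.
            sample += envelope(fk) * math.cos(k * phase)
        out.append(sample)
    return out


fs = 16000
# 100 ms of a slow glide starting at 120 Hz (illustrative values only)
f0_track = [120.0 + 20.0 * n / fs for n in range(fs // 10)]
original = synthesize(f0_track, fs, 10)
raised = synthesize(f0_track, fs, 10, pitch_factor=1.5)
```

Sampling the envelope at the shifted harmonic frequencies, rather than carrying each harmonic's original amplitude along with it, is what keeps the formant structure fixed while the pitch changes in this sketch.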
Pages: 5
Related papers
50 in total
  • [41] The pitch of a mistuned harmonic: Evidence for a template model
    Lin, JY
    Hartmann, WM
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1998, 103 (05): : 2608 - 2617
  • [42] Signal reshaping using dominant harmonic for pitch estimation of noisy speech
    Hasan, MK
    Hussain, S
    Setu, MTH
    Nazrul, MNI
    SIGNAL PROCESSING, 2006, 86 (05) : 1010 - 1018
  • [43] Performance of the pitch-scaled harmonic filter and applications in speech analysis
    Jackson, PJB
    Shadle, CH
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1311 - 1314
  • [44] Pitch estimation using a modulation model of speech
    Gopalan, K
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 786 - 791
  • [45] Exploiting temporal correlation in pitch-adaptive speech enhancement
    Stahl, Johannes
    Mowlaee, Pejman
    SPEECH COMMUNICATION, 2019, 111 : 1 - 13
  • [46] Emotional speech recognition based on modified parameter and distance of statistical model of pitch
    Department of Radio Engineering, Southeast University, Nanjing 210096, China
    Shengxue Xuebao, 2006, 1 (28-34):
  • [47] Adaptive pitch period decimation and its application in speech compression
    Logan, J
    Gowdy, J
    PROCEEDINGS OF THE IEEE SOUTHEASTCON '96: BRINGING TOGETHER EDUCATION, SCIENCE AND TECHNOLOGY, 1996, : 220 - 222
  • [48] Speech enhancement using a pitch predictive model
    Buera, Luis
    Droppo, Jasha
    Acero, Alex
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4885 - +
  • [49] Estimation of pitch of noisy speech using AR model based inverse filtering
    Ahmed, Kazi Jamir Uddin
    Khan, Md. Rezwan
    ICECE 2006: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, 2006, : 447 - +
  • [50] Time-scale and pitch modification for Chinese speech based on sinusoidal model
    Zhou, J.Y.
    Chai, P.Q.
    Tongji Daxue Xuebao/Journal of Tongji University, 2001, 29 (03): : 312 - 316