Estimation of the instantaneous pitch of speech

被引:30
|
作者
Resch, Barbara [1 ]
Nilsson, Mattias [1 ]
Ekman, Anders [1 ]
Kleijn, W. Bastiaan [1 ]
机构
[1] Royal Inst Technol, Sound & Image Proc Lab, KTH, S-10044 Stockholm, Sweden
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2007年 / 15卷 / 03期
关键词
instantaneous pitch; pitch estimation; pitch-synchronous processing; splines;
D O I
10.1109/TASL.2006.885242
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
An accurate estimation of the pitch is essential for many speech processing applications, such as speech synthesis, speech coding, and speech enhancement. A widely used assumption in most common pitch estimation methods is that pitch is constant over a segment of short duration. This assumption does not apply in reality and leads to inaccurate pitch estimates. In this paper, we present a method for continuous pitch estimation that is able to track fast changes. In the presented framework, the pitch is modeled by a B-spline expansion and optimized in a multistage procedure for increased robustness. The performance of the continuous optimization procedure is compared to state-of-the-art pitch estimation methods and is evaluated both for artificial speech-like signals with known pitch, and for real speech signals. The results of the experiments show that our method leads to a higher accuracy of the estimate of the pitch than state-of-the-art methods.
引用
收藏
页码:813 / 822
页数:10
相关论文
共 50 条
  • [21] A Novel Pitch Detection Algorithm Based on Instantaneous Frequency for Clean and Noisy Speech
    Mnasri, Zied
    Rovetta, Stefano
    Masulli, Francesco
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (11) : 6266 - 6294
  • [22] A Novel Pitch Detection Algorithm Based on Instantaneous Frequency for Clean and Noisy Speech
    Zied Mnasri
    Stefano Rovetta
    Francesco Masulli
    Circuits, Systems, and Signal Processing, 2022, 41 : 6266 - 6294
  • [23] Identification and Instantaneous Frequency Estimation of Effective Vibration Signal for Pitch Gearbox
    Liu, Changliang
    Liu, Shaokang
    Liu, Shuai
    Liu, Weiliang
    Wu, Yingjie
    Wang, Ziqi
    Luo, Zhihong
    IEEE SENSORS JOURNAL, 2024, 24 (18) : 29086 - 29096
  • [24] Wavelet algorithm for the estimation of pitch period of speech signal
    Obaidat, MS
    Lee, C
    Zhang, Y
    Khalid, H
    Nelson, D
    ICECS 96 - PROCEEDINGS OF THE THIRD IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS, AND SYSTEMS, VOLS 1 AND 2, 1996, : 471 - 474
  • [25] On SNR Estimation by the Likelihood of near Pitch for Speech Detection
    Song, Young-Hwan
    Kyun, Doo-Heon
    Kim, Jong-Kuk
    Bae, Myung-Jin
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 26, PARTS 1 AND 2, DECEMBER 2007, 2007, 26 : 47 - 50
  • [26] A Tandem Algorithm for Pitch Estimation and Voiced Speech Segregation
    Hu, Guoning
    Wang, DeLiang
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (08): : 2067 - 2079
  • [27] Speech pitch period estimation using circular AMDF
    Xu, G
    Tang, LR
    PIMRC 2003: 14TH IEEE 2003 INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS PROCEEDINGS, VOLS 1-3 2003, 2003, : 2452 - 2455
  • [28] Robust pitch estimation with harmonics enhancement in noisy environments based on instantaneous frequency
    Abe, T
    Kobayashi, T
    Imai, S
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1277 - 1280
  • [29] Pitch estimation using models of voiced speech on three levels
    Joho, Dominik
    Bennewitz, Maren
    Behnke, Sven
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1077 - +
  • [30] Nonlinear estimation of DEGG signals with applications to speech pitch detection
    Barner, KE
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2243 - 2246