Improving the Accuracy and the Robustness of Harmonic Model for Pitch Estimation

被引:0
|
作者
Asgari, Meysam [1 ]
Shafran, Izhak [1 ]
机构
[1] Oregon Hlth & Sci Univ, Ctr Spoken Language Understanding, Portland, OR 97201 USA
基金
美国国家科学基金会;
关键词
fundamental frequency estimation; robust pitch estimation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate and robust estimation of pitch plays a central role in speech processing. Various methods in time, frequency and cepstral domain have been proposed for generating pitch candidates. Most algorithms excel when the background noise is minimal or for specific types of background noise. In this work, our aim is to improve the robustness and accuracy of pitch estimation across a wide variety of background noise conditions. For this we have chosen to adopt, the harmonic model of speech, a model that has gained considerable attention recently. We address two major weakness of this model. The problem of pitch halving and doubling, and the need to specify the number of harmonics. We exploit the energy of frequency in the neighborhood to alleviate halving and doubling. Using a model complexity term with a BIC criterion, we chose the optimal number of harmonics. We evaluated our proposed pitch estimation method with other state of the art techniques on Keele data set in terms of gross pitch error and fine pitch error. Through extensive experiments on several noisy conditions, we demonstrate that the proposed improvements provide substantial gains over other popular methods under different noise levels and environments.
引用
收藏
页码:1935 / 1939
页数:5
相关论文
共 50 条
  • [1] Joint estimation of pitch and direction of arrival: improving robustness and accuracy for multi-speaker scenarios
    Gerlach, Stephan
    Bitzer, Joerg
    Goetze, Stefan
    Doclo, Simon
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014,
  • [2] Joint estimation of pitch and direction of arrival: improving robustness and accuracy for multi-speaker scenarios
    Stephan Gerlach
    Jörg Bitzer
    Stefan Goetze
    Simon Doclo
    EURASIP Journal on Audio, Speech, and Music Processing, 2014 (1)
  • [3] Improving the harmonic structure of speech spectrum for robust pitch estimation
    Chowdhury, Husne Ara
    Rahman, Mohammad Shahidur
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2025, 46 (01) : 34 - 37
  • [4] On Improving the Accuracy and Robustness of Time Delay Estimation of Broadband Signals
    B. H. V. S. Narayanamurthy
    J. V. Satyanarayana
    B. Yegnanarayana
    Circuits, Systems, and Signal Processing, 2022, 41 : 514 - 531
  • [5] On Improving the Accuracy and Robustness of Time Delay Estimation of Broadband Signals
    Narayanamurthy, B. H. V. S.
    Satyanarayana, J. V.
    Yegnanarayana, B.
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (01) : 514 - 531
  • [6] Pitch estimation based on harmonic salience
    Song, Liming
    Li, Ming
    Yan, Yonghong
    Shengxue Xuebao/Acta Acustica, 2015, 40 (02): : 294 - 299
  • [7] Pitch estimation based on harmonic salience
    Song, Liming
    Li, Ming
    Yan, Yonghong
    ELECTRONICS LETTERS, 2013, 49 (23) : 1491 - 1492
  • [8] Melody pitch estimation based on range estimation and candidate extraction using harmonic structure model
    Jo, Seokhwan
    Joo, Sihyun
    Yoo, Chang D.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2902 - 2905
  • [9] Multi-pitch estimation using harmonic music
    Christensen, Mads Graesboll
    Jakobsson, Andreas
    Jensen, Soren Holdt
    2006 FORTIETH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1-5, 2006, : 521 - +
  • [10] PITCH ESTIMATION AND TRACKING WITH HARMONIC EMPHASIS ON THE ACOUSTIC SPECTRUM
    Karimian-Azari, Sam
    Mohammadiha, Nasser
    Jensen, Jesper R.
    Christensen, Mads G.
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4330 - 4334