Improving the Accuracy and the Robustness of Harmonic Model for Pitch Estimation

被引:0
|
作者
Asgari, Meysam [1 ]
Shafran, Izhak [1 ]
机构
[1] Oregon Hlth & Sci Univ, Ctr Spoken Language Understanding, Portland, OR 97201 USA
基金
美国国家科学基金会;
关键词
fundamental frequency estimation; robust pitch estimation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate and robust estimation of pitch plays a central role in speech processing. Various methods in time, frequency and cepstral domain have been proposed for generating pitch candidates. Most algorithms excel when the background noise is minimal or for specific types of background noise. In this work, our aim is to improve the robustness and accuracy of pitch estimation across a wide variety of background noise conditions. For this we have chosen to adopt, the harmonic model of speech, a model that has gained considerable attention recently. We address two major weakness of this model. The problem of pitch halving and doubling, and the need to specify the number of harmonics. We exploit the energy of frequency in the neighborhood to alleviate halving and doubling. Using a model complexity term with a BIC criterion, we chose the optimal number of harmonics. We evaluated our proposed pitch estimation method with other state of the art techniques on Keele data set in terms of gross pitch error and fine pitch error. Through extensive experiments on several noisy conditions, we demonstrate that the proposed improvements provide substantial gains over other popular methods under different noise levels and environments.
引用
收藏
页码:1935 / 1939
页数:5
相关论文
共 50 条
  • [21] The pitch of a mistuned harmonic: Evidence for a template model
    Lin, JY
    Hartmann, WM
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1998, 103 (05): : 2608 - 2617
  • [22] A Study on the Robustness of Pitch Range Estimation from Brief Speech Segments
    Peng, Wenjie
    Fu, Kaiqi
    Zhang, Wei
    Xie, Yanlu
    Zhang, Jinsong
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2019, : 172 - 176
  • [23] Robustness and Accuracy of Time Delay Estimation in a Live Room
    Bayya, Yegnanarayana
    Murthy, B. H. V. S. Narayana
    Satyanarayana, J. V.
    Pannala, Vishala
    Chennupati, Nivedita
    2021 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2021, : 440 - 445
  • [24] ON THE ROBUSTNESS OF THE QUASI-HARMONIC MODEL OF SPEECH
    Pantazis, Yannis
    Rosec, Olivier
    Stylianou, Yannis
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4210 - 4213
  • [25] Improving Accuracy and Robustness of Clock Parameters Estimation Using Multi-Link Overhearing in Wireless Sensor Networks
    Liu, Xiaojiang
    Wang, Heng
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 517 - 522
  • [26] Robust Harmonic Features for Classification-Based Pitch Estimation
    Wang, Dongmei
    Yu, Chengzhu
    Hansen, John H. L.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (05) : 952 - 964
  • [27] An improved sub-harmonic to harmonic ratio method for pitch estimation and Shadja detection
    Kumaraswamy, Balachandra
    Poonacha, P. G.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (07):
  • [28] Improving the Robustness and Applicability of Higher-accuracy EBR Schemes
    Gorobets, A.
    LOBACHEVSKII JOURNAL OF MATHEMATICS, 2024, 45 (10) : 5002 - 5013
  • [29] Plenoptic Imaging Techniques for Improving Accuracy and Robustness of Object Tracking
    Bae, Dae Hyun
    Kim, Jae Woo
    Noh, Hae Chan
    Kim, Do Hyung
    Heo, Jae-Pil
    THREE-DIMENSIONAL IMAGING, VISUALIZATION, AND DISPLAY 2018, 2018, 10666
  • [30] Improving the Accuracy, Robustness, and Dynamic Range of Digital Bead Assays
    Zhang, Jianli
    Wiener, Alexander D.
    Meyer, Raymond E.
    Kan, Cheuk W.
    Rissin, David M.
    Kolluru, Bharathi
    George, Christopher
    Tobos, Carmen I.
    Shan, Dandan
    Duffy, David C.
    ANALYTICAL CHEMISTRY, 2023, 95 (22) : 8613 - 8620