Robust F0 estimation using ELS-based robust complex speech analysis

被引:0
|
作者
Funaki, Keiichi [1 ]
Kinjo, Tatsuhiko [2 ]
机构
[1] Univ Ryukyus, Comp & Networking Ctr, Nishihara, Okinawa 9030213, Japan
[2] Toyota Commun Syst CO LTD, Higashi Ku, Nagoya, Aichi 4610005, Japan
关键词
F0; estimation; analytic signal; ELS (Extended Least Square); robust complex speech analysis; IRS filtered speech;
D O I
10.1093/ietfec/e91-a.3.868
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Complex speech analysis for an analytic speech signal can accurately estimate the spectrum in low frequencies since the analytic signal provides spectrum only over positive frequencies. The remarkable feature makes it possible to realize more accurate F0 estimation using complex residual signal extracted by complex-valued speech analysis. We have already proposed F0 estimation using complex LPC residual, in which the autocorrelation function weighted by AMDF was adopted as the criterion. The method adopted MMSE-based complex LPC analysis and it has been reported that it can estimate more accurate F0 for IRS filtered speech corrupted by white Gauss noise although it can not work better for the IRS filtered speech corrupted by pink noise. In this paper, robust complex speech analysis based on ELS (Extended Least Square) method is introduced in order to overcome the drawback. The experimental results for additive white Gauss or pink noise demonstrate that the proposed algorithm based on robust ELS-based complex AR analysis can perform better than other methods.
引用
收藏
页码:868 / 871
页数:4
相关论文
共 50 条
  • [31] Comparative evaluations of robust and accurate F0 estimates in reverberant environments
    Unoki, Masashi
    Hosorogiya, Toshihiro
    Ishimoto, Yuichi
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4569 - +
  • [32] F0 Estimation and Voicing Detection With Cascade Architecture in Noisy Speech
    Zhang, Yixuan
    Wang, Heming
    Wang, Deliang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 3760 - 3770
  • [33] Investigation of Prosodic F0 Layers in Hierarchical F0 Modeling for HMM-based Speech Synthesis
    Lei, Ming
    Wu, Yi-Jian
    Ling, Zhen-Hua
    Dai, Li-Rong
    2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 613 - +
  • [34] F0 ESTIMATION FOR NOISY SPEECH BASED ON EXPLORING LOCAL TIME-FREQUENCY SEGMENT
    Wang, Dongmei
    Hansen, John H. L.
    Tobey, Emily
    2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2015,
  • [35] JOINT ANALYSIS OF F0 AND SPEECH RATE WITH FUNCTIONAL DATA ANALYSIS
    Gubian, Michele
    Boves, Lou
    Cangemi, Francesco
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4972 - 4975
  • [36] A Study of F0 Estimation Based on RAPT Framework using Sustained Vowel
    Karunaimathi, Prarthana, V
    Gladis, Dennis
    Dalvi, Usha
    2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2015, : 2290 - 2295
  • [37] Robust f0 extraction from monophonic signals using adaptive sub-band filtering
    Rengaswamy, Pradeep
    Reddy, M. Kiran
    Rao, Krothapalli Sreenivasa
    Dasgupta, Pallab
    SPEECH COMMUNICATION, 2020, 116 : 77 - 85
  • [38] SAFE: a Statistical Algorithm for F0 Estimation for Both Clean and Noisy Speech
    Chu, Wei
    Alwan, Abeer
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2598 - 2601
  • [39] An Analogy of F0 Estimation Algorithms Using Sustained Vowel
    Karunaimathi, Prarthana, V
    Gladis, Dennis
    Balakrishnan, D.
    PROCEEDING OF THE THIRD INTERNATIONAL SYMPOSIUM ON WOMEN IN COMPUTING AND INFORMATICS (WCI-2015), 2015, : 217 - 221
  • [40] NOISE-ROBUST F0 ESTIMATION USING SNR-WEIGHTED SUMMARY CORRELOGRAMS FROM MULTI-BAND COMB FILTERS
    Tan, Lee Ngee
    Alwan, Abeer
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4464 - 4467