On a Robust F0 Estimation of Speech based on IRAPT using Robust TV-CAR Analysis

Cited by: 0
Authors
Hotta, Kazushi [1]
Funaki, Keiichi [2]
Affiliations
[1] Univ Ryukyus, Grad Sch Engn & Sci, Nishihara, Okinawa 90301, Japan
[2] Univ Ryukyus, Comp & Networking Ctr, Nishihara, Okinawa 90301, Japan
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Fundamental frequency (F0) estimation is important in speech processing tasks such as speech coding, synthesis, and recognition. Current F0 estimation methods perform well under clean conditions, but their performance deteriorates significantly in noisy environments. Robust F0 estimation against additive noise is therefore needed. We have previously proposed F0 estimation methods based on Time-Varying Complex AR (TV-CAR) analysis, whose criteria include the weighted correlation of the complex residual obtained by the TV-CAR analysis and the sum of the harmonics of the complex residual spectrum. Separately, E. Azarov et al. have proposed IRAPT (Instantaneous RAPT), an improved version of RAPT (Robust Algorithm for Pitch Tracking) that uses instantaneous harmonics. IRAPT can estimate F0 more accurately than RAPT. Because IRAPT uses a band-limited analytic signal to obtain harmonic frequencies, the complex residual signal obtained by the TV-CAR analysis can also be applied to IRAPT. In this paper, a novel F0 estimation method using the instantaneous frequency, based on the robust ELS (Extended Least Squares) TV-CAR residual, is proposed and evaluated.
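The abstract's central idea, reading a harmonic frequency off a band-limited analytic signal, can be illustrated with a toy sketch. This is not the authors' method and not IRAPT: it does not perform TV-CAR analysis, ELS estimation, or pitch tracking. It assumes a single noiseless 200 Hz tone sampled at 8 kHz, and the function names `analytic_signal` and `instantaneous_frequency` are hypothetical helpers introduced here for illustration.

```python
import numpy as np


def analytic_signal(x):
    """Build an analytic signal by zeroing negative-frequency FFT bins.

    Assumption: the standard FFT construction of a discrete analytic
    signal, standing in for the complex signal an F0 estimator would use.
    """
    n = len(x)
    spectrum = np.fft.fft(x)
    gain = np.zeros(n)
    gain[0] = 1.0                      # keep DC as-is
    if n % 2 == 0:
        gain[n // 2] = 1.0             # keep the Nyquist bin as-is
        gain[1:n // 2] = 2.0           # double positive frequencies
    else:
        gain[1:(n + 1) // 2] = 2.0
    return np.fft.ifft(spectrum * gain)


def instantaneous_frequency(z, fs):
    """Instantaneous frequency in Hz from sample-to-sample phase steps."""
    phase_step = np.angle(z[1:] * np.conj(z[:-1]))
    return phase_step * fs / (2.0 * np.pi)


# Toy input (assumed values): a 200 Hz cosine, 0.1 s at 8 kHz.
fs = 8000.0
t = np.arange(800) / fs
f0_true = 200.0
x = np.cos(2.0 * np.pi * f0_true * t)

z = analytic_signal(x)
inst_f = instantaneous_frequency(z, fs)

# The median is a simple, outlier-tolerant summary of the frequency track.
f0_est = float(np.median(inst_f))
print(f"estimated F0: {f0_est:.2f} Hz")
```

In the setting the abstract describes, the complex TV-CAR residual would take the place of `z`; how that residual is computed and band-limited is specified in the paper, not here.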
Pages: 4
Related papers
50 in total
  • [21] Sparse Time-Varying Complex AR (TV-CAR) Speech Analysis Based on Adaptive LASSO
    Funaki, Keiichi
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2019, E102A (12) : 1910 - 1914
  • [22] F0 CONTOUR ESTIMATION USING PHONETIC FEATURE IN ELECTROLARYNGEAL SPEECH ENHANCEMENT
    Cai, Zexin
    Xu, Zhicheng
    Li, Ming
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6490 - 6494
  • [23] F0 ESTIMATION FOR DNN-BASED ULTRASOUND SILENT SPEECH INTERFACES
    Grosz, Tamas
    Gosztolya, Gabor
    Toth, Laszlo
    Csapo, Tamas Gabor
    Marko, Alexandra
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 291 - 295
  • [24] F0, LPC, and MFCC Analysis for Emotion Recognition Based on Speech
    Teixeira, Felipe L.
    Teixeira, Joao Paulo
    Soares, Salviano F. P.
    Pio Abreu, J. L.
    OPTIMIZATION, LEARNING ALGORITHMS AND APPLICATIONS, OL2A 2022, 2022, 1754 : 389 - 404
  • [25] F0 generation in a text-to-speech system using a database of natural F0 patterns
    da Silva, CH
    Nagle, EJ
    Runstein, F
    Violaro, F
    ITS '98 PROCEEDINGS - SBT/IEEE INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM, VOLS 1 AND 2, 1998, : 213 - 218
  • [26] Robust F0 Estimation Based on Log-Time Scale Autocorrelation and Its Application to Mandarin Tone Recognition
    Kida, Yusuke
    Sakai, Masaru
    Masuko, Takashi
    Kawamura, Akinori
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2931 - 2934
  • [27] Combining Atom Decomposition of the F0 Track and HMM-based Phonological Phrase Modelling for Robust Stress Detection in Speech
    Szaszak, Gyorgy
    Tundik, Mate Akos
    Gerazov, Branislav
    Gjoreski, Aleksandar
    SPEECH AND COMPUTER, 2016, 9811 : 165 - 173
  • [28] F0 analysis for Japanese conversational speech synthesis
    Nakajima, Hideharu
    Sagisaka, Yoshinori
    2009 EIGHTH INTERNATIONAL SYMPOSIUM ON NATURAL LANGUAGE PROCESSING, PROCEEDINGS, 2009, : 137 - +
  • [29] Effects of F0 Estimation Algorithms on Ultrasound-Based Silent Speech Interfaces
    Dai, Pengyu
    Al-Radhi, Mohammed Salah
    Csapo, Tamas Gabor
    2021 INTERNATIONAL CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2021, : 47 - 51
  • [30] TV-CAR speech analysis based on the l2-norm regularization in the time-domain and frequency domain
    Funaki, Keiichi
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 568 - 571