A pitch determination and voiced/unvoiced decision algorithm for noisy speech

Authors
Rouat, J
Liu, YC
Morissette, D
Funding
Natural Sciences and Engineering Research Council of Canada (NSERC);
Keywords
auditory model; car speech; telephone speech; multi-channel selection; Teager energy operator; amplitude modulation; residue pitch;
DOI
10.1016/S0167-6393(97)00002-2
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
The design of a pitch tracking system for noisy speech is a challenging and as yet unsolved problem, because it combines "traditional" pitch determination problems with those of noise processing. We have developed a multi-channel pitch determination algorithm (PDA) that has been tested on three speech databases (0 dB SNR telephone speech, speech recorded in a car, and clean speech) involving fifty-eight speakers. Our system has been compared to a multi-channel PDA based on auditory modelling (AMPEX), to hand-labelled pitch contours and to Laryngograph pitch contours. Our PDA comprises an automatic channel selection module and a pitch extraction module that relies on a pseudo-periodic histogram (a combination of normalised scalar products for the less corrupted channels) to find the pitch. Our PDA outperformed the reference system on 0 dB telephone speech and on car speech. The automatic selection of channels was effective on the very noisy telephone speech (0 dB) but had less effect on car speech, where the robustness of the system in comparison to AMPEX is mainly due to the pitch extraction module. This paper reports in detail the voiced/unvoiced and unvoiced/voiced performance and the pitch estimation errors for the proposed PDA and the reference system on the three speech databases.
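The pitch extraction idea named in the abstract (combining normalised scalar products over several channels and picking the most periodic lag) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the filterbank, the channel selection rule or how the pseudo-periodic histogram is built, so the code below assumes the channels are already selected, uses a plain sum over channels as the combination, and searches an assumed 60-400 Hz range. The names `normalised_scalar_product` and `estimate_pitch` are hypothetical.

```python
import math


def normalised_scalar_product(x, lag):
    """Scalar product of a frame with its copy shifted by `lag` samples,
    normalised by the energies of the two overlapping segments."""
    a = x[:len(x) - lag]
    b = x[lag:]
    num = sum(p * q for p, q in zip(a, b))
    den = math.sqrt(sum(p * p for p in a) * sum(q * q for q in b))
    return num / den if den > 0.0 else 0.0


def estimate_pitch(channels, fs, fmin=60.0, fmax=400.0):
    """Sum the normalised scalar products over all channels for each
    candidate lag and return the frequency of the best-scoring lag.
    Assumption: a plain sum stands in for the paper's histogram."""
    min_lag = int(fs / fmax)
    max_lag = int(fs / fmin)
    scores = {
        lag: sum(normalised_scalar_product(ch, lag) for ch in channels)
        for lag in range(min_lag, max_lag + 1)
    }
    best_lag = max(scores, key=scores.get)
    return fs / best_lag


# Synthetic example: two "channels" carrying the first and second
# harmonic of a 100 Hz voice, sampled at 8 kHz (50 ms frame).
fs = 8000
n = 400
channels = [
    [math.sin(2.0 * math.pi * f * t / fs) for t in range(n)]
    for f in (100.0, 200.0)
]
pitch = estimate_pitch(channels, fs)
print(pitch)
```

In this synthetic case the second-harmonic channel alone is equally periodic at lags of 40, 80 and 120 samples; only the sum over both channels singles out the 80-sample lag, which corresponds to 100 Hz. That is the motivation for combining channels, although a real system would also have to handle noise, channel selection and the voiced/unvoiced decision, none of which are modelled here.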
Pages: 191-207
Number of pages: 17