A pitch determination and voiced/unvoiced decision algorithm for noisy speech

Cited by: 46
Authors
Rouat, J
Liu, YC
Morissette, D
Funding
Natural Sciences and Engineering Research Council of Canada;
Keywords
auditory model; car speech; telephone speech; multi-channel selection; Teager energy operator; amplitude modulation; residue pitch;
DOI
10.1016/S0167-6393(97)00002-2
Chinese Library Classification
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
The design of a pitch tracking system for noisy speech is a challenging and as yet unsolved problem, because it combines "traditional" pitch determination problems with those of noise processing. We have developed a multi-channel pitch determination algorithm (PDA) and tested it on three speech databases (0 dB SNR telephone speech, speech recorded in a car, and clean speech) involving fifty-eight speakers. The system was compared with a multi-channel PDA based on auditory modelling (AMPEX) and with hand-labelled and Laryngograph pitch contours. Our PDA comprises an automatic channel selection module and a pitch extraction module that relies on a pseudo-periodic histogram (a combination of normalised scalar products for the less corrupted channels) to find the pitch. It outperformed the reference system on 0 dB telephone speech and on car speech. The automatic channel selection was effective on the very noisy telephone speech (0 dB) but contributed less on car speech, where the robustness of the system relative to AMPEX is mainly due to the pitch extraction module. This paper reports in detail the voiced/unvoiced and unvoiced/voiced performance and the pitch estimation errors of the proposed PDA and the reference system on the three speech databases.
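The abstract's idea of combining normalised scalar products across channels can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify the filterbank, the channel selection rule, or how the histogram is built, so the sketch simply sums the per-channel normalised scalar products over candidate lags and picks the peak. The function names, the 60-400 Hz search range, and the 0.5 voicing threshold are all assumptions made for illustration.

```python
import math


def normalised_scalar_product(frame, lag):
    """Normalised scalar product between a frame and itself delayed by `lag` samples."""
    a = frame[:-lag]
    b = frame[lag:]
    num = sum(x * y for x, y in zip(a, b))
    den = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    return num / den if den > 0.0 else 0.0


def pseudo_periodic_histogram(channels, min_lag, max_lag):
    """Sum the per-channel normalised scalar products for each candidate lag.

    Assumption: `channels` already holds only the less corrupted channels;
    the channel selection step itself is not modelled here.
    """
    return {
        lag: sum(normalised_scalar_product(ch, lag) for ch in channels)
        for lag in range(min_lag, max_lag + 1)
    }


def estimate_pitch(channels, fs, f_min=60.0, f_max=400.0, voicing_threshold=0.5):
    """Return a pitch estimate in Hz, or None if the frame is judged unvoiced.

    The search range and the threshold are illustrative values, not taken
    from the paper.
    """
    min_lag = int(fs / f_max)
    max_lag = int(fs / f_min)
    if any(len(ch) <= max_lag for ch in channels):
        raise ValueError("frame too short for the requested pitch range")
    hist = pseudo_periodic_histogram(channels, min_lag, max_lag)
    best_lag = max(hist, key=hist.get)
    # Average peak height across channels serves as a crude voicing measure.
    if hist[best_lag] / len(channels) < voicing_threshold:
        return None
    return fs / best_lag


# Usage: two synthetic "channels" sharing a 100 Hz fundamental at fs = 8000 Hz.
fs = 8000
n = range(400)  # 50 ms frame
ch1 = [math.sin(2 * math.pi * 100 * k / fs) for k in n]
ch2 = [math.sin(2 * math.pi * 100 * k / fs)
       + 0.5 * math.sin(2 * math.pi * 200 * k / fs) for k in n]
f0 = estimate_pitch([ch1, ch2], fs)
```

Summing across channels means a lag has to be supported by several channels at once to win, which is one plausible reading of why combining the less corrupted channels helps in noise.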
Pages: 191-207
Number of pages: 17
Related papers
50 in total
  • [31] Robust and High-resolution Voiced/Unvoiced Classification in Noisy Speech Using A Signal Smoothness Criterion
    Murthy, A. Sreenivasa
    Sekhar, S. Chandra
    Sreenivas, T. V.
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2260 - 2263
  • [32] Research on Pitch Extraction Algorithm of Noisy Speech
    Xing Hongyan
    Yu Cuihua
    Li Peng
    MATERIALS SCIENCE AND INFORMATION TECHNOLOGY, PTS 1-8, 2012, 433-440 : 4675 - 4678
  • [33] Improved flexible signal segmentation algorithm and automatic speech voiced-unvoiced segmentation
    Dong, En-Qing
    Liu, Gui-Zhong
    Zhou, Ya-Tong
    Dun, Yu-Jie
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2001, 29 (10): : 1364 - 1367
  • [34] Towards a robust/fast continuous speech recognition system using a voiced-unvoiced decision
    O'Shaughnessy, Douglas
    Tolba, Hesham
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 413 - 416
  • [35] Voiced/unvoiced decision based on recurrence quantification analysis
    Yan, Run-Qiang
    Zhu, Yi-Sheng
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2007, 29 (07): : 1703 - 1706
  • [36] DECISION FUNCTIONS FOR VOICED-UNVOICED-SILENCE DETECTION
    LOCHBAUM, CC
    DAVID, EE
    MATHEWS, MV
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1961, 33 (06): : 852 - &
  • [37] A SPECTRO-TEMPORAL TECHNIQUE FOR ESTIMATING APERIODICITY AND VOICED/UNVOICED DECISION BOUNDARIES OF SPEECH SIGNALS
    Dhiman, Jitendra Kumar
    Seelamantula, Chandra Sekhar
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6510 - 6514
  • [38] Towards a robust/fast continuous speech recognition system using a voiced-unvoiced decision
    O'Shaughnessy, D
    Tolba, H
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 413 - 416
  • [39] Segregation of voiced and unvoiced components from residual of speech signal
    JO Cheol-woo
    KIM Jae-hee
    Journal of Central South University, 2012, 19 (02) : 496 - 503
  • [40] Efficiency of the KLT on Voiced & Unvoiced Speech as a Function of Segment Size
    McDowell, William K.
    Mikhael, Wasfy B.
    Berg, Albert P.
    2012 PROCEEDINGS OF IEEE SOUTHEASTCON, 2012,