Visual-speech-pass filtering for robust automatic lip-reading

被引:0
|
作者
Jong-Seok Lee
机构
[1] Yonsei University,School of Integrated Technology
来源
关键词
Automatic lip-reading; Visual-speech-pass filtering (VSPF); Feature extraction; Temporal filtering; Noise-robustness;
D O I
暂无
中图分类号
学科分类号
摘要
This paper proposes a temporal filtering technique used in extraction of visual features for improved robustness of automatic lip-reading, called visual-speech-pass filtering. A band-pass filter is applied to the pixel value sequence of the images containing the speaker’s lip region to remove unwanted variations that are not relevant to the speech information. The filter is carefully designed based on psychological, spectral, and experimental analyses. Experimental results on two speaker-independent and one speaker-dependent recognition tasks demonstrate that the proposed technique significantly improves recognition performance in both clean and visually noisy conditions.
引用
收藏
页码:611 / 621
页数:10
相关论文
共 50 条
  • [2] Visual speech features representation for automatic lip-reading
    Sagheer, A
    Tsuruta, N
    Taniguchi, RK
    Maeda, S
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 781 - 784
  • [3] Visual units and confusion modelling for automatic lip-reading
    Howell, Dominic
    Cox, Stephen
    Theobald, Barry
    IMAGE AND VISION COMPUTING, 2016, 51 : 1 - 12
  • [4] Visual words for lip-reading
    Hassanat, Ahmad B. A.
    Jassim, Sabah
    MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2010, 2010, 7708
  • [5] SPEECH AND LIP-READING FOR DEAF CHILDREN
    不详
    VOLTA REVIEW, 1921, 23 (01) : 45 - 46
  • [6] Automatic lip localization and feature extraction for lip-reading
    Werda, Salah
    Mahdi, Walid
    Ben Hamadou, Abdehnajid
    VISAPP 2007: PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOLUME IU/MTSV, 2007, : 268 - +
  • [7] Emotional Speech Recognition Based on Lip-Reading
    Ryumina, Elena
    Ivanko, Denis
    SPEECH AND COMPUTER, SPECOM 2022, 2022, 13721 : 616 - 625
  • [8] Automated lip-reading for improved speech intelligibility
    McClain, M
    Brady, K
    Brandstein, M
    Quatieri, T
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 701 - 704
  • [9] AUTOMATIC LIP-READING OF HEARING IMPAIRED PEOPLE
    Ivanko, D.
    Ryumin, D.
    Karpov, A.
    INTERNATIONAL WORKSHOP ON PHOTOGRAMMETRIC AND COMPUTER VISION TECHNIQUES FOR VIDEO SURVEILLANCE, BIOMETRICS AND BIOMEDICINE, 2019, 42-2 (W12): : 97 - 101
  • [10] Method for visual analysis of driver's face for automatic lip-reading in the wild
    Axyonov, A. A.
    Ryumin, D. A.
    Kashevnik, A. M.
    Ivanko, D., V
    Karpov, A. A.
    COMPUTER OPTICS, 2022, 46 (06) : 955 - +