Visual-speech-pass filtering for robust automatic lip-reading

被引：0

作者：

Jong-Seok Lee

机构：

[1] Yonsei University,School of Integrated Technology

来源：

Pattern Analysis and Applications | 2014年 / 17卷

关键词：

Automatic lip-reading; Visual-speech-pass filtering (VSPF); Feature extraction; Temporal filtering; Noise-robustness;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

This paper proposes a temporal filtering technique used in extraction of visual features for improved robustness of automatic lip-reading, called visual-speech-pass filtering. A band-pass filter is applied to the pixel value sequence of the images containing the speaker’s lip region to remove unwanted variations that are not relevant to the speech information. The filter is carefully designed based on psychological, spectral, and experimental analyses. Experimental results on two speaker-independent and one speaker-dependent recognition tasks demonstrate that the proposed technique significantly improves recognition performance in both clean and visually noisy conditions.

引用

页码：611 / 621

页数：10

共 50 条

[1] Visual-speech-pass filtering for robust automatic lip-reading
Lee, Jong-Seok
PATTERN ANALYSIS AND APPLICATIONS, 2014, 17 (03) : 611 - 621
[2] Visual speech features representation for automatic lip-reading
Sagheer, A
Tsuruta, N
Taniguchi, RK
Maeda, S
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 781 - 784
[3] Visual units and confusion modelling for automatic lip-reading
Howell, Dominic
Cox, Stephen
Theobald, Barry
IMAGE AND VISION COMPUTING, 2016, 51 : 1 - 12
[4] Visual words for lip-reading
Hassanat, Ahmad B. A.
Jassim, Sabah
MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2010, 2010, 7708
[5] SPEECH AND LIP-READING FOR DEAF CHILDREN
不详
VOLTA REVIEW, 1921, 23 (01) : 45 - 46
[6] Automatic lip localization and feature extraction for lip-reading
Werda, Salah
Mahdi, Walid
Ben Hamadou, Abdehnajid
VISAPP 2007: PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOLUME IU/MTSV, 2007, : 268 - +
[7] Emotional Speech Recognition Based on Lip-Reading
Ryumina, Elena
Ivanko, Denis
SPEECH AND COMPUTER, SPECOM 2022, 2022, 13721 : 616 - 625
[8] Automated lip-reading for improved speech intelligibility
McClain, M
Brady, K
Brandstein, M
Quatieri, T
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 701 - 704
[9] AUTOMATIC LIP-READING OF HEARING IMPAIRED PEOPLE
Ivanko, D.
Ryumin, D.
Karpov, A.
INTERNATIONAL WORKSHOP ON PHOTOGRAMMETRIC AND COMPUTER VISION TECHNIQUES FOR VIDEO SURVEILLANCE, BIOMETRICS AND BIOMEDICINE, 2019, 42-2 (W12): : 97 - 101
[10] Method for visual analysis of driver's face for automatic lip-reading in the wild
Axyonov, A. A.
Ryumin, D. A.
Kashevnik, A. M.
Ivanko, D., V
Karpov, A. A.
COMPUTER OPTICS, 2022, 46 (06) : 955 - +

← 1 2 3 4 5 →