Importance-Aware Information Bottleneck Learning Paradigm for Lip Reading

Cited by: 4
Authors
Sheng, Changchong [1 ]
Liu, Li [2 ]
Deng, Wanxia [1 ]
Bai, Liang [2 ]
Liu, Zhong [2 ]
Lao, Songyang [2 ]
Kuang, Gangyao [1 ]
Pietikainen, Matti [3 ]
Affiliations
[1] Natl Univ Def Technol, Coll Elect Sci & Technol, Changsha 410073, Peoples R China
[2] Natl Univ Def Technol, Coll Syst Engn, Lab Big Data & Decis, Changsha 410073, Peoples R China
[3] Univ Oulu, Ctr Machine Vis & Signal Anal, Oulu 90570, Finland
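Funding
National Key Research and Development Program of China; National Natural Science Foundation of China; Academy of Finland;
Keywords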
Lips; Visualization; Task analysis; Feature extraction; Speech recognition; Hidden Markov models; Noise measurement; Deep learning; information bottleneck; lip reading; visual speech recognition; NETWORK; FEATURES;
DOI
10.1109/TMM.2022.3210761
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Lip reading is the task of decoding text from a speaker's mouth movements. Numerous deep learning-based methods have been proposed to address this task. However, existing deep lip reading models generalize poorly because they overfit the training data. To resolve this issue, we present a novel learning paradigm that aims to improve the interpretability and generalization of lip reading models. Specifically, a Variational Temporal Mask (VTM) module is designed to automatically analyze the importance of frame-level features. Furthermore, prediction consistency constraints between global information and locally important temporal features are introduced to strengthen model generalization. We evaluate the learning paradigm with multiple lip reading baseline models on the LRW and LRW-1000 datasets. Experiments show that the proposed framework significantly improves the generalization performance and interpretability of lip reading models.
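The abstract names two ingredients without giving formulas: a variational mask over frames and a consistency constraint between global and local predictions. The sketch below is one generic way such a combination could be written, not the paper's formulation. It assumes a relaxed Bernoulli (Concrete) mask per frame, a KL term against a fixed Bernoulli prior as the bottleneck penalty, and a KL term between the all-frame prediction and the masked prediction. Every name here (`paradigm_loss`, `keep_probs`, `prior`, `beta`, `gamma`, `temperature`) is hypothetical.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_categorical(p, q, eps=1e-12):
    """KL(p || q) between two categorical distributions."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def kl_bernoulli(p, prior, eps=1e-12):
    """KL(Bernoulli(p) || Bernoulli(prior)) for one frame's keep-probability."""
    return (p * math.log((p + eps) / (prior + eps))
            + (1 - p) * math.log((1 - p + eps) / (1 - prior + eps)))


def sample_relaxed_mask(keep_probs, temperature, rng):
    """Draw one soft mask value per frame with the relaxed Bernoulli trick."""
    mask = []
    for p in keep_probs:
        p = min(max(p, 1e-6), 1 - 1e-6)              # keep the logit finite
        u = min(max(rng.random(), 1e-6), 1 - 1e-6)   # uniform noise, clamped
        logit = math.log(p) - math.log(1 - p)
        noise = math.log(u) - math.log(1 - u)        # logistic noise
        mask.append(1 / (1 + math.exp(-(logit + noise) / temperature)))
    return mask


def pool(frame_logits, weights):
    """Weighted average of per-frame class logits."""
    total = sum(weights)
    n_classes = len(frame_logits[0])
    return [sum(w * f[c] for w, f in zip(weights, frame_logits)) / total
            for c in range(n_classes)]


def paradigm_loss(frame_logits, keep_probs, label, prior=0.5,
                  beta=0.1, gamma=1.0, temperature=0.5, rng=None):
    """Assumed objective: task loss + beta * mask KL + gamma * consistency KL."""
    rng = rng or random.Random(0)
    mask = sample_relaxed_mask(keep_probs, temperature, rng)
    # Global branch uses every frame equally; local branch uses masked frames.
    global_pred = softmax(pool(frame_logits, [1.0] * len(frame_logits)))
    local_pred = softmax(pool(frame_logits, mask))
    task = -math.log(global_pred[label]) - math.log(local_pred[label])
    compression = sum(kl_bernoulli(p, prior) for p in keep_probs) / len(keep_probs)
    consistency = kl_categorical(global_pred, local_pred)
    return task + beta * compression + gamma * consistency, mask


if __name__ == "__main__":
    # Toy input: 4 frames, 3 word classes; importance scores are made up.
    logits = [[2.0, 0.1, -1.0], [0.3, 0.2, 0.1], [1.5, -0.2, 0.0], [0.0, 0.1, 0.2]]
    scores = [2.0, -2.0, 1.0, -1.5]
    probs = [1 / (1 + math.exp(-s)) for s in scores]
    loss, mask = paradigm_loss(logits, probs, label=0)
    print("loss:", loss)
    print("mask:", mask)
```

In this sketch the keep-probabilities double as a per-frame importance readout, which is the kind of interpretability signal the abstract attributes to the VTM module; how the paper actually parameterizes the mask and weights the terms is not stated in this record.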
Pages: 6563-6574
Page count: 12