Importance-Aware Information Bottleneck Learning Paradigm for Lip Reading

被引:4
|
作者
Sheng, Changchong [1 ]
Liu, Li [2 ]
Deng, Wanxia [1 ]
Bai, Liang [2 ]
Liu, Zhong [2 ]
Lao, Songyang [2 ]
Kuang, Gangyao [1 ]
Pietikainen, Matti [3 ]
机构
[1] Natl Univ Def Technol, Coll Elect Sci & Technol, Changsha 410073, Peoples R China
[2] Natl Univ Def Technol, Coll Syst Engn, Lab Big Data & Decis, Changsha 410073, Peoples R China
[3] Univ Oulu, Ctr Machine Vis & Signal Anal, Oulu 90570, Finland
基金
国家重点研发计划; 中国国家自然科学基金; 芬兰科学院;
关键词
Lips; Visualization; Task analysis; Feature extraction; Speech recognition; Hidden Markov models; Noise measurement; Deep learning; information bottleneck; lip reading; visual speech recognition; NETWORK; FEATURES;
D O I
10.1109/TMM.2022.3210761
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Lip reading is the task of decoding text from speakers' mouth movements. Numerous deep learning-based methods have been proposed to address this task. However, these existing deep lip reading models suffer from poor generalization due to overfitting the training data. To resolve this issue, we present a novel learning paradigm that aims to improve the interpretability and generalization of lip reading models. In specific, aVariationalTemporalMask (VTM) module is customized to automatically analyze the importance of frame-level features. Furthermore, the prediction consistency constraints of global information and local temporal important features are introduced to strengthen the model generalization. We evaluate the novel learning paradigm with multiple lip reading baseline models on the LRW and LRW-1000 datasets. Experiments show that the proposed framework significantly improves the generalization performance and interpretability of lip reading models.
引用
收藏
页码:6563 / 6574
页数:12
相关论文
共 50 条
  • [21] IALight: Importance-Aware Multi-Agent Reinforcement Learning for Arterial Traffic Cooperative Control
    Wei, Lu
    Zhang, Xiaoyan
    Fan, Lijun
    Gao, Lei
    Yang, Jian
    PROMET-TRAFFIC & TRANSPORTATION, 2025, 37 (01): : 151 - 169
  • [22] Importance-aware personalized learning for early risk prediction using static and dynamic health data
    Tan, Qingxiong
    Ye, Mang
    Ma, Andy Jinhua
    Yip, Terry Cheuk-Fung
    Wong, Grace Lai-Hung
    Yuen, Pong C.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2021, 28 (04) : 713 - 726
  • [23] Neighbor importance-aware graph collaborative filtering for item recommendation
    Wang, Qingxian
    Wu, Suqiang
    Bai, Yanan
    Liu, Quanliang
    Shi, Xiaoyu
    NEUROCOMPUTING, 2023, 549
  • [24] Importance-Aware Genetic Programming for Automated Scheduling Heuristics Learning in Dynamic Flexible Job Shop Scheduling
    Zhang, Fangfang
    Mei, Yi
    Nguyen, Su
    Zhang, Mengjie
    PARALLEL PROBLEM SOLVING FROM NATURE - PPSN XVII, PPSN 2022, PT II, 2022, 13399 : 48 - 62
  • [25] Efficient Multi-View Learning Based on the Information Bottleneck Paradigm
    Huang, Teng-Hui
    El Gamal, Aly
    El Gamal, Hesham
    2022 56TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2022, : 1393 - 1397
  • [26] iJoin: Importance-aware join approximation over data streams
    Kulkarni, Dhananjay
    Ravishankar, Chinya V.
    SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2008, 5069 : 541 - +
  • [27] Adversarial Adaptive Neighborhood With Feature Importance-Aware Convex Interpolation
    Li, Qian
    Qi, Yong
    Hu, Qingyuan
    Qi, Saiyu
    Lin, Yun
    Dong, Jin Song
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 2447 - 2460
  • [28] THE IMPORTANCE OF LIP-READING
    Kessler, Emma B.
    VOLTA REVIEW, 1920, 22 (09) : 586 - 586
  • [29] Importance-Aware Data Pre-Processing and Device Scheduling for Multi-Channel Edge Learning
    Huang X.
    Zhou S.
    Journal of Communications and Information Networks, 2022, 7 (04): : 394 - 407
  • [30] Illustrating volume data sets and layered models with importance-aware composition
    Francisco de Moura Pinto
    Carla Maria Dal Sasso Freitas
    The Visual Computer, 2011, 27 : 875 - 886