Importance-Aware Information Bottleneck Learning Paradigm for Lip Reading

Cited by: 4
Authors
Sheng, Changchong [1]
Liu, Li [2]
Deng, Wanxia [1]
Bai, Liang [2]
Liu, Zhong [2]
Lao, Songyang [2]
Kuang, Gangyao [1]
Pietikainen, Matti [3]
Affiliations
[1] Natl Univ Def Technol, Coll Elect Sci & Technol, Changsha 410073, Peoples R China
[2] Natl Univ Def Technol, Coll Syst Engn, Lab Big Data & Decis, Changsha 410073, Peoples R China
[3] Univ Oulu, Ctr Machine Vis & Signal Anal, Oulu 90570, Finland
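Funding
National Key Research and Development Program of China; National Natural Science Foundation of China; Academy of Finland
Keywords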
Lips; Visualization; Task analysis; Feature extraction; Speech recognition; Hidden Markov models; Noise measurement; Deep learning; information bottleneck; lip reading; visual speech recognition; NETWORK; FEATURES;
DOI
10.1109/TMM.2022.3210761
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Lip reading is the task of decoding text from a speaker's mouth movements. Numerous deep learning-based methods have been proposed for this task. However, existing deep lip reading models generalize poorly because they overfit the training data. To address this issue, we present a novel learning paradigm that aims to improve the interpretability and generalization of lip reading models. Specifically, a Variational Temporal Mask (VTM) module is designed to automatically analyze the importance of frame-level features. Furthermore, prediction consistency constraints between global information and locally important temporal features are introduced to strengthen model generalization. We evaluate the learning paradigm with multiple lip reading baseline models on the LRW and LRW-1000 datasets. Experiments show that the proposed framework significantly improves the generalization performance and interpretability of lip reading models.
Pages: 6563-6574
Page count: 12