Focal Channel Knowledge Distillation for Multi-Modality Action Recognition

Cited by: 1
Authors
Gan, Lipeng [1]
Cao, Runze [1]
Li, Ning [1]
Yang, Man [1]
Li, Xiaochao [1, 2, 3]
Affiliations
[1] Xiamen Univ, Dept Microelect & Integrated Circuit, Xiamen 361005, Peoples R China
[2] Xiamen Univ Malaysia, Dept Elect & Elect Engn, Sepang 43900, Selangor, Malaysia
[3] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW 2006, Australia
Source
IEEE ACCESS | 2023 / Vol. 11
Keywords
Action recognition; knowledge distillation; multi-modality
DOI
10.1109/ACCESS.2023.3298647
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Multi-modality action recognition aims to learn complementary information from multiple modalities to improve action recognition performance. However, the channels of different modalities differ significantly, so transferring the channel semantic features of all modalities to RGB with equal weight results in competition and redundancy during knowledge distillation. To address this issue, we propose a focal channel knowledge distillation strategy that transfers the key semantic correlations and distributions of multi-modality teachers into the RGB student network. The focal channel correlations provide the intrinsic relationships and diversity properties of key semantics, and the focal channel distributions provide the salient channel activations of features. By ignoring the less discriminative and irrelevant channels, the student can use its channel capacity more efficiently to learn complementary semantic features from the other modalities. Our focal channel knowledge distillation achieves 91.2%, 95.6%, 98.3%, and 81.0% accuracy on the NTU 60 (CS), UTD-MHAD, N-UCLA, and HMDB51 datasets, respectively, which is an improvement of 4.5%, 4.2%, 3.7%, and 7.1% over unimodal RGB models. The focal channel knowledge distillation framework can also be integrated with unimodal models to achieve state-of-the-art performance. Extensive experiments show that the proposed method achieves 92.5%, 96.0%, 98.9%, and 82.3% accuracy on the NTU 60 (CS), UTD-MHAD, N-UCLA, and HMDB51 datasets, respectively.
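The idea in the abstract (distill only a selected subset of channels, matching both their correlations and their activation distributions) can be illustrated with a rough sketch. This is not the authors' implementation: the abstract does not state how focal channels are chosen or which loss functions are used, so the selection rule (top-k mean absolute teacher activation), the KL-divergence distribution loss, the cosine-similarity correlation loss, and every function and parameter name below are assumptions made for illustration only.

```python
import math

# Features are lists of channels; each channel is a flat list of activations
# over all spatial-temporal positions. All names here are hypothetical.

def softmax(xs, tau=1.0):
    m = max(xs)
    exps = [math.exp((x - m) / tau) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def select_focal_channels(teacher, k):
    # ASSUMPTION: "focal" = the k teacher channels with largest mean |activation|.
    scores = [sum(abs(v) for v in ch) / len(ch) for ch in teacher]
    order = sorted(range(len(teacher)), key=lambda c: scores[c], reverse=True)
    return sorted(order[:k])

def channel_distribution_loss(teacher, student, focal, tau=1.0, eps=1e-12):
    # ASSUMPTION: mean KL(teacher || student) over focal channels, where each
    # channel is turned into a distribution over positions with a softmax.
    total = 0.0
    for c in focal:
        p, q = softmax(teacher[c], tau), softmax(student[c], tau)
        total += sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))
    return total / len(focal)

def channel_correlation(feats, focal):
    # Cosine-similarity matrix between the selected channels.
    unit = []
    for c in focal:
        norm = math.sqrt(sum(v * v for v in feats[c])) or 1.0
        unit.append([v / norm for v in feats[c]])
    return [[sum(a * b for a, b in zip(u, v)) for v in unit] for u in unit]

def channel_correlation_loss(teacher, student, focal):
    # ASSUMPTION: mean squared difference between the two correlation matrices.
    ct = channel_correlation(teacher, focal)
    cs = channel_correlation(student, focal)
    k = len(focal)
    return sum((ct[i][j] - cs[i][j]) ** 2 for i in range(k) for j in range(k)) / (k * k)

def focal_channel_kd_loss(teacher, student, k, alpha=1.0, beta=1.0, tau=1.0):
    # Weighted sum of both terms, computed on the focal channels only.
    focal = select_focal_channels(teacher, k)
    return (alpha * channel_distribution_loss(teacher, student, focal, tau)
            + beta * channel_correlation_loss(teacher, student, focal))

teacher = [[2.0, 0.0, 1.0], [0.1, 0.0, 0.1], [0.0, 3.0, 1.0]]
student = [[0.0, 2.0, 1.0], [5.0, 5.0, 5.0], [0.0, 3.0, 1.0]]
print(select_focal_channels(teacher, 2))
print(focal_channel_kd_loss(teacher, student, 2))
```

In this toy example the weakly activated middle channel is excluded from the focal set, so a student that differs from the teacher only in that channel incurs no loss under this sketch. A practical version would operate on batched tensors in a deep learning framework and would follow the selection rule and losses defined in the paper itself.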
Pages: 78285-78298
Page count: 14