Visual-to-EEG cross-modal knowledge distillation for continuous emotion recognition

被引:27
|
作者
Zhang, Su [1 ]
Tang, Chuangao [2 ]
Guan, Cuntai [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[2] Southeast Univ, Sch Biol Sci & Med Engn, Key Lab Child Dev & Learning Sci, Minist Educ, Nanjing 210096, Peoples R China
关键词
Continuous emotion recognition; Knowledge distillation; Cross-modality; BRAIN;
D O I
10.1016/j.patcog.2022.108833
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual modality is one of the most dominant modalities for current continuous emotion recognition methods. Compared to which the EEG modality is relatively less sound due to its intrinsic limitation such as subject bias and low spatial resolution. This work attempts to improve the continuous prediction of the EEG modality by using the dark knowledge from the visual modality. The teacher model is built by a cascade convolutional neural network - temporal convolutional network (CNN-TCN) architecture, and the student model is built by TCNs. They are fed by video frames and EEG average band power features, respectively. Two data partitioning schemes are employed, i.e., the trial-level random shuffling (TRS) and the leave-one-subject-out (LOSO). The standalone teacher and student can produce continuous prediction superior to the baseline method, and the employment of the visual-to-EEG cross-modal KD further improves the prediction with statistical significance, i.e., p-value < 0.01 for TRS and p-value < 0.05 for LOSO partitioning. The saliency maps of the trained student model show that the brain areas associated with the active valence state are not located in precise brain areas. Instead, it results from synchronized activity among various brain areas. And the fast beta and gamma waves, with the frequency of 18 - 30Hz and 30 - 45Hz, contribute the most to the human emotion process compared to other bands. The code is available at https://github.com/sucv/Visual_to_EEG_Cross_Modal_KD_for_CER. (C) 2022 The Authors. Published by Elsevier Ltd.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Multimode Fiber Image Transmission via Cross-Modal Knowledge distillation
    Lin, Weixuan
    Wu, Di
    Boulet, Benoit
    2024 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CCECE 2024, 2024, : 13 - 19
  • [42] Incongruity-Aware Cross-Modal Attention for Audio-Visual Fusion in Dimensional Emotion Recognition
    Praveen, R. Gnana
    Alam, Jahangir
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2024, 18 (03) : 444 - 458
  • [43] Muti-Modal Emotion Recognition via Hierarchical Knowledge Distillation
    Sun, Teng
    Wei, Yinwei
    Ni, Juntong
    Liu, Zixin
    Song, Xuemeng
    Wang, Yaowei
    Nie, Liqiang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9036 - 9046
  • [44] Auditory to Visual Cross-Modal Adaptation for Emotion: Psychophysical and Neural Correlates
    Wang, Xiaodong
    Guo, Xiaotao
    Chen, Lin
    Liu, Yijun
    Goldberg, Michael E.
    Xu, Hong
    CEREBRAL CORTEX, 2017, 27 (02) : 1337 - 1346
  • [45] SELF-SUPERVISED LEARNING WITH CROSS-MODAL TRANSFORMERS FOR EMOTION RECOGNITION
    Khare, Aparna
    Parthasarathy, Srinivas
    Sundaram, Shiva
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 381 - 388
  • [46] AUDIOVISUAL EMOTION RECOGNITION VIA CROSS-MODAL ASSOCIATION IN KERNEL SPACE
    Wang, Yongjin
    Guan, Ling
    Venetsanopoulos, A. N.
    2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,
  • [47] EEG emotion recognition based on knowledge distillation optimized residual networks
    Wang, Pai
    Guo, Chunyong
    Xi, Shuangqiang
    Qiao, Xiang
    Mao, Lili
    Fu, Xianyong
    2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022, : 574 - 581
  • [48] HAPTIC TO VISUAL CROSS-MODAL RECOGNITION OF OBJECTS IN THE VERVET MONKEY
    ECHENIQUE, CR
    SOSA, MV
    ACTA NEUROBIOLOGIAE EXPERIMENTALIS, 1981, 41 (01) : 113 - 118
  • [49] A biphasic effect of cross-modal priming on visual shape recognition
    Kwok, Sze Chai
    Fantoni, Carlo
    Tamburini, Laura
    Wang, Lei
    Gerbino, Walter
    ACTA PSYCHOLOGICA, 2018, 183 : 43 - 50
  • [50] Continuous cross-modal hashing
    Zheng, Hao
    Wang, Jinbao
    Zhen, Xiantong
    Song, Jingkuan
    Zheng, Feng
    Lu, Ke
    Qi, Guo-Jun
    PATTERN RECOGNITION, 2023, 142