Visual-to-EEG cross-modal knowledge distillation for continuous emotion recognition

被引：27

作者：

Zhang, Su ^{[1
]}

Tang, Chuangao ^{[2
]}

Guan, Cuntai ^{[1
]}

机构：

[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore

[2] Southeast Univ, Sch Biol Sci & Med Engn, Key Lab Child Dev & Learning Sci, Minist Educ, Nanjing 210096, Peoples R China

来源：

PATTERN RECOGNITION | 2022年 / 130卷

关键词：

Continuous emotion recognition; Knowledge distillation; Cross-modality; BRAIN;

D O I：

10.1016/j.patcog.2022.108833

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Visual modality is one of the most dominant modalities for current continuous emotion recognition methods. Compared to which the EEG modality is relatively less sound due to its intrinsic limitation such as subject bias and low spatial resolution. This work attempts to improve the continuous prediction of the EEG modality by using the dark knowledge from the visual modality. The teacher model is built by a cascade convolutional neural network - temporal convolutional network (CNN-TCN) architecture, and the student model is built by TCNs. They are fed by video frames and EEG average band power features, respectively. Two data partitioning schemes are employed, i.e., the trial-level random shuffling (TRS) and the leave-one-subject-out (LOSO). The standalone teacher and student can produce continuous prediction superior to the baseline method, and the employment of the visual-to-EEG cross-modal KD further improves the prediction with statistical significance, i.e., p-value < 0.01 for TRS and p-value < 0.05 for LOSO partitioning. The saliency maps of the trained student model show that the brain areas associated with the active valence state are not located in precise brain areas. Instead, it results from synchronized activity among various brain areas. And the fast beta and gamma waves, with the frequency of 18 - 30Hz and 30 - 45Hz, contribute the most to the human emotion process compared to other bands. The code is available at https://github.com/sucv/Visual_to_EEG_Cross_Modal_KD_for_CER. (C) 2022 The Authors. Published by Elsevier Ltd.

引用

页数：11

共 50 条

[41] Multimode Fiber Image Transmission via Cross-Modal Knowledge distillation
Lin, Weixuan
Wu, Di
Boulet, Benoit
2024 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CCECE 2024, 2024, : 13 - 19
[42] Incongruity-Aware Cross-Modal Attention for Audio-Visual Fusion in Dimensional Emotion Recognition
Praveen, R. Gnana
Alam, Jahangir
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2024, 18 (03) : 444 - 458
[43] Muti-Modal Emotion Recognition via Hierarchical Knowledge Distillation
Sun, Teng
Wei, Yinwei
Ni, Juntong
Liu, Zixin
Song, Xuemeng
Wang, Yaowei
Nie, Liqiang
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9036 - 9046
[44] Auditory to Visual Cross-Modal Adaptation for Emotion: Psychophysical and Neural Correlates
Wang, Xiaodong
Guo, Xiaotao
Chen, Lin
Liu, Yijun
Goldberg, Michael E.
Xu, Hong
CEREBRAL CORTEX, 2017, 27 (02) : 1337 - 1346
[45] SELF-SUPERVISED LEARNING WITH CROSS-MODAL TRANSFORMERS FOR EMOTION RECOGNITION
Khare, Aparna
Parthasarathy, Srinivas
Sundaram, Shiva
2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 381 - 388
[46] AUDIOVISUAL EMOTION RECOGNITION VIA CROSS-MODAL ASSOCIATION IN KERNEL SPACE
Wang, Yongjin
Guan, Ling
Venetsanopoulos, A. N.
2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,
[47] EEG emotion recognition based on knowledge distillation optimized residual networks
Wang, Pai
Guo, Chunyong
Xi, Shuangqiang
Qiao, Xiang
Mao, Lili
Fu, Xianyong
2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022, : 574 - 581
[48] HAPTIC TO VISUAL CROSS-MODAL RECOGNITION OF OBJECTS IN THE VERVET MONKEY
ECHENIQUE, CR
SOSA, MV
ACTA NEUROBIOLOGIAE EXPERIMENTALIS, 1981, 41 (01) : 113 - 118
[49] A biphasic effect of cross-modal priming on visual shape recognition
Kwok, Sze Chai
Fantoni, Carlo
Tamburini, Laura
Wang, Lei
Gerbino, Walter
ACTA PSYCHOLOGICA, 2018, 183 : 43 - 50
[50] Continuous cross-modal hashing
Zheng, Hao
Wang, Jinbao
Zhen, Xiantong
Song, Jingkuan
Zheng, Feng
Lu, Ke
Qi, Guo-Jun
PATTERN RECOGNITION, 2023, 142

← 1 2 3 4 5 →