Are You Paying Attention? Multimodal Linear Attention Transformers for Affect Prediction in Video Conversations

被引:0
|
作者
Poh, Jia Qing [1 ]
See, John [1 ]
El Gayar, Neamat [2 ]
Wong, Lai-Kuan [3 ]
机构
[1] Heriot Watt Univ Malaysia, Sch Math & Comp Sci, Putrajaya, Malaysia
[2] Heriot Watt Univ Dubai, Sch Math & Comp Sci, Dubai, U Arab Emirates
[3] Multimedia Univ, Fac Comp & Informat, Cyberjaya, Malaysia
关键词
multimodal transformers; linear attention; affect prediction; video conversations;
D O I
10.1145/3689092.3689409
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The post-COVID-19 era has seen continual adoption and reliance on video-based communication, underscoring the need for unobtrusive affect recognition in digital interactions. This paper proposes an efficient multimodal approach to emotion recognition in video conversational scenarios, leveraging linear attention-based Transformer networks to process both visual and audio cues. We explore various linear attention mechanisms, comparing them with classical self-attention. Using the K-EmoCon dataset, we demonstrate that the proposed approach yields competitive performance in predicting the affective states of conversing persons while significantly improving memory efficiency. Our ablation studies reveal that carefully tuned simple fusion methods can match or exceed more complex approaches. This research contributes to developing more accessible and efficient multimodal emotion recognition systems for video-based conversations, with applications for enhancing remote communication and monitoring digital well-being in the post-pandemic era.
引用
收藏
页码:15 / 23
页数:9
相关论文
共 50 条
  • [1] Are you paying attention?
    LeBlanc, Amy
    ANTIGONISH REVIEW, 2018, (192): : 28 - 28
  • [2] ARE YOU PAYING ATTENTION
    KOHN, DW
    JOURNAL OF DENTISTRY FOR CHILDREN, 1992, 59 (02): : 160 - 160
  • [3] Are you paying attention?
    Brewer, T
    AMERICAN JOURNAL OF NURSING, 2003, 103 (07) : 58 - 63
  • [4] Paying attention to uncertainty: A stochastic multimodal transformers for post-traumatic stress disorder detection using video
    Dia, Mamadou
    Khodabandelou, Ghazaleh
    Othmani, Alice
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 257
  • [5] Are you paying attention yet?
    Kimmel, C
    ATHLETIC THERAPY TODAY, 2004, 9 (06): : 1 - 1
  • [7] Paying Attention to Video Generation
    Bhagwatkar, Rishika
    Fitter, Khurshed
    Bachu, Saketh
    Kulkarni, Akshay
    Chiddarwar, Shital
    NEURIPS 2020 WORKSHOP ON PRE-REGISTRATION IN MACHINE LEARNING, VOL 148, 2020, 148 : 139 - 154
  • [8] TICI: If You Are Not Confused, Then You Are Not Paying Attention
    Kallmes, D. F.
    AMERICAN JOURNAL OF NEURORADIOLOGY, 2012, 33 (05) : 975 - 976
  • [9] Are You a Disruptor and Is Anyone Paying Attention?
    Doarn, Charles R.
    Merrell, Ronald C.
    TELEMEDICINE AND E-HEALTH, 2015, 21 (05) : 319 - 320
  • [10] Paying Attention to Wildfire: Using U-Net with Attention Blocks on Multimodal Data for Next Day Prediction
    Fitzgerald, Jack
    Seefried, Ethan
    Yost, James
    Pallickara, Sangmi
    Blanchard, Nathaniel
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2023, 2023, : 470 - 480