Momentum Distillation Improves Multimodal Sentiment Analysis

被引:2
|
作者
Li, Siqi [1 ]
Deng, Weihong [1 ]
Hu, Jiani [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
关键词
Multimodal sentiment analysis; Sarcasm detection; Momentum distillation;
D O I
10.1007/978-3-031-18907-4_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the development of computer technology, the Internet floods with abundant multimodal data. For better understanding users' feelings, multimodal sentiment analysis and sarcasm detection have become popular research topics. However, previous studies did not take noise into account when designing models. In this paper, based on designing a novel architecture, we also introduce a momentum distillation method to improve the model's performance from noisy data. Specifically, we propose the Transformer-Based Network with Momentum Distillation (TBNMD). For model architecture, we first encode different modalities to obtain hidden representations. Then we use a multimodal interaction module to obtain text-guided image features and image-guided text features. After that, we use a multimodal fusion module to obtain the fusion features. For momentum distillation, it is a self-distillation method. During the training process, the teacher model generates semantically similar samples as additional supervision of the student model. Experimental results on five publicly available datasets demonstrate the effectiveness of our method.
引用
收藏
页码:423 / 435
页数:13
相关论文
共 50 条
  • [21] A Multimodal Approach to Image Sentiment Analysis
    Gaspar, Antonio
    Alexandre, Luis A.
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2019, PT I, 2019, 11871 : 302 - 309
  • [22] Sentiment analysis of multimodal twitter data
    Akshi Kumar
    Geetanjali Garg
    Multimedia Tools and Applications, 2019, 78 : 24103 - 24119
  • [23] Trustworthy Multimodal Fusion for Sentiment Analysis in Ordinal Sentiment Space
    Xie, Zhuyang
    Yang, Yan
    Wang, Jie
    Liu, Xiaorong
    Li, Xiaofan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7657 - 7670
  • [24] UsbVisdaNet: User Behavior Visual Distillation and Attention Network for Multimodal Sentiment Classification
    Hou, Shangwu
    Tuerhong, Gulanbaier
    Wushouer, Mairidan
    SENSORS, 2023, 23 (10)
  • [25] Multi-grained fusion network with self-distillation for aspect-based multimodal sentiment analysis
    Yang, Juan
    Xiao, Yali
    Du, Xu
    KNOWLEDGE-BASED SYSTEMS, 2024, 293
  • [26] Multimodal transformer with adaptive modality weighting for multimodal sentiment analysis
    Wang, Yifeng
    He, Jiahao
    Wang, Di
    Wang, Quan
    Wan, Bo
    Luo, Xuemei
    NEUROCOMPUTING, 2024, 572
  • [27] Joint training strategy of unimodal and multimodal for multimodal sentiment analysis
    Li, Meng
    Zhu, Zhenfang
    Li, Kefeng
    Zhou, Lihua
    Zhao, Zhen
    Pei, Hongli
    IMAGE AND VISION COMPUTING, 2024, 149
  • [28] RSMoDM: Multimodal Momentum Distillation Model for Remote Sensing Visual Question Answering
    Li, Pengfei
    Liu, Gang
    He, Jinlong
    Meng, Xiangxu
    Zhong, Shenjun
    Chen, Xun
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 16799 - 16814
  • [29] Multimodal sentiment analysis with unidirectional modality translation
    Yang, Bo
    Shao, Bo
    Wu, Lijun
    Lin, Xiaola
    NEUROCOMPUTING, 2022, 467 : 130 - 137
  • [30] Attention fusion network for multimodal sentiment analysis
    Yuanyi Luo
    Rui Wu
    Jiafeng Liu
    Xianglong Tang
    Multimedia Tools and Applications, 2024, 83 : 8207 - 8217