Momentum Distillation Improves Multimodal Sentiment Analysis

被引:2
|
作者
Li, Siqi [1 ]
Deng, Weihong [1 ]
Hu, Jiani [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
关键词
Multimodal sentiment analysis; Sarcasm detection; Momentum distillation;
D O I
10.1007/978-3-031-18907-4_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the development of computer technology, the Internet floods with abundant multimodal data. For better understanding users' feelings, multimodal sentiment analysis and sarcasm detection have become popular research topics. However, previous studies did not take noise into account when designing models. In this paper, based on designing a novel architecture, we also introduce a momentum distillation method to improve the model's performance from noisy data. Specifically, we propose the Transformer-Based Network with Momentum Distillation (TBNMD). For model architecture, we first encode different modalities to obtain hidden representations. Then we use a multimodal interaction module to obtain text-guided image features and image-guided text features. After that, we use a multimodal fusion module to obtain the fusion features. For momentum distillation, it is a self-distillation method. During the training process, the teacher model generates semantically similar samples as additional supervision of the student model. Experimental results on five publicly available datasets demonstrate the effectiveness of our method.
引用
收藏
页码:423 / 435
页数:13
相关论文
共 50 条
  • [41] MemoSen: A Multimodal Dataset for Sentiment Analysis of Memes
    Hossain, Eftekhar
    Sharif, Omar
    Hoque, Mohammed Moshiul
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1542 - 1554
  • [42] PowMix: A Versatile Regularizer for Multimodal Sentiment Analysis
    Georgiou, Efthymios
    Avrithis, Yannis
    Potamianos, Alexandros
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 5010 - 5023
  • [43] Multimodal learning for topic sentiment analysis in microblogging
    Huang, Faliang
    Zhang, Shichao
    Zhang, Jilian
    Yu, Ge
    NEUROCOMPUTING, 2017, 253 : 144 - 153
  • [44] Disentanglement Translation Network for multimodal sentiment analysis
    Zeng, Ying
    Yan, Wenjun
    Mai, Sijie
    Hu, Haifeng
    INFORMATION FUSION, 2024, 102
  • [45] Analyzing Modality Robustness in Multimodal Sentiment Analysis
    Hazarika, Devamanyu
    Li, Yingting
    Cheng, Bo
    Zhao, Shuai
    Zimmermann, Roger
    Pone, Soujanya
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 685 - 696
  • [46] Deep Learning Approaches on Multimodal Sentiment Analysis
    Cai, Zisheng
    Gao, Han
    Li, Jiaye
    Wang, Xinyi
    2022 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, BIG DATA AND ALGORITHMS (EEBDA), 2022, : 1127 - 1131
  • [47] Multimodal Sentiment Analysis To Explore the Structure of Emotions
    Hu, Anthony
    Flaxman, Seth
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 350 - 358
  • [48] Multimodal Sentiment Analysis Using Deep Learning
    Sharma, Rakhee
    Le Ngoc Tan
    Sadat, Fatiha
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 1475 - 1478
  • [49] Multimodal PEAR Chain-of-Thought Reasoning for Multimodal Sentiment Analysis
    Li, Yan
    Lan, Xiangyuan
    Chen, Haifeng
    Lu, Ke
    Jiang, Dongmei
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (09)
  • [50] MAG plus : AN EXTENDED MULTIMODAL ADAPTATION GATE FOR MULTIMODAL SENTIMENT ANALYSIS
    Zhao, Xianbing
    Chen, Yixin
    Li, Wanting
    Gao, Lei
    Tang, Buzhou
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4753 - 4757