Multi-level correlation mining framework with self-supervised label generation for multimodal sentiment analysis

被引:17
|
作者
Li, Zuhe [1 ]
Guo, Qingbing [1 ]
Pan, Yushan [2 ]
Ding, Weiping [3 ]
Yu, Jun [1 ]
Zhang, Yazhou [4 ]
Liu, Weihua [5 ]
Chen, Haoran [1 ]
Wang, Hao [6 ]
Xie, Ying [7 ]
机构
[1] Zhengzhou Univ Light Ind, Sch Comp & Commun Engn, Zhengzhou 450002, Peoples R China
[2] Xian Jiaotong Liverpool Univ, Sch Adv Technol, Dept Comp, Suzhou 215123, Peoples R China
[3] Nantong Univ, Sch Informat Sci & Technol, Nantong 226019, Peoples R China
[4] Zhengzhou Univ Light Ind, Coll Software Engn, Zhengzhou 450002, Peoples R China
[5] China Mobile Res Inst, Beijing 100053, Peoples R China
[6] Xidian Univ, Xian 710071, Peoples R China
[7] Putian Univ, Putian 351100, Peoples R China
基金
中国国家自然科学基金;
关键词
Multimodal sentiment analysis; Unimodal feature fusion; Linguistic-guided transformer; Self-supervised label generation; INFORMATION FUSION; MECHANISM; LSTM;
D O I
10.1016/j.inffus.2023.101891
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fusion and co-learning are major challenges in multimodal sentiment analysis. Most existing methods either ignore the basic relationships among modalities or fail to maximize their potential correlations. They also do not leverage the knowledge from resource-rich modalities in the analysis of resource-poor modalities. To address these challenges, we propose a multimodal sentiment analysis method based on multilevel correlation mining and self-supervised multi-task learning. First, we propose a unimodal feature fusion-and linguistics guided Transformer-based framework, multi-level correlation mining framework, to overcome the difficulty of multimodal information fusion. The module exploits the correlation information between modalities from low to high levels. Second, we divided the multimodal sentiment analysis task into one multimodal task and three unimodal tasks (linguistic, acoustic, and visual tasks), and designed a self-supervised label generation module (SLGM) to generate sentiment labels for unimodal tasks. SLGM-based multi-task learning overcomes the lack of unimodal labels in co-learning. Through extensive experiments on the CMU-MOSI and CMU-MOSEI datasets, we demonstrated the superiority of the proposed multi-level correlation mining framework to state-of-the-art methods.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] DRSS: a multimodal sentiment analysis approach based on dual representation and self-supervised learning strategy
    Meng, Jing
    Zhu, Zhenfang
    Qi, Jiangtao
    Zhang, Huaxiang
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
  • [22] Interpretability in Sentiment Analysis: A Self-Supervised Approach to Sentiment Cue Extraction
    Sun, Yawei
    He, Saike
    Han, Xu
    Luo, Yan
    APPLIED SCIENCES-BASEL, 2024, 14 (07):
  • [23] Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization
    Qian, Rui
    Li, Yuxi
    Liu, Huabin
    See, John
    Ding, Shuangrui
    Liu, Xian
    Li, Dian
    Lin, Weiyao
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7970 - 7981
  • [24] Self-HCL: Self-Supervised Multitask Learning with Hybrid Contrastive Learning Strategy for Multimodal Sentiment Analysis
    Fu, Youjia
    Fu, Junsong
    Xue, Huixia
    Xu, Zihao
    ELECTRONICS, 2024, 13 (14)
  • [25] Multi-level Multi-task representation learning with adaptive fusion for multimodal sentiment analysis
    Chuanbo Zhu
    Min Chen
    Haomin Li
    Sheng Zhang
    Han Liang
    Chao Sun
    Yifan Liu
    Jincai Chen
    Neural Computing and Applications, 2025, 37 (3) : 1491 - 1508
  • [26] Multimodal Self-supervised Learning for Medical Image Analysis
    Taleb, Aiham
    Lippert, Christoph
    Klein, Tassilo
    Nabi, Moin
    INFORMATION PROCESSING IN MEDICAL IMAGING, IPMI 2021, 2021, 12729 : 661 - 673
  • [27] Text-guided deep correlation mining and self-learning feature fusion framework for multimodal sentiment analysis
    Zhu, Minghui
    He, Xiaojiang
    Qiao, Baojie
    Luo, Yiming
    Li, Zuhe
    Pan, Yushan
    KNOWLEDGE-BASED SYSTEMS, 2025, 315
  • [28] Mining Nuanced Weibo Sentiment with Hierarchical Graph Modeling and Self-Supervised Learning
    Wang, Chuyang
    Konpang, Jessada
    Sirikham, Adisorn
    Tian, Shasha
    ELECTRONICS, 2025, 14 (01):
  • [29] Sentiment Lexical Strength Enhanced Self-supervised Attention Learning for sentiment analysis
    Wang, Xi
    Fan, Mengmeng
    Kong, Mingming
    Pei, Zheng
    KNOWLEDGE-BASED SYSTEMS, 2022, 252
  • [30] Multi-level feature optimization and multimodal contextual fusion for sentiment analysis and emotion classification
    Huddar, Mahesh G.
    Sannakki, Sanjeev S.
    Rajpurohit, Vijay S.
    COMPUTATIONAL INTELLIGENCE, 2020, 36 (02) : 861 - 881