Image-to-Text Conversion and Aspect-Oriented Filtration for Multimodal Aspect-Based Sentiment Analysis

被引:4
|
作者
Wang, Qianlong [1 ]
Xu, Hongling [1 ]
Wen, Zhiyuan [1 ]
Liang, Bin [1 ]
Yang, Min [2 ]
Qin, Bing [3 ]
Xu, Ruifeng [1 ,4 ,5 ]
机构
[1] Harbin Inst Technol Shenzhen, Sch Comp Sci & Technol, Shenzhen 518055, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[3] Harbin Inst Technol, Harbin 150001, Peoples R China
[4] Peng Cheng Lab, Shenzhen 518000, Peoples R China
[5] Guangdong Prov Key Lab Novel Secur Intelligence Te, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Sentiment analysis; Visualization; Task analysis; Social networking (online); Filtration; Analytical models; Electronic mail; Aspect-Based sentiment analysis; multimodal sentiment analysis; natural language processing; pre-trained language model; CLASSIFICATION;
D O I
10.1109/TAFFC.2023.3333200
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal aspect-based sentiment analysis (MABSA) aims to determine the sentiment polarity of each aspect mentioned in the text based on multimodal content. Various approaches have been proposed to model multimodal sentiment features for each aspect via modal interactions. However, most existing approaches have two shortcomings: (1) The representation gap between textual and visual modalities may increase the risk of misalignment in modal interactions; (2) In some examples where the image is not related to the text, the visual information may not enrich the textual modality when learning aspect-based sentiment features. In such cases, blindly leveraging visual information may introduce noises in reasoning the aspect-based sentiment expressions. To tackle these shortcomings, we propose an end-to-end MABSA framework with image conversion and noise filtration. Specifically, to bridge the representation gap in different modalities, we attempt to translate images into the input space of a pre-trained language model (PLM). To this end, we develop an image-to-text conversion module that can convert an image to an implicit sequence of token embedding. Moreover, an aspect-oriented filtration module is devised to alleviate the noise in the implicit token embeddings, which consists of two attention operations. After filtering the noise, we leverage a PLM to encode the text, aspect, and image prompt derived from filtered implicit token embeddings as sentiment features to perform aspect-based sentiment prediction. Experimental results on two MABSA datasets show that our framework achieves state-of-the-art performance. Furthermore, extensive experimental analysis demonstrates the proposed framework has superior robustness and efficiency.
引用
收藏
页码:1264 / 1278
页数:15
相关论文
共 50 条
  • [31] Joint Modal Circular Complementary Attention for Multimodal Aspect-Based Sentiment Analysis
    Liu, Hao
    He, Lijun
    Liang, Jiaxi
    2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS, ICMEW 2024, 2024,
  • [32] Aspect Term Information Enhancement Network for Aspect-Based Sentiment Analysis
    Shen, Yafei
    Chen, Zhuo
    Di, Jiaqi
    Meng, Ying
    2024 9th International Conference on Intelligent Computing and Signal Processing, ICSP 2024, 2024, : 1198 - 1204
  • [33] Self-adaptive attention fusion for multimodal aspect-based sentiment analysis
    Wang, Ziyue
    Guo, Junjun
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2024, 21 (01) : 1305 - 1320
  • [34] Aspect-Based Sentiment Analysis for User Reviews
    Yin Zhang
    Jinyang Du
    Xiao Ma
    Haoyu Wen
    Giancarlo Fortino
    Cognitive Computation, 2021, 13 : 1114 - 1127
  • [35] Multilayer interactive attention bottleneck transformer for aspect-based multimodal sentiment analysis
    Sun, Jiachang
    Zhu, Fuxian
    MULTIMEDIA SYSTEMS, 2025, 31 (01)
  • [36] Targeted Aspect-Based Sentiment Analysis by Utilizing Dynamic Aspect Representation
    Miao, Siqi
    Lu, Meilian
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 647 - 659
  • [37] Aspect-Specific Context Modeling for Aspect-Based Sentiment Analysis
    Ma, Fang
    Zhang, Chen
    Zhang, Bo
    Song, Dawei
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT I, 2022, 13551 : 513 - 526
  • [38] MASAD: A large-scale dataset for multimodal aspect-based sentiment analysis
    Zhou, Jie
    Zhao, Jiabao
    Huang, Jimmy Xiangji
    Hu, Qinmin Vivian
    He, Liang
    NEUROCOMPUTING, 2021, 455 : 47 - 58
  • [39] Datasets for Aspect-Based Sentiment Analysis in French
    Apidianaki, Marianna
    Tannier, Xavier
    Richart, Cecile
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 1122 - 1126
  • [40] Data augmentation for aspect-based sentiment analysis
    Guangmin Li
    Hui Wang
    Yi Ding
    Kangan Zhou
    Xiaowei Yan
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 125 - 133