Image-to-Text Conversion and Aspect-Oriented Filtration for Multimodal Aspect-Based Sentiment Analysis

被引:4
|
作者
Wang, Qianlong [1 ]
Xu, Hongling [1 ]
Wen, Zhiyuan [1 ]
Liang, Bin [1 ]
Yang, Min [2 ]
Qin, Bing [3 ]
Xu, Ruifeng [1 ,4 ,5 ]
机构
[1] Harbin Inst Technol Shenzhen, Sch Comp Sci & Technol, Shenzhen 518055, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[3] Harbin Inst Technol, Harbin 150001, Peoples R China
[4] Peng Cheng Lab, Shenzhen 518000, Peoples R China
[5] Guangdong Prov Key Lab Novel Secur Intelligence Te, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Sentiment analysis; Visualization; Task analysis; Social networking (online); Filtration; Analytical models; Electronic mail; Aspect-Based sentiment analysis; multimodal sentiment analysis; natural language processing; pre-trained language model; CLASSIFICATION;
D O I
10.1109/TAFFC.2023.3333200
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal aspect-based sentiment analysis (MABSA) aims to determine the sentiment polarity of each aspect mentioned in the text based on multimodal content. Various approaches have been proposed to model multimodal sentiment features for each aspect via modal interactions. However, most existing approaches have two shortcomings: (1) The representation gap between textual and visual modalities may increase the risk of misalignment in modal interactions; (2) In some examples where the image is not related to the text, the visual information may not enrich the textual modality when learning aspect-based sentiment features. In such cases, blindly leveraging visual information may introduce noises in reasoning the aspect-based sentiment expressions. To tackle these shortcomings, we propose an end-to-end MABSA framework with image conversion and noise filtration. Specifically, to bridge the representation gap in different modalities, we attempt to translate images into the input space of a pre-trained language model (PLM). To this end, we develop an image-to-text conversion module that can convert an image to an implicit sequence of token embedding. Moreover, an aspect-oriented filtration module is devised to alleviate the noise in the implicit token embeddings, which consists of two attention operations. After filtering the noise, we leverage a PLM to encode the text, aspect, and image prompt derived from filtered implicit token embeddings as sentiment features to perform aspect-based sentiment prediction. Experimental results on two MABSA datasets show that our framework achieves state-of-the-art performance. Furthermore, extensive experimental analysis demonstrates the proposed framework has superior robustness and efficiency.
引用
收藏
页码:1264 / 1278
页数:15
相关论文
共 50 条
  • [41] Aspect-Based Sentiment Analysis Approach with CNN
    Mulyo, Budi M.
    Widyantoro, Dwi H.
    2018 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, COMPUTER SCIENCE AND INFORMATICS (EECSI 2018), 2018, : 142 - 147
  • [42] Aspect-based sentiment analysis of mobile reviews
    Gupta, Vedika
    Singh, Vivek Kumar
    Mukhija, Pankaj
    Ghose, Udayan
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (05) : 4721 - 4730
  • [43] A corpus for aspect-based sentiment analysis in Vietnamese
    Nguyen, Minh-Hao
    Nguyen, Tri Minh
    Thin, Dang Van
    Nguyen, Ngan Luu-Thuy
    PROCEEDINGS OF 2019 11TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2019), 2019, : 317 - 321
  • [44] Towards Generative Aspect-Based Sentiment Analysis
    Zhang, Wenxuan
    Li, Xin
    Deng, Yang
    Bing, Lidong
    Lam, Wai
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 504 - 510
  • [45] Retrieving Users' Opinions on Social Media with Multimodal Aspect-Based Sentiment Analysis
    Anschuetz, Miriam
    Eder, Tobias
    Groh, Georg
    2023 IEEE 17TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, ICSC, 2023, : 1 - 8
  • [46] MSFNet: modality smoothing fusion network for multimodal aspect-based sentiment analysis
    Xiang, Yan
    Cai, Yunjia
    Guo, Junjun
    FRONTIERS IN PHYSICS, 2023, 11
  • [47] Interactive Fusion Network with Recurrent Attention for Multimodal Aspect-based Sentiment Analysis
    Wang, Jun
    Wang, Qianlong
    Wen, Zhiyuan
    Liang, Xingwei
    Xu, Ruifeng
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT III, 2022, 13606 : 298 - 309
  • [48] MCPR: A Chinese Product Review Dataset for Multimodal Aspect-Based Sentiment Analysis
    Xu, Carol
    Luo, Xuan
    Wang, Dan
    COGNITIVE COMPUTING, ICCC 2022, 2022, 13734 : 83 - 90
  • [49] Aspect-Based Sentiment Quantification
    Matsiiako, Vladyslav
    Frasincar, Flavius
    Boekestijn, David
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2022, 13 (04) : 1718 - 1729
  • [50] Aspect Feature Distillation and Enhancement Network for Aspect-based Sentiment Analysis
    Liu, Rui
    Cao, Jiahao
    Sun, Nannan
    Jiang, Lei
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 1577 - 1587