Image-to-Text Conversion and Aspect-Oriented Filtration for Multimodal Aspect-Based Sentiment Analysis

Cited by: 4

Authors:

Wang, Qianlong ^{[1]}

Xu, Hongling ^{[1]}

Wen, Zhiyuan ^{[1]}

Liang, Bin ^{[1]}

Yang, Min ^{[2]}

Qin, Bing ^{[3]}

Xu, Ruifeng ^{[1,4,5]}

Affiliations:

[1] Harbin Inst Technol Shenzhen, Sch Comp Sci & Technol, Shenzhen 518055, Peoples R China

[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China

[3] Harbin Inst Technol, Harbin 150001, Peoples R China

[4] Peng Cheng Lab, Shenzhen 518000, Peoples R China

[5] Guangdong Prov Key Lab Novel Secur Intelligence Te, Shenzhen 518055, Peoples R China

Source:

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING | 2024 / Vol. 15 / Issue 03

Funding:

National Natural Science Foundation of China;

Keywords:

Sentiment analysis; Visualization; Task analysis; Social networking (online); Filtration; Analytical models; Electronic mail; Aspect-Based sentiment analysis; multimodal sentiment analysis; natural language processing; pre-trained language model; CLASSIFICATION;

DOI:

10.1109/TAFFC.2023.3333200

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Multimodal aspect-based sentiment analysis (MABSA) aims to determine the sentiment polarity of each aspect mentioned in the text based on multimodal content. Various approaches have been proposed to model multimodal sentiment features for each aspect via modal interactions. However, most existing approaches have two shortcomings: (1) the representation gap between the textual and visual modalities may increase the risk of misalignment in modal interactions; (2) in examples where the image is unrelated to the text, the visual information may not enrich the textual modality when learning aspect-based sentiment features. In such cases, blindly leveraging visual information may introduce noise when reasoning about aspect-based sentiment expressions. To tackle these shortcomings, we propose an end-to-end MABSA framework with image conversion and noise filtration. Specifically, to bridge the representation gap between modalities, we translate images into the input space of a pre-trained language model (PLM). To this end, we develop an image-to-text conversion module that converts an image into an implicit sequence of token embeddings. Moreover, we devise an aspect-oriented filtration module, consisting of two attention operations, to alleviate the noise in the implicit token embeddings. After filtering the noise, we use a PLM to encode the text, the aspect, and the image prompt derived from the filtered implicit token embeddings as sentiment features for aspect-based sentiment prediction. Experimental results on two MABSA datasets show that our framework achieves state-of-the-art performance. Furthermore, extensive experimental analysis demonstrates that the proposed framework has superior robustness and efficiency.
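
The pipeline described in the abstract (image converted to implicit token embeddings, filtered with respect to the aspect, then concatenated with text and aspect for the PLM) can be illustrated with a minimal NumPy sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the linear projection `W`, the function names, the choice of what each of the two attention operations attends to, and the mean-pooled aspect vector. The abstract only states that the filtration module consists of two attention operations; it does not say how they are arranged.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def image_to_tokens(image_feats, W):
    """Project image features (n, d_img) into the PLM embedding space (n, d_model).
    A single linear map is an assumption; the paper's module may differ."""
    return image_feats @ W

def cross_attention(queries, keys, values):
    """Scaled dot-product attention: each query row attends over the key rows."""
    scores = queries @ keys.T / np.sqrt(keys.shape[-1])
    return softmax(scores, axis=-1) @ values

def aspect_oriented_filter(img_tokens, text_emb, aspect_emb):
    """Hypothetical two-step filter over the implicit image tokens."""
    # Attention 1 (assumed): image tokens gather context from the sentence.
    ctx = cross_attention(img_tokens, text_emb, text_emb)
    # Attention 2 (assumed): the pooled aspect scores each contextualised token.
    aspect_vec = aspect_emb.mean(axis=0, keepdims=True)            # (1, d_model)
    relevance = softmax(aspect_vec @ ctx.T / np.sqrt(ctx.shape[-1]), axis=-1)
    # Down-weight tokens that look irrelevant to the aspect.
    return img_tokens * relevance.T                                # (n, d_model)

def build_plm_input(text_emb, aspect_emb, image_prompt):
    """Concatenate text, aspect, and image prompt along the sequence axis."""
    return np.concatenate([text_emb, aspect_emb, image_prompt], axis=0)

rng = np.random.default_rng(0)
image_feats = rng.normal(size=(5, 8))    # 5 image regions, 8-dim features (toy sizes)
W = rng.normal(size=(8, 16))             # projection into a 16-dim embedding space
text_emb = rng.normal(size=(7, 16))      # 7 text tokens
aspect_emb = rng.normal(size=(2, 16))    # 2 aspect tokens

img_tokens = image_to_tokens(image_feats, W)
image_prompt = aspect_oriented_filter(img_tokens, text_emb, aspect_emb)
plm_input = build_plm_input(text_emb, aspect_emb, image_prompt)
print(plm_input.shape)
```

In this sketch the relevance weights sum to 1 across image tokens, so an image with no aspect-relevant region still contributes some signal; a learned gate or threshold would be one alternative, but which mechanism the authors use is not stated in this record.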

Pages: 1264-1278

Page count: 15

50 records in total

[41] Aspect-Based Sentiment Analysis Approach with CNN
Mulyo, Budi M.
Widyantoro, Dwi H.
2018 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, COMPUTER SCIENCE AND INFORMATICS (EECSI 2018), 2018, : 142 - 147
[42] Aspect-based sentiment analysis of mobile reviews
Gupta, Vedika
Singh, Vivek Kumar
Mukhija, Pankaj
Ghose, Udayan
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (05) : 4721 - 4730
[43] A corpus for aspect-based sentiment analysis in Vietnamese
Nguyen, Minh-Hao
Nguyen, Tri Minh
Thin, Dang Van
Nguyen, Ngan Luu-Thuy
PROCEEDINGS OF 2019 11TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2019), 2019, : 317 - 321
[44] Towards Generative Aspect-Based Sentiment Analysis
Zhang, Wenxuan
Li, Xin
Deng, Yang
Bing, Lidong
Lam, Wai
ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 504 - 510
[45] Retrieving Users' Opinions on Social Media with Multimodal Aspect-Based Sentiment Analysis
Anschuetz, Miriam
Eder, Tobias
Groh, Georg
2023 IEEE 17TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, ICSC, 2023, : 1 - 8
[46] MSFNet: modality smoothing fusion network for multimodal aspect-based sentiment analysis
Xiang, Yan
Cai, Yunjia
Guo, Junjun
FRONTIERS IN PHYSICS, 2023, 11
[47] Interactive Fusion Network with Recurrent Attention for Multimodal Aspect-based Sentiment Analysis
Wang, Jun
Wang, Qianlong
Wen, Zhiyuan
Liang, Xingwei
Xu, Ruifeng
ARTIFICIAL INTELLIGENCE, CICAI 2022, PT III, 2022, 13606 : 298 - 309
[48] MCPR: A Chinese Product Review Dataset for Multimodal Aspect-Based Sentiment Analysis
Xu, Carol
Luo, Xuan
Wang, Dan
COGNITIVE COMPUTING, ICCC 2022, 2022, 13734 : 83 - 90
[49] Aspect-Based Sentiment Quantification
Matsiiako, Vladyslav
Frasincar, Flavius
Boekestijn, David
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2022, 13 (04) : 1718 - 1729
[50] Aspect Feature Distillation and Enhancement Network for Aspect-based Sentiment Analysis
Liu, Rui
Cao, Jiahao
Sun, Nannan
Jiang, Lei
PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 1577 - 1587