Multi-Level Multi-Modal Cross-Attention Network for Fake News Detection

Cited by: 31

Authors:

Ying, Long [1]

Yu, Hui [1]

Wang, Jinguang [2]

Ji, Yongze [3]

Qian, Shengsheng [4]

Affiliations:

[1] Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing 210044, Peoples R China

[2] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230601, Peoples R China

[3] China Univ Petr, Sch Informat Sci & Engn, Beijing 102249, Peoples R China

[4] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China

Source:

IEEE ACCESS | 2021 / Vol. 9

Funding:

National Natural Science Foundation of China;

Keywords:

Feature extraction; Semantics; Visualization; Task analysis; Bit error rate; Convolutional neural networks; Social networking (online); Multi-level neural networks; fake news detection; multi-modal fusion;

DOI:

10.1109/ACCESS.2021.3114093

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

With the development of the mobile Internet, more and more users publish multi-modal posts on social media platforms, and fake news detection has become an increasingly challenging task. Although many works use deep models to extract and combine the textual and visual representations of a post, most existing methods do not sufficiently exploit the complementary multi-modal information containing semantic concepts and entities to enhance each modality. Moreover, these methods do not model or incorporate the rich multi-level semantics of the text to improve fake news detection. In this paper, we propose a novel end-to-end Multi-level Multi-modal Cross-attention Network (MMCN), which exploits the multi-level semantics of textual content and jointly integrates the relationships within the same modality and across different modalities (textual and visual) of social multimedia posts in a unified framework. Pre-trained BERT and ResNet models are employed to generate high-quality representations for text words and image regions, respectively. A multi-modal cross-attention network is then designed to fuse the feature embeddings of the text words and image regions by simultaneously considering data relationships within the same modality and across different modalities. Specifically, because different layers of the transformer architecture produce different feature representations, we employ a multi-level encoding network to capture the rich multi-level semantics and enhance the representations of posts. Extensive experiments on two public datasets (WEIBO and PHEME) demonstrate that the proposed MMCN performs favorably compared with state-of-the-art models.
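The fusion idea in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: it assumes word and region features already share one dimension, omits the learned query/key/value projections a real model would have, and uses hypothetical names (`cross_attention`, `fuse_post`, `text_levels`). Random arrays stand in for BERT and ResNet outputs.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the row maximum for numerical stability.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(queries, context):
    """Scaled dot-product attention: each query row attends over context rows.

    Assumption: no learned projections, so queries and context are used directly.
    """
    d = queries.shape[-1]
    scores = queries @ context.T / np.sqrt(d)   # (n_queries, n_context)
    weights = softmax(scores, axis=-1)          # each row sums to 1
    return weights @ context, weights           # (n_queries, d), attention map

def fuse_post(text_levels, image_regions):
    """Fuse word and region features at several text-encoder levels (hypothetical helper).

    text_levels: list of (n_words, d) arrays, one per encoder level.
    image_regions: (n_regions, d) array.
    """
    fused = []
    for words in text_levels:
        # Attending over the joint sequence covers same-modality and
        # cross-modality relationships in a single pass.
        joint = np.concatenate([words, image_regions], axis=0)
        attended, _ = cross_attention(joint, joint)
        fused.append(attended.mean(axis=0))     # mean-pool to one vector per level
    return np.concatenate(fused)                # (n_levels * d,)

# Stand-in features: 3 text levels, 5 words, 4 image regions, dimension 8.
rng = np.random.default_rng(0)
text_levels = [rng.normal(size=(5, 8)) for _ in range(3)]
image_regions = rng.normal(size=(4, 8))
post_vector = fuse_post(text_levels, image_regions)
```

The resulting `post_vector` concatenates one pooled vector per text level; in a full model it would feed a classifier that predicts whether the post is fake.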


Pages: 132363-132373

Page count: 11

