Multi-Level Multi-Modal Cross-Attention Network for Fake News Detection

Cited by: 31
Authors
Ying, Long [1]
Yu, Hui [1]
Wang, Jinguang [2]
Ji, Yongze [3]
Qian, Shengsheng [4]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing 210044, Peoples R China
[2] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230601, Peoples R China
[3] China Univ Petr, Sch Informat Sci & Engn, Beijing 102249, Peoples R China
[4] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
Source
IEEE ACCESS | 2021 / Vol. 9
Funding
National Natural Science Foundation of China;
Keywords
Feature extraction; Semantics; Visualization; Task analysis; Bit error rate; Convolutional neural networks; Social networking (online); Multi-level neural networks; fake news detection; multi-modal fusion;
DOI
10.1109/ACCESS.2021.3114093
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
With the development of the mobile Internet, more and more users publish multi-modal posts on social media platforms, and fake news detection has become an increasingly challenging task. Although many works use deep learning schemes to extract and combine the textual and visual representations of a post, most existing methods do not sufficiently utilize the complementary multi-modal information, which contains semantic concepts and entities, to complement and enhance each modality. Moreover, these methods do not model and incorporate the rich multi-level semantics of text information to improve fake news detection. In this paper, we propose a novel end-to-end Multi-level Multi-modal Cross-attention Network (MMCN), which exploits the multi-level semantics of textual content and jointly integrates the relationships of duplicate and different modalities (textual and visual modality) of social multimedia posts in a unified framework. Pre-trained BERT and ResNet models are employed to generate high-quality representations for text words and image regions, respectively. A multi-modal cross-attention network is then designed to fuse the feature embeddings of the text words and image regions by simultaneously considering data relationships in duplicate and different modalities. Specifically, because different layers of the transformer architecture have different feature representations, we employ a multi-level encoding network to capture the rich multi-level semantics and enhance the representations of posts. Extensive experiments on two public datasets (WEIBO and PHEME) demonstrate that the proposed MMCN performs favorably compared with state-of-the-art models.
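The cross-attention fusion described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' MMCN implementation: the feature matrices are random stand-ins for BERT word embeddings and ResNet region features, the sizes (12 words, 49 regions, 64 dimensions) are hypothetical, and the learned query/key/value projections and the multi-level encoding network are omitted. It only shows the general idea of each modality attending over the other before the results are pooled and concatenated.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(queries, context):
    """Scaled dot-product attention: each query row attends over all context rows."""
    d = queries.shape[-1]
    scores = queries @ context.T / np.sqrt(d)   # shape (n_queries, n_context)
    weights = softmax(scores, axis=-1)          # each row sums to 1
    return weights @ context, weights

# Hypothetical stand-ins for pre-trained features (assumed shapes, random values).
rng = np.random.default_rng(0)
text_feats = rng.normal(size=(12, 64))    # 12 word embeddings
image_feats = rng.normal(size=(49, 64))   # 49 image-region features

# Text words attend over image regions, and image regions attend over text words.
text_attended, w_t2i = cross_attention(text_feats, image_feats)
image_attended, w_i2t = cross_attention(image_feats, text_feats)

# Mean-pool each attended sequence and concatenate into one post-level vector.
fused = np.concatenate([text_attended.mean(axis=0), image_attended.mean(axis=0)])
```

In a trained model, `fused` would typically feed a classifier that predicts whether the post is fake; that step is left out here because the abstract does not specify it.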
Pages: 132363 - 132373
Number of pages: 11