Hate-UDF: Explainable Hateful Meme Detection With Uncertainty-Aware Dynamic Fusion

被引：0

作者：

Lei, Xia ^{[1
]}

Wang, Siqi ^{[2
]}

Fan, Yongkai ^{[1
]}

Shang, Wenqian ^{[1
]}

机构：

[1] Commun Univ China, State Key Lab Media Convergence & Commun, Beijing, Peoples R China

[2] Hebei Normal Univ, Coll Comp & Cyber Secur, Shijiazhuang, Peoples R China

来源：

SOFTWARE-PRACTICE & EXPERIENCE | 2024年

关键词：

detection; dynamic fusion; hateful meme; interpretable; multi-modal;

D O I：

10.1002/spe.3403

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

BackgroundWith the increasing integration of Artificial Intelligence (AI) and Internet of Things (IoT), the dissemination of multimodal data is undergoing revolutionary changes. To mitigate the societal risks posed by the rapid spread of malicious multimodal data, such as hateful memes, it is crucial to develop effective detection methods for such data. Existing detection models often struggle with data quality issues and lack interpretability, limiting their effectiveness in content moderation tasks.AimsThis paper aims to propose an explainable hateful meme detection model by uncertainty-aware dynamic fusion. The goal is to enhance both generalization performance and interpretability, addressing the limitations of conventional static fusion methods and existing algorithms for hateful meme detection.Materials & MethodsTo mitigate the societal risks posed by the rapid spread of malicious multimodal data, such as hateful memes, it is crucial to develop effective detection methods for such data. However, existing algorithms for hateful meme detection frequently overlook the data quality and the interpretability of model. To adress these challenges, this paper proposes Hate-UDF, an explainable hateful meme detection model with uncertainty-aware dynamic fusion, providing both high generalization ability and interpretability. This method dynamically evaluates the uncertainty of different modalities, obtains dynamic weights, and utilizes them to weight the feature values for fusion, thereby obtaining a uncertainty-aware dynamic fusion method with provable upper bounds on generalization error. Furthermore, an analysis of the dynamic weights can explain the modality on which the model primarily relies for detection, thereby providing a method that is both explainable and reliable.ResultsWe compare the performance of Hate-UDF with three general models and three State of the Art (SOTA) models in the field of hateful meme detection on the Facebook Hateful Memes (FHM) and the Multimedia Automatic Misogyny Identification (MAMI) datasets. Hate-UDF achieved state-of-the-art performance, surpassing existing models on both datasets. Specifically, it improved accuracy and AUC by 7.56% and 2.8% on FHM and by 3.34% and 0.17% on MAMI compared with the current SOTA model, respectively. Additionally, we demonstrate that the visual modality is more important than the textual modality in the hateful meme detection model, and we explain the primary reason behind this by visualization.DiscussionThe model dynamically adapts to modality quality, enhancing reliability and reducing the risk of misclassification. Its interpretability, achieved through visualizations of modality and feature attributions, provides valuable insights for content moderation systems and highlights the importance of image modality in detecting hateful meme. While Hate-UDF provides an explainable and reliable method for detecting hateful memes, it may still learn biases from the training data, potentially leading to the over-detection of content from certain groups or communities. Future research must focus on improving the fairness and ethical responsibilities of the model's decisions.ConclusionThis paper introduces the model of Hate-UDF, a dynamic fusion method based on uncertainty, designed to improve multimodal fusion issues in existing hateful meme detection models. The model determines the reliability of different modal information by assessing their uncertainty and generates dynamic weights accordingly. By comparing these weights, the model can identify which modality is most influential in detecting malicious content. Therefore, the Hate-UDF model not only has interpretability but also its generalization performance has been validated.

引用

页数：13

共 50 条

[41] Uncertainty-aware credit card fraud detection using deep learning
Habibpour, Maryam
Gharoun, Hassan
Mehdipour, Mohammadreza
Tajally, Amirreza
Asgharnezhad, Hamzeh
Shamsi, Afshar
Khosravi, Abbas
Nahavandi, Saeid
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
[42] Uncertainty-Aware Hardware Trojan Detection Using Multimodal Deep Learning
Vishwakarma, Rahul
Rezaei, Amin
Proceedings -Design, Automation and Test in Europe, DATE, 2024,
[43] Uncertainty-Aware Graph-Guided Weakly Supervised Object Detection
Zhu, Yueyi
Zhang, Yongqiang
Ding, Mingli
Zuo, Wangmeng
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (07) : 3257 - 3269
[44] Uncertainty-Aware Hardware Trojan Detection Using Multimodal Deep Learning
Vishwakarma, Rahul
Rezaei, Amin
2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
[45] Uncertainty-aware convolutional neural network for explainable artificial intelligence-assisted disaster damage assessment
Cheng, Chih-Shen
Behzadan, Amir H.
Noshadravan, Arash
STRUCTURAL CONTROL & HEALTH MONITORING, 2022, 29 (10):
[46] Uncertainty-aware Online Learning for Dynamic Power Management in Large Manycore systems
Narang, Gaurav
Ayoub, Raid
Kishinevsky, Michael
Doppa, Janardhan Rao
Pande, Partha Pratim
2023 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, ISLPED, 2023,
[47] Uncertainty-Aware Deep Learning Architectures for Highly Dynamic Air Quality Prediction
Mokhtari, Ichrak
Bechkit, Walid
Rivano, Herve
Yaici, Mouloud Riadh
IEEE ACCESS, 2021, 9 : 14765 - 14778
[48] Uncertainty-Aware Fast Curb Detection Using Convolutional Networks in Point Clouds
Jung, Younghwa
Jeon, Mingu
Kim, Chan
Seo, Seung-Woo
Kim, Seong-Woo
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 12882 - 12888
[49] Uncertainty-aware accurate insulator fault detection based on an improved YOLOX model
Dai, Zhiyong
ENERGY REPORTS, 2022, 8 : 12809 - 12821
[50] Uncertainty-Aware COVID-19 Detection from Imbalanced Sound Data
Xia, Tong
Han, Jing
Qendro, Lorena
Dang, Ting
Mascolo, Cecilia
INTERSPEECH 2021, 2021, : 2951 - 2955

← 1 2 3 4 5 →