XAI for Image Captioning using SHAP

被引:4
|
作者
Dewi, Christine [1 ]
Chen, Rung-Ching [2 ]
Yu, Hui [3 ]
Jiang, Xiaoyi [4 ]
机构
[1] Satya Wacana Christian Univ, Dept Informat Technol, Salatiga 50517, Indonesia
[2] Chaoyang Univ Technol, Dept Informat Management, Taichung 413310, Taiwan
[3] Univ Portsmouth, Sch Creat Technol, Portsmouth PO1 2UP, England
[4] Univ Munster, Dept Math & Comp Sci, D-48149 Munster, Germany
基金
欧盟地平线“2020”;
关键词
SHAP; explainable artificial intelligence; image captioning; azure cognitive service; API; MACHINE; FRAMEWORK; TEXT;
D O I
10.6688/JISE.202307_39(4).0001
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the fields of computer vision (CV) and natural language processing (NLP), they attempt to create a textual description of a given image is known as image captioning. Captioning is the process of creating an explanation for an image. Recognizing the significant items in an image, their qualities, and their connections are required for image captioning. It must also be able to construct phrases that are valid in both syntax and semantics. Deep-learning-based approaches are deal with the intricacies and problems of image captioning. This article provides a simple and effective Explainable Artificial Intelligence (XAI) technique for image text. Deep learning techniques have been widely applied to this work in recent years, and the results have been relatively positive. This work employs Azure Cognitive Service and Open-Source Image Captioning model to get image caption. We implement Explainable Artificial Intelligence (XAI) Image Captioning (Image to Text) using Shapley Additive explanations (SHAP). This work applies Cosine similarity by spaCy and Term Frequency Inverse Document Frequency (TF-IDF transform) to evaluate the sentence similarity. Our research work found that Azure Cognitive Services provides better descriptions for images compared to the Open-Source Image Captioning Model.
引用
收藏
页码:711 / 724
页数:14
相关论文
共 50 条
  • [1] EXplainable AI (XAI) approach to image captioning
    Han, Seung-Ho
    Kwon, Min-Su
    Choi, Ho-Jin
    JOURNAL OF ENGINEERING-JOE, 2020, 2020 (13): : 589 - 594
  • [2] Enhancing Malware Detection Through Machine Learning Using XAI with SHAP Framework
    Basheer, Nihala
    Pranggono, Bernardi
    Islam, Shareeful
    Papastergiou, Spyridon
    Mouratidis, Haralambos
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, PT I, AIAI 2024, 2024, 711 : 316 - 329
  • [3] Image Captioning using Deep Learning
    Jain, Yukti Sanjay
    Dhopeshwar, Tanisha
    Chadha, Supreet Kaur
    Pagire, Vrushali
    2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2021), 2021,
  • [4] Fast image captioning using LSTM
    Meng Han
    Wenyu Chen
    Alemu Dagmawi Moges
    Cluster Computing, 2019, 22 : 6143 - 6155
  • [5] Fast image captioning using LSTM
    Han, Meng
    Chen, Wenyu
    Moges, Alemu Dagmawi
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 3): : S6143 - S6155
  • [6] Image Captioning Using Deep Learning
    Adithya, Paluvayi Veera
    Kalidindi, Mourya Viswanadh
    Swaroop, Nallani Jyothi
    Vishwas, H. N.
    ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2023, PT III, 2024, 2092 : 42 - 58
  • [7] Improving traffic accident severity prediction using MobileNet transfer learning model and SHAP XAI technique
    Aboulola, Omar Ibrahim
    PLOS ONE, 2024, 19 (04):
  • [8] Text Augmentation Using BERT for Image Captioning
    Atliha, Viktar
    Sesok, Dmitrij
    APPLIED SCIENCES-BASEL, 2020, 10 (17):
  • [9] Improved Image Captioning Using GAN and ViT
    Rao, Vrushank D.
    Shashank, B. N.
    Bhattu, S. Nagesh
    COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT III, 2024, 2011 : 375 - 385
  • [10] An Image Captioning Approach Using Dynamical Attention
    Wang, Changzhi
    Gu, Xiaodong
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,