SCENE TEXT RECOGNITION MODELS EXPLAINABILITY USING LOCAL FEATURES

被引:1
|
作者
Ty, Mark Vincent [1 ]
Atienza, Rowel [1 ,2 ]
机构
[1] Univ Philippines, Elect & Elect Engn Inst, Quezon City, Philippines
[2] Univ Philippines, AI Grad Program, Quezon City, Philippines
来源
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2023年
关键词
Computer Vision; Scene Text Recognition; Explainable AI;
D O I
10.1109/ICIP49359.2023.10222406
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Explainable AI (XAI) is the study on how humans can be able to understand the cause of a model's prediction. In this work, the problem of interest is Scene Text Recognition (STR) Explainability, using XAI to understand the cause of an STR model's prediction. Recent XAI literatures on STR only provide a simple analysis and do not fully explore other XAI methods. In this study, we specifically work on data explainability frameworks, called attribution-based methods, that explains the important parts of an input data in deep learning models. However, integrating them into STR produces inconsistent and ineffective explanations, because they only explain the model in the global context. To solve this problem, we propose a new method, STRExp, to take into consideration the local explanations, i.e. the individual character prediction explanations. This is then benchmarked across different attribution-based methods on different STR datasets and evaluated across different STR models.
引用
收藏
页码:645 / 649
页数:5
相关论文
共 50 条
  • [41] Arabic and Latin Scene Text Recognition by Combining Handcrafted and Deep-Learned Features
    Tounsi, Maroua
    Moalla, Ikram
    Pal, Umapada
    Alimi, Adel M.
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (08) : 9727 - 9740
  • [42] Visual place recognition from end-to-end semantic scene text features
    Raisi, Zobeir
    Zelek, John
    FRONTIERS IN ROBOTICS AND AI, 2024, 11
  • [43] Arabic and Latin Scene Text Recognition by Combining Handcrafted and Deep-Learned Features
    Maroua Tounsi
    Ikram Moalla
    Umapada Pal
    Adel M. Alimi
    Arabian Journal for Science and Engineering, 2022, 47 : 9727 - 9740
  • [44] MASTER: Multi-aspect non-local network for scene text recognition
    Lu, Ning
    Yu, Wenwen
    Qi, Xianbiao
    Chen, Yihao
    Gong, Ping
    Xiao, Rong
    Bai, Xiang
    PATTERN RECOGNITION, 2021, 117
  • [45] Text Detection and Recognition from Scene Images using MSER and CNN
    Choudhary, Savita
    Singh, Nikhil Kumar
    Chichadwani, Sanjay
    2018 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRONICS, COMPUTERS AND COMMUNICATIONS (ICAECC), 2018,
  • [46] Scene text recognition using residual convolutional recurrent neural network
    Lei, Zhengchao
    Zhao, Sanyuan
    Song, Hongmei
    Shen, Jianbing
    MACHINE VISION AND APPLICATIONS, 2018, 29 (05) : 861 - 871
  • [47] A Flash Flood Categorization System using Scene-Text Recognition
    Basnyat, Bipendra
    Roy, Nirmalya
    Gangopadhyay, Aryya
    2018 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING (SMARTCOMP 2018), 2018, : 147 - 154
  • [48] Scene text recognition using residual convolutional recurrent neural network
    Zhengchao Lei
    Sanyuan Zhao
    Hongmei Song
    Jianbing Shen
    Machine Vision and Applications, 2018, 29 : 861 - 871
  • [49] Chinese Text Detection and Recognition in Natural Scene Using HOG and SVM
    Yu, Boran
    Wan, Hongjie
    2016 6TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY FOR MANUFACTURING SYSTEMS (ITMS 2016), 2016, : 148 - 152
  • [50] Scene Text Recognition Using Similarity and a Lexicon with Sparse Belief Propagation
    Weinman, Jerod J.
    Learned-Miller, Erik
    Hanson, Allen R.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (10) : 1733 - 1746