SCENE TEXT RECOGNITION MODELS EXPLAINABILITY USING LOCAL FEATURES

被引:1
|
作者
Ty, Mark Vincent [1 ]
Atienza, Rowel [1 ,2 ]
机构
[1] Univ Philippines, Elect & Elect Engn Inst, Quezon City, Philippines
[2] Univ Philippines, AI Grad Program, Quezon City, Philippines
来源
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2023年
关键词
Computer Vision; Scene Text Recognition; Explainable AI;
D O I
10.1109/ICIP49359.2023.10222406
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Explainable AI (XAI) is the study on how humans can be able to understand the cause of a model's prediction. In this work, the problem of interest is Scene Text Recognition (STR) Explainability, using XAI to understand the cause of an STR model's prediction. Recent XAI literatures on STR only provide a simple analysis and do not fully explore other XAI methods. In this study, we specifically work on data explainability frameworks, called attribution-based methods, that explains the important parts of an input data in deep learning models. However, integrating them into STR produces inconsistent and ineffective explanations, because they only explain the model in the global context. To solve this problem, we propose a new method, STRExp, to take into consideration the local explanations, i.e. the individual character prediction explanations. This is then benchmarked across different attribution-based methods on different STR datasets and evaluated across different STR models.
引用
收藏
页码:645 / 649
页数:5
相关论文
共 50 条
  • [1] Portmanteauing Features for Scene Text Recognition
    Tan, Yew Lee
    Chew, Ernest Yu Kai
    Kong, Adams Wai-Kin
    Kim, Jung-Jae
    Lim, Joo Hwee
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1499 - 1505
  • [2] SCENE TEXT RECOGNITION USING SPARSE CODING BASED FEATURES
    Zhang, Dong
    Wang, Da-Han
    Wang, Hanzi
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 1066 - 1070
  • [3] Visual attention models for scene text recognition
    Ghosh, Suman K.
    Valveny, Ernest
    Bagdanov, Andrew D.
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 943 - 948
  • [4] Text localization and recognition in complex scenes using local features
    School of Information Security Engineering, Shanghai Jiao Tong University, China
    不详
    Lect. Notes Comput. Sci., 1600, PART 3 (121-132):
  • [5] Text Localization and Recognition in Complex Scenes Using Local Features
    Zheng, Qi
    Chen, Kai
    Zhou, Yi
    Gu, Congcong
    Guan, Haibing
    COMPUTER VISION - ACCV 2010, PT III, 2011, 6494 : 121 - +
  • [6] CATNet: Scene Text Recognition Guided by Concatenating Augmented Text Features
    Zhang, Ziyin
    Pan, Lemeng
    Du, Lin
    Li, Qingrui
    Lu, Ning
    DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT I, 2021, 12821 : 350 - 365
  • [7] Recognition of Multiple Characters in a Scene Image Using Arrangement of Local Features
    Iwamura, Masakazu
    Kobayashi, Takuya
    Kise, Koichi
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 1409 - 1413
  • [8] Scene Text Recognition with Permuted Autoregressive Sequence Models
    Bautista, Darwin
    Atienza, Rowel
    COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 178 - 196
  • [9] Comparative Analysis of Using Different Text Features, Models, and Methods in Text Author Recognition
    Azimov, R. B.
    CYBERNETICS AND SYSTEMS ANALYSIS, 2024, 60 (05) : 711 - 725
  • [10] Exploring Font-independent Features for Scene Text Recognition
    Wang, Yizhi
    Lian, Zhouhui
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1900 - 1908