SCENE TEXT RECOGNITION MODELS EXPLAINABILITY USING LOCAL FEATURES

被引：1

作者：

Ty, Mark Vincent ^{[1
]}

Atienza, Rowel ^{[1
,2
]}

机构：

[1] Univ Philippines, Elect & Elect Engn Inst, Quezon City, Philippines

[2] Univ Philippines, AI Grad Program, Quezon City, Philippines

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2023年

关键词：

Computer Vision; Scene Text Recognition; Explainable AI;

D O I：

10.1109/ICIP49359.2023.10222406

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Explainable AI (XAI) is the study on how humans can be able to understand the cause of a model's prediction. In this work, the problem of interest is Scene Text Recognition (STR) Explainability, using XAI to understand the cause of an STR model's prediction. Recent XAI literatures on STR only provide a simple analysis and do not fully explore other XAI methods. In this study, we specifically work on data explainability frameworks, called attribution-based methods, that explains the important parts of an input data in deep learning models. However, integrating them into STR produces inconsistent and ineffective explanations, because they only explain the model in the global context. To solve this problem, we propose a new method, STRExp, to take into consideration the local explanations, i.e. the individual character prediction explanations. This is then benchmarked across different attribution-based methods on different STR datasets and evaluated across different STR models.

引用

页码：645 / 649

页数：5

共 50 条

[1] Portmanteauing Features for Scene Text Recognition
Tan, Yew Lee
Chew, Ernest Yu Kai
Kong, Adams Wai-Kin
Kim, Jung-Jae
Lim, Joo Hwee
2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1499 - 1505
[2] SCENE TEXT RECOGNITION USING SPARSE CODING BASED FEATURES
Zhang, Dong
Wang, Da-Han
Wang, Hanzi
2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 1066 - 1070
[3] Visual attention models for scene text recognition
Ghosh, Suman K.
Valveny, Ernest
Bagdanov, Andrew D.
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 943 - 948
[4] Text localization and recognition in complex scenes using local features
School of Information Security Engineering, Shanghai Jiao Tong University, China
不详
Lect. Notes Comput. Sci., 1600, PART 3 (121-132):
[5] Text Localization and Recognition in Complex Scenes Using Local Features
Zheng, Qi
Chen, Kai
Zhou, Yi
Gu, Congcong
Guan, Haibing
COMPUTER VISION - ACCV 2010, PT III, 2011, 6494 : 121 - +
[6] CATNet: Scene Text Recognition Guided by Concatenating Augmented Text Features
Zhang, Ziyin
Pan, Lemeng
Du, Lin
Li, Qingrui
Lu, Ning
DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT I, 2021, 12821 : 350 - 365
[7] Recognition of Multiple Characters in a Scene Image Using Arrangement of Local Features
Iwamura, Masakazu
Kobayashi, Takuya
Kise, Koichi
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 1409 - 1413
[8] Scene Text Recognition with Permuted Autoregressive Sequence Models
Bautista, Darwin
Atienza, Rowel
COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 178 - 196
[9] Comparative Analysis of Using Different Text Features, Models, and Methods in Text Author Recognition
Azimov, R. B.
CYBERNETICS AND SYSTEMS ANALYSIS, 2024, 60 (05) : 711 - 725
[10] Exploring Font-independent Features for Scene Text Recognition
Wang, Yizhi
Lian, Zhouhui
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1900 - 1908

← 1 2 3 4 5 →