Text Prior Guided Scene Text Image Super-Resolution

Cited by: 30

Authors:

Ma, Jianqi [1]

Guo, Shi [1]

Zhang, Lei [1]

Affiliations:

[1] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2023 / Vol. 32

Keywords:

Scene text image super-resolution; super-resolution; text prior; NETWORK; RECOGNITION;

DOI:

10.1109/TIP.2023.3237002

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Scene text image super-resolution (STISR) aims to improve the resolution and visual quality of low-resolution (LR) scene text images while simultaneously boosting text recognition performance. However, most existing STISR methods treat text images as natural scene images, ignoring the categorical information of text. In this paper, we make an attempt to embed a text recognition prior into the STISR model. Specifically, we adopt the predicted character recognition probability sequence as the text prior, which can be obtained conveniently from a text recognition model. The text prior provides categorical guidance for recovering high-resolution (HR) text images. In turn, the reconstructed HR image can refine the text prior. Finally, we present a multi-stage text prior guided super-resolution (TPGSR) framework for STISR. Our experiments on the benchmark TextZoom dataset show that TPGSR not only effectively improves the visual quality of scene text images but also significantly improves text recognition accuracy over existing STISR methods. Our model trained on TextZoom also demonstrates a certain generalization capability on LR images from other datasets. The source code of our work is available

Pages: 1341 - 1353

Page count: 13

50 items in total

[21] Advancing scene text image super-resolution via edge enhancement priors
Li, Hongjun
Li, Shangfeng
SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (11) : 8241 - 8250
[22] Super-resolution enhancement of text image sequences
Capel, D
Zisserman, A
15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000 : 600 - 605
[23] TLWSR: Weakly supervised real-world scene text image super-resolution using text label
Shi, Qin
Zhu, Yu
Fang, Chuantao
Yang, Dawei
IET IMAGE PROCESSING, 2023, 17 (09) : 2780 - 2790
[24] Bayesian super-resolution of text in video with a text-specific bimodal prior
Donaldson, K
Myers, GK
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005 : 1188 - 1195
[25] Bayesian super-resolution of text in video with a text-specific bimodal prior
Donaldson K.
Myers G.K.
International Journal of Document Analysis and Recognition (IJDAR), 2005, 7 (2-3) : 159 - 167
[26] HiREN: Towards higher supervision quality for better scene text image super-resolution
Zhao, Minyi
Xu, Yi
Li, Bingjia
Wang, Jie
Guan, Jihong
Zhou, Shuigeng
NEUROCOMPUTING, 2025, 623
[27] DCDM: Diffusion-Conditioned-Diffusion Model for Scene Text Image Super-Resolution
Singh, Shrey
Keserwani, Prateek
Iwamura, Masakazu
Roy, Partha Pratim
COMPUTER VISION - ECCV 2024, PT XV, 2025, 15073 : 303 - 320
[28] Towards Robust Scene Text Image Super-resolution via Explicit Location Enhancement
Guo, Hang
Dai, Tao
Meng, Guanghao
Xia, Shu-Tao
PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023 : 782 - 790
[29] Multi-Task Learning for Scene Text Image Super-Resolution with Multiple Transformers
Honda, Kosuke
Kurematsu, Masaki
Fujita, Hamido
Selamat, Ali
ELECTRONICS, 2022, 11 (22)
[30] Scene text image super-resolution via textual reasoning and multiscale cross-convolution
Lan Yu
Xiaojie Li
Qi Yu
Guangju Li
Dehu Jin
Meng Qi
Applied Intelligence, 2024, 54 : 1997 - 2008