Scene Text Image Super-Resolution Reconstruction Based on Perceiving Multi-Domain Character Distance

被引：0

作者：

Huang, Jun-Yang ^{[1
]}

Chen, Hong-Hui ^{[1
]}

Wang, Jia-Bao ^{[1
]}

Chen, Ping-Ping ^{[1
]}

Lin, Zhi-Jian ^{[1
]}

机构：

[1] College of Physics and Information Engineering, Fuzhou University, Fujian, Fuzhou,350108, China

来源：

Tien Tzu Hsueh Pao/Acta Electronica Sinica | 2024年 / 52卷 / 07期

基金：

中国国家自然科学基金;

关键词：

D O I：

10.12263/DZXB.20240090

中图分类号：

学科分类号：

摘要：

Scene text image super-resolution (STISR) aims to enhance the resolution and legibility of text in low-resolution images. In cases of spatial deformation or low-resolution text images, the lack of details in text regions and the difficulty in aligning semantic cues and visual features with character position make it difficult to recognize text effectively. In order to address these challenges, this paper proposes a perceiving multi-domain character distance for scene text image super-resolution method (PMDC), which improves the image text region and edge texture details. Firsly, the visual and semantic features are extracted by using the asymmetric convolution module along with the semantic prior module. Then the enhanced position coding is obtained by the character distance perception module to perceive the distance change and semantic similarity between characters. Finally, the guiding cues and visual features are combined to restructure the pixels and generate a super-resolution text image. In comparison to TATT, experimental results from the public dataset TextZoom showed an increase of 0.11 dB in the fidelity of the peak signal-to-noise ratio index. This improvement effectively enhances the clarity of the text area and the detailed edge texture. Additionally, the recognition accuracy was improved by 1.4%, which effectively enhances the readability of the text image. © 2024 Chinese Institute of Electronics. All rights reserved.

引用

页码：2262 / 2270

共 50 条

[21] MAP-based single-frame super-resolution reconstruction for character image
Li, Zhan
Chen, Qing-Liang
Peng, Qing-Yu
Zhang, Qing-Feng
Li, Wei-Xiang
Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2015, 43 (01): : 191 - 197
[22] Scene Text Image Super-Resolution Through Multi-Scale Interaction of Structural and Semantic Priors
Zhu Z.
Zhang L.
Bai Y.
Wang Y.
Li P.
IEEE Transactions on Artificial Intelligence, 2024, 5 (07): : 1 - 11
[23] Navigating Style Variations in Scene Text Image Super-Resolution through Multi-Scale Perception
Xu, Feifei
Yu, Ziheng
PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 229 - 238
[24] GARDEN: Generative Prior Guided Network for Scene Text Image Super-Resolution
Kong, Yuxin
Ma, Weihong
Jin, Lianwen
Xue, Yang
DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT V, 2024, 14808 : 196 - 214
[25] Scene Text Image Super-Resolution via Parallelly Contextual Attention Network
Zhao, Cairong
Feng, Shuyang
Zhao, Brian Nlong
Ding, Zhijun
Wu, Jun
Shen, Fuming
Shen, Heng Tao
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2908 - 2917
[26] Self-supervised memory learning for scene text image super-resolution
Guo, Kehua
Zhu, Xiangyuan
Schaefer, Gerald
Ding, Rui
Fang, Hui
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
[27] Advancing scene text image super-resolution via edge enhancement priors
Li, Hongjun
Li, Shangfeng
SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (11) : 8241 - 8250
[28] Wavelet domain image super-resolution reconstruction based on image pyramid and cycle-spinning
Liu, H. C.
Feng, Y.
Sun, G. Y.
4TH INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION SCIENCE AND TECHNOLOGY (ISIST' 2006), 2006, 48 : 417 - 421
[29] SUPER-RESOLUTION RECONSTRUCTION OF IMAGE BASED ON PRIOR IMAGE CONSTRAINT
Tang Bin-Bing
Wang Zheng-Ming
JOURNAL OF INFRARED AND MILLIMETER WAVES, 2008, 27 (05) : 389 - 392
[30] Image super-resolution reconstruction based on implicit image functions
Lin, Hai
Yang, JunJie
IET IMAGE PROCESSING, 2024, 18 (10) : 2690 - 2701

← 1 2 3 4 5 →