Scene Text Image Super-Resolution Reconstruction Based on Perceiving Multi-Domain Character Distance

被引:0
|
作者
Huang, Jun-Yang [1 ]
Chen, Hong-Hui [1 ]
Wang, Jia-Bao [1 ]
Chen, Ping-Ping [1 ]
Lin, Zhi-Jian [1 ]
机构
[1] College of Physics and Information Engineering, Fuzhou University, Fujian, Fuzhou,350108, China
来源
基金
中国国家自然科学基金;
关键词
D O I
10.12263/DZXB.20240090
中图分类号
学科分类号
摘要
Scene text image super-resolution (STISR) aims to enhance the resolution and legibility of text in low-resolution images. In cases of spatial deformation or low-resolution text images, the lack of details in text regions and the difficulty in aligning semantic cues and visual features with character position make it difficult to recognize text effectively. In order to address these challenges, this paper proposes a perceiving multi-domain character distance for scene text image super-resolution method (PMDC), which improves the image text region and edge texture details. Firsly, the visual and semantic features are extracted by using the asymmetric convolution module along with the semantic prior module. Then the enhanced position coding is obtained by the character distance perception module to perceive the distance change and semantic similarity between characters. Finally, the guiding cues and visual features are combined to restructure the pixels and generate a super-resolution text image. In comparison to TATT, experimental results from the public dataset TextZoom showed an increase of 0.11 dB in the fidelity of the peak signal-to-noise ratio index. This improvement effectively enhances the clarity of the text area and the detailed edge texture. Additionally, the recognition accuracy was improved by 1.4%, which effectively enhances the readability of the text image. © 2024 Chinese Institute of Electronics. All rights reserved.
引用
收藏
页码:2262 / 2270
相关论文
共 50 条
  • [21] MAP-based single-frame super-resolution reconstruction for character image
    Li, Zhan
    Chen, Qing-Liang
    Peng, Qing-Yu
    Zhang, Qing-Feng
    Li, Wei-Xiang
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2015, 43 (01): : 191 - 197
  • [22] Scene Text Image Super-Resolution Through Multi-Scale Interaction of Structural and Semantic Priors
    Zhu Z.
    Zhang L.
    Bai Y.
    Wang Y.
    Li P.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (07): : 1 - 11
  • [23] Navigating Style Variations in Scene Text Image Super-Resolution through Multi-Scale Perception
    Xu, Feifei
    Yu, Ziheng
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 229 - 238
  • [24] GARDEN: Generative Prior Guided Network for Scene Text Image Super-Resolution
    Kong, Yuxin
    Ma, Weihong
    Jin, Lianwen
    Xue, Yang
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT V, 2024, 14808 : 196 - 214
  • [25] Scene Text Image Super-Resolution via Parallelly Contextual Attention Network
    Zhao, Cairong
    Feng, Shuyang
    Zhao, Brian Nlong
    Ding, Zhijun
    Wu, Jun
    Shen, Fuming
    Shen, Heng Tao
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2908 - 2917
  • [26] Self-supervised memory learning for scene text image super-resolution
    Guo, Kehua
    Zhu, Xiangyuan
    Schaefer, Gerald
    Ding, Rui
    Fang, Hui
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
  • [27] Advancing scene text image super-resolution via edge enhancement priors
    Li, Hongjun
    Li, Shangfeng
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (11) : 8241 - 8250
  • [28] Wavelet domain image super-resolution reconstruction based on image pyramid and cycle-spinning
    Liu, H. C.
    Feng, Y.
    Sun, G. Y.
    4TH INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION SCIENCE AND TECHNOLOGY (ISIST' 2006), 2006, 48 : 417 - 421
  • [29] SUPER-RESOLUTION RECONSTRUCTION OF IMAGE BASED ON PRIOR IMAGE CONSTRAINT
    Tang Bin-Bing
    Wang Zheng-Ming
    JOURNAL OF INFRARED AND MILLIMETER WAVES, 2008, 27 (05) : 389 - 392
  • [30] Image super-resolution reconstruction based on implicit image functions
    Lin, Hai
    Yang, JunJie
    IET IMAGE PROCESSING, 2024, 18 (10) : 2690 - 2701