Scene Text Image Super-Resolution Reconstruction Based on Perceiving Multi-Domain Character Distance

被引:0
|
作者
Huang, Jun-Yang [1 ]
Chen, Hong-Hui [1 ]
Wang, Jia-Bao [1 ]
Chen, Ping-Ping [1 ]
Lin, Zhi-Jian [1 ]
机构
[1] College of Physics and Information Engineering, Fuzhou University, Fujian, Fuzhou,350108, China
来源
基金
中国国家自然科学基金;
关键词
D O I
10.12263/DZXB.20240090
中图分类号
学科分类号
摘要
Scene text image super-resolution (STISR) aims to enhance the resolution and legibility of text in low-resolution images. In cases of spatial deformation or low-resolution text images, the lack of details in text regions and the difficulty in aligning semantic cues and visual features with character position make it difficult to recognize text effectively. In order to address these challenges, this paper proposes a perceiving multi-domain character distance for scene text image super-resolution method (PMDC), which improves the image text region and edge texture details. Firsly, the visual and semantic features are extracted by using the asymmetric convolution module along with the semantic prior module. Then the enhanced position coding is obtained by the character distance perception module to perceive the distance change and semantic similarity between characters. Finally, the guiding cues and visual features are combined to restructure the pixels and generate a super-resolution text image. In comparison to TATT, experimental results from the public dataset TextZoom showed an increase of 0.11 dB in the fidelity of the peak signal-to-noise ratio index. This improvement effectively enhances the clarity of the text area and the detailed edge texture. Additionally, the recognition accuracy was improved by 1.4%, which effectively enhances the readability of the text image. © 2024 Chinese Institute of Electronics. All rights reserved.
引用
收藏
页码:2262 / 2270
相关论文
共 50 条
  • [1] CDistNet: Perceiving Multi-domain Character Distance for Robust Text Recognition
    Zheng, Tianlun
    Chen, Zhineng
    Fang, Shancheng
    Xie, Hongtao
    Jiang, Yu-Gang
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (02) : 300 - 318
  • [2] CDistNet: Perceiving Multi-domain Character Distance for Robust Text Recognition
    Tianlun Zheng
    Zhineng Chen
    Shancheng Fang
    Hongtao Xie
    Yu-Gang Jiang
    International Journal of Computer Vision, 2024, 132 : 300 - 318
  • [3] Perceiving Multiple Representations for scene text image super-resolution guided by text recognizer
    Shi, Qin
    Zhu, Yu
    Liu, Yatong
    Ye, Jiongyao
    Yang, Dawei
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 124
  • [4] Image super-resolution reconstruction based on wavelet domain
    Dong Ben-zhi
    Yu Ming-cong
    Zhao Peng
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2021, 36 (02) : 317 - 326
  • [5] Text Prior Guided Scene Text Image Super-Resolution
    Ma, Jianqi
    Guo, Shi
    Zhang, Lei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1341 - 1353
  • [6] Hyperspectral image super-resolution via multi-domain feature learning
    Li, Qiang
    Yuan, Yuan
    Wang, Qi
    NEUROCOMPUTING, 2022, 472 : 85 - 94
  • [7] Scene Text Telescope: Text-Focused Scene Image Super-Resolution
    Chen, Jingye
    Li, Bin
    Xue, Xiangyang
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12021 - 12030
  • [8] Real Scene Text Image Super-Resolution Based on Multi-Scale and Attention Fusion
    Lu, Xinhua
    Wei, Haihai
    Ma, Li
    Xue, Qingji
    Fu, Yonghui
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2023, 19 (04): : 427 - 438
  • [9] Batch-transformer for scene text image super-resolution
    Sun, Yaqi
    Xie, Xiaolan
    Li, Zhi
    Yang, Kai
    VISUAL COMPUTER, 2024, 40 (10): : 7399 - 7409
  • [10] Gradient-Based Graph Attention for Scene Text Image Super-resolution
    Zhu, Xiangyuan
    Guo, Kehua
    Fang, Hui
    Ding, Rui
    Wu, Zheng
    Schaefer, Gerald
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 3861 - 3869