Gradient-Based Graph Attention for Scene Text Image Super-resolution

被引:0
|
作者
Zhu, Xiangyuan [1 ]
Guo, Kehua [1 ]
Fang, Hui [2 ]
Ding, Rui [1 ]
Wu, Zheng [1 ]
Schaefer, Gerald [2 ]
机构
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha, Peoples R China
[2] Loughborough Univ, Dept Comp Sci, Loughborough, Leics, England
基金
美国国家科学基金会;
关键词
NETWORK;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene text image super-resolution (STISR) in the wild has been shown to be beneficial to support improved vision-based text recognition from low-resolution imagery. An intuitive way to enhance STISR performance is to explore the well-structured and repetitive layout characteristics of text and exploit these as prior knowledge to guide model convergence. In this paper, we propose a novel gradient-based graph attention method to embed patch-wise text layout contexts into image feature representations for high-resolution text image reconstruction in an implicit and elegant manner. We introduce a non-local group-wise attention module to extract text features which are then enhanced by a cascaded channel attention module and a novel gradient-based graph attention module in order to obtain more effective representations by exploring correlations of regional and local patch-wise text layout properties. Extensive experiments on the benchmark TextZoom dataset convincingly demonstrate that our method supports excellent text recognition and outperforms the current state-of-the-art in STISR. The source code is available at https://github.com/xyzhu1/TSAN.
引用
收藏
页码:3861 / 3869
页数:9
相关论文
共 50 条
  • [21] GARDEN: Generative Prior Guided Network for Scene Text Image Super-Resolution
    Kong, Yuxin
    Ma, Weihong
    Jin, Lianwen
    Xue, Yang
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT V, 2024, 14808 : 196 - 214
  • [22] Self-supervised memory learning for scene text image super-resolution
    Guo, Kehua
    Zhu, Xiangyuan
    Schaefer, Gerald
    Ding, Rui
    Fang, Hui
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
  • [23] Advancing scene text image super-resolution via edge enhancement priors
    Li, Hongjun
    Li, Shangfeng
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (11) : 8241 - 8250
  • [24] Gradient-based edge preserving interpolation and its application to super-resolution
    Iwamoto, Yutaro
    Han, Xian-Hua
    Tateyama, Tomoko
    Ohashi, Motonori
    Sasatani, So
    Chen, Yen-Wei
    ELECTRONICS AND COMMUNICATIONS IN JAPAN, 2013, 96 (01) : 43 - 50
  • [25] Parametric loss-based super-resolution for scene text recognition
    Viriyavisuthisakul, Supatta
    Sanguansat, Parinya
    Racharak, Teeradaj
    Le Nguyen, Minh
    Kaothanthong, Natsuda
    Haruechaiyasak, Choochart
    Yamasaki, Toshihiko
    MACHINE VISION AND APPLICATIONS, 2023, 34 (04)
  • [26] Parametric loss-based super-resolution for scene text recognition
    Supatta Viriyavisuthisakul
    Parinya Sanguansat
    Teeradaj Racharak
    Minh Le Nguyen
    Natsuda Kaothanthong
    Choochart Haruechaiyasak
    Toshihiko Yamasaki
    Machine Vision and Applications, 2023, 34
  • [27] Spatially-adaptive Regularized Super-resolution Image Reconstruction Using A Gradient-based Saliency Measure
    Liu, Zhenyu
    Tian, Jing
    Chen, Li
    Wang, Yongtao
    2011 FIRST ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2011, : 86 - 89
  • [28] Scene Text Image Super-Resolution Reconstruction Based on Perceiving Multi-Domain Character Distance
    Huang, Jun-Yang
    Chen, Hong-Hui
    Wang, Jia-Bao
    Chen, Ping-Ping
    Lin, Zhi-Jian
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (07): : 2262 - 2270
  • [29] HiREN: Towards higher supervision quality for better scene text image super-resolution
    Zhao, Minyi
    Xu, Yi
    Li, Bingjia
    Wang, Jie
    Guan, Jihong
    Zhou, Shuigeng
    NEUROCOMPUTING, 2025, 623
  • [30] DCDM: Diffusion-Conditioned-Diffusion Model for Scene Text Image Super-Resolution
    Singh, Shrey
    Keserwani, Prateek
    Iwamura, Masakazu
    Roy, Partha Pratim
    COMPUTER VISION - ECCV 2024, PT XV, 2025, 15073 : 303 - 320