Gradient-Based Graph Attention for Scene Text Image Super-resolution

被引:0
|
作者
Zhu, Xiangyuan [1 ]
Guo, Kehua [1 ]
Fang, Hui [2 ]
Ding, Rui [1 ]
Wu, Zheng [1 ]
Schaefer, Gerald [2 ]
机构
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha, Peoples R China
[2] Loughborough Univ, Dept Comp Sci, Loughborough, Leics, England
基金
美国国家科学基金会;
关键词
NETWORK;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene text image super-resolution (STISR) in the wild has been shown to be beneficial to support improved vision-based text recognition from low-resolution imagery. An intuitive way to enhance STISR performance is to explore the well-structured and repetitive layout characteristics of text and exploit these as prior knowledge to guide model convergence. In this paper, we propose a novel gradient-based graph attention method to embed patch-wise text layout contexts into image feature representations for high-resolution text image reconstruction in an implicit and elegant manner. We introduce a non-local group-wise attention module to extract text features which are then enhanced by a cascaded channel attention module and a novel gradient-based graph attention module in order to obtain more effective representations by exploring correlations of regional and local patch-wise text layout properties. Extensive experiments on the benchmark TextZoom dataset convincingly demonstrate that our method supports excellent text recognition and outperforms the current state-of-the-art in STISR. The source code is available at https://github.com/xyzhu1/TSAN.
引用
收藏
页码:3861 / 3869
页数:9
相关论文
共 50 条
  • [1] A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution
    Ma, Jianqi
    Liang, Zhetong
    Zhang, Lei
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5901 - 5910
  • [2] Gradient-based adaptive interpolation in super-resolution image restoration
    Chu, Jinyu
    Liu, Ju
    Qiao, Jianping
    Wang, Xiaoling
    Li, Yujun
    ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 1027 - +
  • [3] Scene Text Image Super-Resolution via Parallelly Contextual Attention Network
    Zhao, Cairong
    Feng, Shuyang
    Zhao, Brian Nlong
    Ding, Zhijun
    Wu, Jun
    Shen, Fuming
    Shen, Heng Tao
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2908 - 2917
  • [4] Real Scene Text Image Super-Resolution Based on Multi-Scale and Attention Fusion
    Lu, Xinhua
    Wei, Haihai
    Ma, Li
    Xue, Qingji
    Fu, Yonghui
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2023, 19 (04): : 427 - 438
  • [5] Text Prior Guided Scene Text Image Super-Resolution
    Ma, Jianqi
    Guo, Shi
    Zhang, Lei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1341 - 1353
  • [6] Scene Text Telescope: Text-Focused Scene Image Super-Resolution
    Chen, Jingye
    Li, Bin
    Xue, Xiangyang
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12021 - 12030
  • [7] GRADIENT-BASED ADAPTIVE IMAGE SUPER RESOLUTION
    Junaidi, Achmad
    Lin, Chao-Hung
    Tseng, Yi-Hsing
    Chang, Li-Hsueh
    Peng, Shin-Chia
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 2774 - 2777
  • [8] Pixel Adapter: A Graph-Based Post-Processing Approach for Scene Text Image Super-Resolution
    Zhang, Wenyu
    Deng, Xin
    Jia, Baojun
    Yu, Xingtong
    Chen, Yifan
    Ma, Jin
    Ding, Qing
    Zhang, Xinming
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2168 - 2179
  • [9] Pixel Adapter: A Graph-Based Post-Processing Approach for Scene Text Image Super-Resolution
    Zhang, Wenyu
    Deng, Xin
    Jia, Baojun
    Yu, Xingtong
    Chen, Yifan
    Ma, Jin
    Ding, Qing
    Zhang, Xinming
    MM 2023 - Proceedings of the 31st ACM International Conference on Multimedia, 2023, : 2168 - 2179
  • [10] Batch-transformer for scene text image super-resolution
    Sun, Yaqi
    Xie, Xiaolan
    Li, Zhi
    Yang, Kai
    VISUAL COMPUTER, 2024, 40 (10): : 7399 - 7409