Batch-transformer for scene text image super-resolution

被引:1
|
作者
Sun, Yaqi [1 ,3 ]
Xie, Xiaolan [1 ,2 ]
Li, Zhi [1 ]
Yang, Kai [3 ]
机构
[1] Guangxi Normal Univ, Sch Comp Sci & Engn, Guilin, Guangxi, Peoples R China
[2] Guilin Univ Technol, Sch Informat Sci & Engn, Guilin, Guangxi, Peoples R China
[3] Hengyang Normal Univ, Sch Comp Sci & Technol, Hengyang, Peoples R China
来源
VISUAL COMPUTER | 2024年 / 40卷 / 10期
基金
中国国家自然科学基金;
关键词
Computer vision; Super-resolution; Scene text image; Batch-transformer; Loss function; NETWORK;
D O I
10.1007/s00371-024-03598-7
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Recognizing low-resolution text images is challenging as they often lose their detailed information, leading to poor recognition accuracy. Moreover, the traditional methods, based on deep convolutional neural networks (CNNs), are not effective enough for some low-resolution text images with dense characters. In this paper, a novel CNN-based batch-transformer network for scene text image super-resolution (BT-STISR) method is proposed to address this problem. In order to obtain the text information for text reconstruction, a pre-trained text prior module is employed to extract text information. Then a novel two pipeline batch-transformer-based module is proposed, leveraging self-attention and global attention mechanisms to exert the guidance of text prior to the text reconstruction process. Experimental study on a benchmark dataset TextZoom shows that the proposed method BT-STISR achieves the best state-of-the-art performance in terms of structural similarity (SSIM) and peak signal-to-noise ratio (PSNR) metrics compared to some latest methods.
引用
收藏
页码:7399 / 7409
页数:11
相关论文
共 50 条
  • [41] Text Image Super-Resolution Guided by Text Structure and Embedding Priors
    Huang, Cong
    Peng, Xiulian
    Liu, Dong
    Lu, Yan
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
  • [42] Parametric loss-based super-resolution for scene text recognition
    Viriyavisuthisakul, Supatta
    Sanguansat, Parinya
    Racharak, Teeradaj
    Le Nguyen, Minh
    Kaothanthong, Natsuda
    Haruechaiyasak, Choochart
    Yamasaki, Toshihiko
    MACHINE VISION AND APPLICATIONS, 2023, 34 (04)
  • [43] Parametric loss-based super-resolution for scene text recognition
    Supatta Viriyavisuthisakul
    Parinya Sanguansat
    Teeradaj Racharak
    Minh Le Nguyen
    Natsuda Kaothanthong
    Choochart Haruechaiyasak
    Toshihiko Yamasaki
    Machine Vision and Applications, 2023, 34
  • [44] Unveiling the Influence of Image Super-Resolution on Aerial Scene Classification
    Ibrahim, Mohamed Ramzy
    Benavente, Robert
    Ponsa, Daniel
    Lumbreras, Felipe
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2023, PT I, 2024, 14469 : 214 - 228
  • [45] Scene Text Image Super-Resolution Through Multi-Scale Interaction of Structural and Semantic Priors
    Zhu Z.
    Zhang L.
    Bai Y.
    Wang Y.
    Li P.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (07): : 1 - 11
  • [46] Scene Text Image Super-Resolution Reconstruction Based on Perceiving Multi-Domain Character Distance
    Huang, Jun-Yang
    Chen, Hong-Hui
    Wang, Jia-Bao
    Chen, Ping-Ping
    Lin, Zhi-Jian
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (07): : 2262 - 2270
  • [47] Navigating Style Variations in Scene Text Image Super-Resolution through Multi-Scale Perception
    Xu, Feifei
    Yu, Ziheng
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 229 - 238
  • [48] TextDiff: Enhancing scene text image super-resolution with mask-guided residual diffusion models
    Liu, Baolin
    Yang, Zongyuan
    Chiu, Chinwai
    Xiong, Yongping
    PATTERN RECOGNITION, 2025, 164
  • [49] Soft-edge-guided significant coordinate attention network for scene text image super-resolution
    Xi, Chenchen
    Zhang, Kaibing
    He, Xin
    Hu, Yanting
    Chen, Jinguang
    VISUAL COMPUTER, 2024, 40 (08): : 5393 - 5406
  • [50] Dtsr: detail-enhanced transformer for image super-resolution
    Huang, Xiaoqian
    Huang, Detian
    Huang, Qin
    Huang, Caixia
    Chen, Feiyang
    Xu, Zhengjun
    VISUAL COMPUTER, 2024, 40 (11): : 7667 - 7684