TLWSR: Weakly supervised real-world scene text image super-resolution using text label

被引:1
|
作者
Shi, Qin [1 ]
Zhu, Yu [1 ,3 ]
Fang, Chuantao [1 ]
Yang, Dawei [1 ,2 ]
机构
[1] East China Univ Sci & Technol, Sch Informat Sci & Engn, Shanghai 200237, Peoples R China
[2] Fudan Univ, Zhongshan Hosp, Dept Pulm & Crit Care Med, Shanghai, Peoples R China
[3] Shanghai Engn Res Ctr Internet Things Resp Med, Shanghai, Peoples R China
关键词
image processing; image resolution; unsupervised learning; NETWORK;
D O I
10.1049/ipr2.12827
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene text image super-resolution (STISR) has recently received considerable attention. Existing STISR methods are applicable to the situation that all the LR-HR pairs are available. However, in real-world scenarios, it is difficult and expensive to collect ground-truth HR labels and align them with LR images, and thus it is essential to find a way to implement weakly supervised learning. We investigate the STISR problem in the situation that only a subset of HR labels is available and design a weak supervision framework using coarse-grained text labels named TLWSR, which combines incomplete supervision and inexact supervision. Specifically, a lightweight text recognition network and connectionist temporal classification loss are used to guide the super-resolution of text images during training. Extensive experiments on the benchmark TextZoom demonstrate that TLWSR generates distinguishable text images and exceeds the fully supervised baseline TSRN in boosting text recognition accuracywith only 50% HR labels available. Meanwhile, TLWSR can be applied to different super-resolution backbones and significantly improves their performance. Furthermore, TLWSR shows good generalization capability to low-quality images on scene text recognition benchmarks, which verifies the effectiveness of this framework. To the authors' knowledge, this is the first work exploring the problem of STISR in weakly supervised scenarios.
引用
收藏
页码:2780 / 2790
页数:11
相关论文
共 50 条
  • [31] HiREN: Towards higher supervision quality for better scene text image super-resolution
    Zhao, Minyi
    Xu, Yi
    Li, Bingjia
    Wang, Jie
    Guan, Jihong
    Zhou, Shuigeng
    NEUROCOMPUTING, 2025, 623
  • [32] DCDM: Diffusion-Conditioned-Diffusion Model for Scene Text Image Super-Resolution
    Singh, Shrey
    Keserwani, Prateek
    Iwamura, Masakazu
    Roy, Partha Pratim
    COMPUTER VISION - ECCV 2024, PT XV, 2025, 15073 : 303 - 320
  • [33] Towards Robust Scene Text Image Super-resolution via Explicit Location Enhancement
    Guo, Hang
    Dai, Tao
    Meng, Guanghao
    Xia, Shu-Tao
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 782 - 790
  • [34] Multi-Task Learning for Scene Text Image Super-Resolution with Multiple Transformers
    Honda, Kosuke
    Kurematsu, Masaki
    Fujita, Hamido
    Selamat, Ali
    ELECTRONICS, 2022, 11 (22)
  • [35] Improving Scene Text Image Super-resolution via Dual Prior Modulation Network
    Zhu, Shipeng
    Zhao, Zuoyan
    Fang, Pengfei
    Xue, Hui
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 3843 - 3851
  • [36] More and Less: Enhancing Abundance and Refining Redundancy for Text-Prior-Guided Scene Text Image Super-Resolution
    Yang, Wei
    Luo, Yihong
    Ibrayim, Mayire
    Hamdulla, Askar
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT V, 2024, 14808 : 129 - 146
  • [37] Scene text image super-resolution using multi-scale convolutional neural network with skip connections
    Walha, Rim
    Aouini, Amal
    APPLIED INTELLIGENCE, 2024, : 5931 - 5943
  • [38] Real-World Image Super-Resolution by Exclusionary Dual-Learning
    Li, Hao
    Qin, Jinghui
    Yang, Zhijing
    Wei, Pengxu
    Pan, Jinshan
    Lin, Liang
    Shi, Yukai
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4752 - 4763
  • [39] Review of Research on Real-World Single Image Super-Resolution Reconstruction
    Zhang, Yanqing
    Ma, Jianhong
    Han, Ying
    Cao, Yangjie
    Li, Jie
    Yang, Cong
    Computer Engineering and Applications, 2023, 59 (08) : 28 - 40
  • [40] Structure and Texture Preserving Network for Real-World Image Super-Resolution
    Zhou, Bijun
    Yan, Huibin
    Wang, Shuoyao
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2173 - 2177