TSRGAN: Real-world text image super-resolution based on adversarial learning and triplet attention

Cited by: 19
Authors
Fang, Chuantao [1]
Zhu, Yu [1]
Liao, Lei [1]
Ling, Xiaofeng [1]
Affiliations
[1] East China Univ Sci & Technol, Sch Informat Sci & Engn, Shanghai 200237, Peoples R China
Funding
Natural Science Foundation of Shanghai;
Keywords
Text image super-resolution; Adversarial learning; Triplet attention; Wavelet loss; Scene text recognition; NEURAL-NETWORK; SCENE;
DOI
10.1016/j.neucom.2021.05.060
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The text in a low-resolution (LR) image is usually hard to read. Super-resolution (SR) is an intuitive solution to this issue. Existing single image super-resolution (SISR) models are mainly trained on synthetic datasets whose LR images are obtained by applying bicubic interpolation or Gaussian blur to high-resolution (HR) images. However, these models can hardly generalize to practical scenarios because real-world LR images are more difficult to super-resolve. The newly proposed TextZoom dataset is the first dataset for real-world text image super-resolution. We propose a new model, termed TSRGAN, trained on this dataset. First, a discriminator is designed to prevent the SR network from generating over-smoothed images. Second, we introduce triplet attention into the SR network for better representational ability. Moreover, besides L2 loss and adversarial loss, a wavelet loss is incorporated to help reconstruct sharper character edges. Since TextZoom provides text labels, the recognition accuracy of a scene text recognition (STR) model can be used to evaluate the quality of SR images. This accuracy reflects the performance of text image SR models better than traditional SR evaluation metrics such as PSNR and SSIM. Comprehensive experiments show the superiority of our TSRGAN. Compared with the state-of-the-art method, the proposed TSRGAN improves the average recognition accuracy of ASTER, MORAN and CRNN by 0.8%, 1.5% and 3.2% on TextZoom, respectively. © 2021 Elsevier B.V. All rights reserved.
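The abstract names two ideas that a small example can make concrete: a wavelet loss that penalizes errors in high-frequency detail, and recognition accuracy as the evaluation metric. The following is a minimal sketch under stated assumptions, not the authors' implementation. The abstract does not specify the wavelet family, decomposition depth, or sub-band weighting, so a single-level Haar transform with an unweighted mean squared error over the three detail sub-bands is assumed here. The label normalization (lowercase letters and digits only) is likewise an assumption, and all function names are hypothetical.

```python
import string


def haar_dwt2(img):
    """Single-level 2D Haar transform of a grayscale image (list of rows).

    Returns four sub-bands (LL, LH, HL, HH), each half the input size.
    Assumes the image has even height and width.
    """
    ll, lh, hl, hh = [], [], [], []
    for i in range(0, len(img), 2):
        row_ll, row_lh, row_hl, row_hh = [], [], [], []
        for j in range(0, len(img[0]), 2):
            a, b = img[i][j], img[i][j + 1]
            c, d = img[i + 1][j], img[i + 1][j + 1]
            row_ll.append((a + b + c + d) / 2)  # local average
            row_lh.append((a - b + c - d) / 2)  # left-right difference
            row_hl.append((a + b - c - d) / 2)  # top-bottom difference
            row_hh.append((a - b - c + d) / 2)  # diagonal difference
        ll.append(row_ll)
        lh.append(row_lh)
        hl.append(row_hl)
        hh.append(row_hh)
    return ll, lh, hl, hh


def wavelet_loss(sr, hr):
    """Mean squared error over the three detail sub-bands (assumed form)."""
    total, count = 0.0, 0
    # Skip index 0 (LL) so that only edge-like, high-frequency content counts.
    for band_sr, band_hr in zip(haar_dwt2(sr)[1:], haar_dwt2(hr)[1:]):
        for row_sr, row_hr in zip(band_sr, band_hr):
            for x, y in zip(row_sr, row_hr):
                total += (x - y) ** 2
                count += 1
    return total / count


# Assumed evaluation alphabet: digits and lowercase letters.
ALLOWED = set(string.digits + string.ascii_lowercase)


def normalize(text):
    """Lowercase the text and drop characters outside the assumed alphabet."""
    return "".join(ch for ch in text.lower() if ch in ALLOWED)


def recognition_accuracy(predictions, labels):
    """Fraction of images whose recognized text matches the label exactly."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if not labels:
        return 0.0
    correct = sum(normalize(p) == normalize(t)
                  for p, t in zip(predictions, labels))
    return correct / len(labels)
```

In this sketch, `recognition_accuracy` would be given the strings that a recognizer such as ASTER, MORAN or CRNN outputs on the super-resolved images, together with the TextZoom text labels. Because a word counts as correct only when every character matches, the metric is sensitive to exactly the character-edge detail that PSNR and SSIM tend to average away.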
Pages: 88-96
Page count: 9