TSRGAN: Real-world text image super-resolution based on adversarial learning and triplet attention

被引:19
|
作者
Fang, Chuantao [1 ]
Zhu, Yu [1 ]
Liao, Lei [1 ]
Ling, Xiaofeng [1 ]
机构
[1] East China Univ Sci & Technol, Sch Informat Sci & Engn, Shanghai 200237, Peoples R China
基金
上海市自然科学基金;
关键词
Text image super-resolution; Adversarial learning; Triplet attention; Wavelet loss; Scene text recognition; NEURAL-NETWORK; SCENE;
D O I
10.1016/j.neucom.2021.05.060
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The text in a low-resolution (LR) image is usually hard to read. Super-resolution (SR) is an intuitive solution to this issue. Existing single image super-resolution (SISR) models are mainly trained on synthetic datasets whose LR images are obtained by performing bicubic interpolation or gaussian blur on high-resolution (HR) images. However, these models can hardly generalize to practical scenarios because real-world LR images are more difficult to super-resolve. The newly proposed TextZoom dataset is the first dataset for real-world text image super-resolution. We propose a new model termed TSRGAN trained on this dataset. First, a discriminator is designed to prevent the SR network from generating over-smoothed images. Second, we introduce triplet attention into the SR network for better representational ability. Moreover, besides L-2 loss and adversarial loss, wavelet loss is incorporated to help reconstruct sharper character edges. Since TextZoom provides text labels, the recognition accuracy of scene text recognition (STR) model can be used to evaluate the quality of SR images. It can reflect the performance of text image SR models better than traditional SR evaluation metrics such as PSNR and SSIM. Comprehensive experiments show the superiority of our TSRGAN. Compared with the state-of-the-art method, the proposed TSRGAN improves the average recognition accuracy of ASTER, MORAN and CRNN by 0.8%, 1.5% and 3.2% on TextZoom respectively. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:88 / 96
页数:9
相关论文
共 50 条
  • [1] Robust Real-World Image Super-Resolution against Adversarial Attacks
    Yue, Jiutao
    Li, Haofeng
    Wei, Pengxu
    Li, Guanbin
    Lin, Liang
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5148 - 5157
  • [2] Dynamic degradation learning for real-world image super-resolution
    Chunxiao Fan
    Qiong Wu
    Xiang Ye
    Signal, Image and Video Processing, 2023, 17 : 315 - 322
  • [3] Dynamic degradation learning for real-world image super-resolution
    Fan, Chunxiao
    Wu, Qiong
    Ye, Xiang
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (02) : 315 - 322
  • [4] Real-World Thermal Image Super-Resolution
    Allahham, Moaaz
    Aakerberg, Andreas
    Nasrollahi, Kamal
    Moeslund, Thomas B.
    ADVANCES IN VISUAL COMPUTING (ISVC 2021), PT I, 2021, 13017 : 3 - 14
  • [5] Unsupervised Learning for Real-World Super-Resolution
    Lugmayr, Andreas
    Danelljan, Martin
    Timofte, Radu
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3408 - 3416
  • [6] Real-World Image Super-Resolution by Exclusionary Dual-Learning
    Li, Hao
    Qin, Jinghui
    Yang, Zhijing
    Wei, Pengxu
    Pan, Jinshan
    Lin, Liang
    Shi, Yukai
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4752 - 4763
  • [7] Real-World Image Super-Resolution as Multi-Task Learning
    Zhang, Wenlong
    Li, Xiaohui
    Shi, Guangyuan
    Chen, Xiangyu
    Zhang, Xiaoyun
    Qiao, Yu
    Wu, Xiao-Ming
    Dong, Chao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [8] Generalized Real-World Super-Resolution through Adversarial Robustness
    Castillo, Angela
    Escobar, Maria
    Perez, Juan C.
    Romero, Andres
    Timofte, Radu
    Van Gool, Luc
    Arbelaez, Pablo
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 1855 - 1865
  • [9] Multiscale generative adversarial network for real-world super-resolution
    Sun, Ying
    Yang, Zhiwen
    Tao, Bo
    Jiang, Guozhang
    Hao, Zhiqiang
    Chen, Baojia
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (21):
  • [10] Dual Adversarial Adaptation for Cross-Device Real-World Image Super-Resolution
    Xu, Xiaoqian
    Wei, Pengxu
    Chen, Weikai
    Liu, Yang
    Mao, Mingzhi
    Lin, Liang
    Li, Guanbin
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5657 - 5666