TSRGAN: Real-world text image super-resolution based on adversarial learning and triplet attention

被引：19

作者：

Fang, Chuantao ^{[1
]}

Zhu, Yu ^{[1
]}

Liao, Lei ^{[1
]}

Ling, Xiaofeng ^{[1
]}

机构：

[1] East China Univ Sci & Technol, Sch Informat Sci & Engn, Shanghai 200237, Peoples R China

来源：

NEUROCOMPUTING | 2021年 / 455卷

基金：

上海市自然科学基金;

关键词：

Text image super-resolution; Adversarial learning; Triplet attention; Wavelet loss; Scene text recognition; NEURAL-NETWORK; SCENE;

D O I：

10.1016/j.neucom.2021.05.060

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The text in a low-resolution (LR) image is usually hard to read. Super-resolution (SR) is an intuitive solution to this issue. Existing single image super-resolution (SISR) models are mainly trained on synthetic datasets whose LR images are obtained by performing bicubic interpolation or gaussian blur on high-resolution (HR) images. However, these models can hardly generalize to practical scenarios because real-world LR images are more difficult to super-resolve. The newly proposed TextZoom dataset is the first dataset for real-world text image super-resolution. We propose a new model termed TSRGAN trained on this dataset. First, a discriminator is designed to prevent the SR network from generating over-smoothed images. Second, we introduce triplet attention into the SR network for better representational ability. Moreover, besides L-2 loss and adversarial loss, wavelet loss is incorporated to help reconstruct sharper character edges. Since TextZoom provides text labels, the recognition accuracy of scene text recognition (STR) model can be used to evaluate the quality of SR images. It can reflect the performance of text image SR models better than traditional SR evaluation metrics such as PSNR and SSIM. Comprehensive experiments show the superiority of our TSRGAN. Compared with the state-of-the-art method, the proposed TSRGAN improves the average recognition accuracy of ASTER, MORAN and CRNN by 0.8%, 1.5% and 3.2% on TextZoom respectively. (C) 2021 Elsevier B.V. All rights reserved.

引用

页码：88 / 96

页数：9

共 50 条

[1] Robust Real-World Image Super-Resolution against Adversarial Attacks
Yue, Jiutao
Li, Haofeng
Wei, Pengxu
Li, Guanbin
Lin, Liang
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5148 - 5157
[2] Dynamic degradation learning for real-world image super-resolution
Chunxiao Fan
Qiong Wu
Xiang Ye
Signal, Image and Video Processing, 2023, 17 : 315 - 322
[3] Dynamic degradation learning for real-world image super-resolution
Fan, Chunxiao
Wu, Qiong
Ye, Xiang
SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (02) : 315 - 322
[4] Real-World Thermal Image Super-Resolution
Allahham, Moaaz
Aakerberg, Andreas
Nasrollahi, Kamal
Moeslund, Thomas B.
ADVANCES IN VISUAL COMPUTING (ISVC 2021), PT I, 2021, 13017 : 3 - 14
[5] Unsupervised Learning for Real-World Super-Resolution
Lugmayr, Andreas
Danelljan, Martin
Timofte, Radu
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3408 - 3416
[6] Real-World Image Super-Resolution by Exclusionary Dual-Learning
Li, Hao
Qin, Jinghui
Yang, Zhijing
Wei, Pengxu
Pan, Jinshan
Lin, Liang
Shi, Yukai
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4752 - 4763
[7] Real-World Image Super-Resolution as Multi-Task Learning
Zhang, Wenlong
Li, Xiaohui
Shi, Guangyuan
Chen, Xiangyu
Zhang, Xiaoyun
Qiao, Yu
Wu, Xiao-Ming
Dong, Chao
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[8] Generalized Real-World Super-Resolution through Adversarial Robustness
Castillo, Angela
Escobar, Maria
Perez, Juan C.
Romero, Andres
Timofte, Radu
Van Gool, Luc
Arbelaez, Pablo
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 1855 - 1865
[9] Multiscale generative adversarial network for real-world super-resolution
Sun, Ying
Yang, Zhiwen
Tao, Bo
Jiang, Guozhang
Hao, Zhiqiang
Chen, Baojia
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (21):
[10] Dual Adversarial Adaptation for Cross-Device Real-World Image Super-Resolution
Xu, Xiaoqian
Wei, Pengxu
Chen, Weikai
Liu, Yang
Mao, Mingzhi
Lin, Liang
Li, Guanbin
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 5657 - 5666

← 1 2 3 4 5 →