DENSE CHAINED ATTENTION NETWORK FOR SCENE TEXT RECOGNITION

被引:0
|
作者
Gao, Yunze [1 ]
Chen, Yingying
Wang, Jinqiao
Tang, Ming
Lu, Hanqing
机构
[1] Chinese Acad Sci, Natl Lab Pattern Recognit, Inst Automat, Beijing 100190, Peoples R China
关键词
text recognition; attention; convolution-deconvolution;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Reading text in the wild is a challenging task in computer vision. Scene text suffers from various background noise, including shadow, irrelevant symbols and background texture. In order to reduce the disturbance of background noise, we propose a dense chained attention network with stacked attention modules for scene text recognition. Each attention module learns the attention map that is adapted to corresponding features to enhance the foreground text and suppress the background noise. Besides, the attention branch is designed with the convolution-deconvolution structure which rapidly captures global information to guide the discriminative feature selection. We stack multiple attention modules to gradually refine the attention maps and capture both the low-level appearance feature and the high-level semantic information. Extensive experiments on the standard benchmarks, the Street View Text, IIIT5K, and ICDAR datasets validate the superiority of the proposed method. The dense chained attention network achieves state-of-the-art or highly competitive recognition performance.
引用
收藏
页码:679 / 683
页数:5
相关论文
共 50 条
  • [21] Review network for scene text recognition
    Li, Shuohao
    Han, Anqi
    Chen, Xu
    Yin, Xiaoqing
    Zhang, Jun
    JOURNAL OF ELECTRONIC IMAGING, 2017, 26 (05)
  • [22] Text proposals with location-awareness-attention network for arbitrarily shaped scene text detection and recognition
    Zhong, Dajian
    Lyu, Shujing
    Shivakumara, Palaiahankote
    Pal, Umapada
    Lu, Yue
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 205
  • [23] SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily Oriented Scene Text Recognition
    Zhong, Dajian
    Lyu, Shujing
    Shivakumara, Palaiahnakote
    Yin, Bing
    Wu, Jiajia
    Pal, Umapada
    Lu, Yue
    COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 464 - 480
  • [24] Attention-Based Deep Neural Network and Its Application to Scene Text Recognition
    He, Haizhen
    Li, Jiehan
    2019 IEEE 11TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2019), 2019, : 672 - 677
  • [25] CAMTNet: CTC-Attention Mechanism and Transformer Fusion Network for Scene Text Recognition
    Wang, Ling
    Luo, Kexin
    Wang, Peng
    Bai, Yane
    IAENG International Journal of Computer Science, 2024, 51 (11) : 1750 - 1760
  • [26] Sequential alignment attention model for scene text recognition
    Wu, Yan
    Fan, Jiaxin
    Tao, Renshuai
    Wang, Jiakai
    Qin, Haotong
    Liu, Aishan
    Liu, Xianglong
    Tao, Renshuai (rstao@buaa.edu.cn), 1600, Academic Press Inc. (80):
  • [27] FACLSTM: ConvLSTM with focused attention for scene text recognition
    Wang, Qingqing
    Huang, Ye
    Jia, Wenjing
    He, Xiangjian
    Blumenstein, Michael
    Lyu, Shujing
    Lu, Yue
    SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (02)
  • [28] SCENE TEXT RECOGNITION VIA GATED CASCADE ATTENTION
    Wang, Siwei
    Wang, Yongtao
    Qin, Xiaoran
    Zhao, Qijie
    Tang, Zhi
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1018 - 1023
  • [29] Attention Guided Feature Encoding for Scene Text Recognition
    Hassan, Ehtesham
    Lekshmi, V. L.
    JOURNAL OF IMAGING, 2022, 8 (10)
  • [30] FACLSTM: ConvLSTM with focused attention for scene text recognition
    Qingqing Wang
    Ye Huang
    Wenjing Jia
    Xiangjian He
    Michael Blumenstein
    Shujing Lyu
    Yue Lu
    Science China Information Sciences, 2020, 63