DENSE CHAINED ATTENTION NETWORK FOR SCENE TEXT RECOGNITION

被引:0
|
作者
Gao, Yunze [1 ]
Chen, Yingying
Wang, Jinqiao
Tang, Ming
Lu, Hanqing
机构
[1] Chinese Acad Sci, Natl Lab Pattern Recognit, Inst Automat, Beijing 100190, Peoples R China
关键词
text recognition; attention; convolution-deconvolution;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Reading text in the wild is a challenging task in computer vision. Scene text suffers from various background noise, including shadow, irrelevant symbols and background texture. In order to reduce the disturbance of background noise, we propose a dense chained attention network with stacked attention modules for scene text recognition. Each attention module learns the attention map that is adapted to corresponding features to enhance the foreground text and suppress the background noise. Besides, the attention branch is designed with the convolution-deconvolution structure which rapidly captures global information to guide the discriminative feature selection. We stack multiple attention modules to gradually refine the attention maps and capture both the low-level appearance feature and the high-level semantic information. Extensive experiments on the standard benchmarks, the Street View Text, IIIT5K, and ICDAR datasets validate the superiority of the proposed method. The dense chained attention network achieves state-of-the-art or highly competitive recognition performance.
引用
收藏
页码:679 / 683
页数:5
相关论文
共 50 条
  • [1] Scene Text Recognition with Cascade Attention Network
    Zhang, Min
    Ma, Meng
    Wang, Ping
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 385 - 393
  • [2] Gaussian Constrained Attention Network for Scene Text Recognition
    Qiao, Zhi
    Qin, Xugong
    Zhou, Yu
    Yang, Fei
    Wang, Weiping
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3328 - 3335
  • [3] Scene Text Recognition by Attention Network with Gated Embedding
    Wang, Cong
    Liu, Cheng-Lin
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [4] Spatial attention contrastive network for scene text recognition
    Wang, Fan
    Yin, Dong
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (04)
  • [5] A holistic representation guided attention network for scene text recognition
    Yang, Lu
    Wang, Peng
    Li, Hui
    Li, Zhen
    Zhang, Yanning
    NEUROCOMPUTING, 2020, 414 : 67 - 75
  • [6] Deep neural network with attention model for scene text recognition
    Li, Shuohao
    Tang, Min
    Guo, Qiang
    Lei, Jun
    Zhang, Jun
    IET COMPUTER VISION, 2017, 11 (07) : 605 - 612
  • [7] Deformable Mixed Domain Attention Network for Scene Text Recognition
    Huang, Yangyang
    Fang, Wei
    PROCEEDINGS OF 2020 IEEE 11TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2020), 2020, : 142 - 145
  • [8] EPAN: Effective parts attention network for scene text recognition
    Huang, Yunlong
    Sun, Zenghui
    Jin, Lianwen
    Luo, Canjie
    NEUROCOMPUTING, 2020, 376 (376) : 202 - 213
  • [9] A Two-Level Rectification Attention Network for Scene Text Recognition
    Wu, Lintai
    Xu, Yong
    Hou, Junhui
    Chen, C. L. Philip
    Liu, Cheng-Lin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2404 - 2414
  • [10] MEAN: Multi-Element Attention Network for Scene Text Recognition
    Yan, Ruijie
    Peng, Liangrui
    Xiao, Shanyu
    Yao, Gang
    Min, Jaesik
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 6850 - 6857