A Self-attention Based Model for Offline Handwritten Text Recognition

被引:2
|
作者
Nam Tuan Ly [1 ]
Trung Tan Ngo [1 ]
Nakagawa, Masaki [1 ]
机构
[1] Tokyo Univ Agr & Technol, Tokyo, Japan
来源
关键词
Self-attention; Multi-head; Handwritten text recognition; CNN; BLSTM; CTC; SEQUENCE;
D O I
10.1007/978-3-031-02444-3_27
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Offline handwritten text recognition is an important part of document analysis and it has been receiving a lot of attention from numerous researchers for decades. In this paper, we present a self-attention-based model for offline handwritten textline recognition. The proposed model consists of three main components: a feature extractor by CNN; an encoder by a BLSTM network and a self-attention module; and a decoder by CTC. The self-attention module is complementary to RNN in the encoder and helps the encoder to capture long-range and multi-level dependencies across an input sequence. According to the extensive experiments on the two datasets of IAM Handwriting and Kuzushiji, the proposed model achieves better accuracy than the state-of-the-art models. The self-attention map visualization shows that the self-attention mechanism helps the encoder capture long-range and multi-level dependencies across an input sequence.
引用
收藏
页码:356 / 369
页数:14
相关论文
共 50 条
  • [41] Offline Handwritten Text Recognition Using Support Vector Machines
    Rajnoha, Martin
    Burget, Radim
    Dutta, Malay Kishore
    2017 4TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2017, : 132 - 136
  • [42] Bridging the Gap in Resource for Offline English Handwritten Text Recognition
    Mondal, Ajoy
    Tulsyan, Krishna
    Jawahai, C., V
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT II, 2024, 14805 : 413 - 428
  • [43] Polarimetric HRRP Recognition Based on ConvLSTM With Self-Attention
    Zhang, Liang
    Li, Yang
    Wang, Yanhua
    Wang, Junfu
    Long, Teng
    IEEE SENSORS JOURNAL, 2021, 21 (06) : 7884 - 7898
  • [44] Finger Vein Recognition Based on ResNet With Self-Attention
    Zhang, Zhibo
    Chen, Guanghua
    Zhang, Weifeng
    Wang, Huiyang
    IEEE ACCESS, 2024, 12 : 1943 - 1951
  • [45] A Bayesian-based method of unconstrained handwritten offline Chinese text line recognition
    Li, Nan-Xi
    Jin, Lian-Wen
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2013, 16 (01) : 17 - 31
  • [46] Recognition of unconstrained handwritten offline Chinese text line based on multiple information fusion
    Li N.-X.
    Jin L.-W.
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2010, 38 (12): : 45 - 49
  • [47] A Bayesian-based method of unconstrained handwritten offline Chinese text line recognition
    Nan-Xi Li
    Lian-Wen Jin
    International Journal on Document Analysis and Recognition (IJDAR), 2013, 16 : 17 - 31
  • [48] Offline handwritten Chinese character recognition based on DBN fusion model
    Liu, Lu
    Sun, Weiwei
    Ding, Bo
    2016 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION (ICIA), 2016, : 1807 - 1811
  • [49] Efficient Self-Attention Model for Speech Recognition-Based Assistive Robots Control
    Poirier, Samuel
    Cote-Allard, Ulysse
    Routhier, Francois
    Campeau-Lecours, Alexandre
    SENSORS, 2023, 23 (13)
  • [50] Self-attention for Speech Emotion Recognition
    Tarantino, Lorenzo
    Garner, Philip N.
    Lazaridis, Alexandros
    INTERSPEECH 2019, 2019, : 2578 - 2582