A Novel Connectionist System for Unconstrained Handwriting Recognition

被引:1250
|
作者
Graves, Alex [1 ]
Liwicki, Marcus [2 ]
Fernandez, Santiago [3 ]
Bertolami, Roman [4 ]
Bunke, Horst [4 ]
Schmidhuber, Juergen [1 ]
机构
[1] Tech Univ Munich, Inst Informat, D-85478 Garching, Germany
[2] DKFI German Res Ctr Artificial Intelligence, Res Grp Knowledge Management, D-67663 Kaiserslautern, Germany
[3] IDSIA, CH-6928 Manno Lugano, Switzerland
[4] Inst Comp Sci & Appl Math IAM, Res Grp Comp Vis & Artificial Intelligence FKI, CH-3012 Bern, Switzerland
基金
瑞士国家科学基金会;
关键词
Handwriting recognition; online handwriting; offline handwriting; connectionist temporal classification; bidirectional long short-term memory; recurrent neural networks; hidden Markov model; ONLINE;
D O I
10.1109/TPAMI.2008.137
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recognizing lines of unconstrained handwritten text is a challenging task. The difficulty of segmenting cursive or overlapping characters, combined with the need to exploit surrounding context, has led to low recognition rates for even the best current recognizers. Most recent progress in the field has been made either through improved preprocessing or through advances in language modeling. Relatively little work has been done on the basic recognition algorithms. Indeed, most systems rely on the same hidden Markov models that have been used for decades in speech and handwriting recognition, despite their well-known shortcomings. This paper proposes an alternative approach based on a novel type of recurrent neural network, specifically designed for sequence labeling tasks where the data is hard to segment and contains long-range bidirectional interdependencies. In experiments on two large unconstrained handwriting databases, our approach achieves word recognition accuracies of 79.7 percent on online data and 74.1 percent on offline data, significantly outperforming a state-of-the-art HMM-based system. In addition, we demonstrate the network's robustness to lexicon size, measure the individual influence of its hidden layers, and analyze its use of context. Last, we provide an in-depth discussion of the differences between the network and HMMs, suggesting reasons for the network's superior performance.
引用
收藏
页码:855 / 868
页数:14
相关论文
共 50 条
  • [31] Airwriting: a wearable handwriting recognition system
    Amma, Christoph
    Georgi, Marcus
    Schultz, Tanja
    PERSONAL AND UBIQUITOUS COMPUTING, 2014, 18 (01) : 191 - 203
  • [32] A maximum-likelihood approach to segmentation-based recognition of unconstrained handwriting text
    Senda, S
    Yamada, K
    SIXTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, PROCEEDINGS, 2001, : 184 - 188
  • [33] An online handwriting recognition system for Turkish
    Vural, E
    Erdogan, H
    Oflazer, K
    Yanikoglu, B
    PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004, : 607 - 610
  • [34] Online Bangla handwriting recognition system
    Roy, K.
    Sharma, N.
    Pal, T.
    Pal, U.
    PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, 2007, : 117 - +
  • [35] A maximum-likelihood approach to segmentation-based recognition of unconstrained handwriting text
    Senda, Shuji
    Yamada, Keiji
    Proceedings of the International Conference on Document Analysis and Recognition, ICDAR, 2001, 2001-January : 184 - 188
  • [36] Text Entry Performance of State of the Art Unconstrained Handwriting Recognition: A Longitudinal User Study
    Kristensson, Per Ola
    Denby, Leif C.
    CHI2009: PROCEEDINGS OF THE 27TH ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, VOLS 1-4, 2009, : 567 - 570
  • [37] Conv-transformer architecture for unconstrained off-line Urdu handwriting recognition
    Nauman Riaz
    Haziq Arbab
    Arooba Maqsood
    Khuzaeymah Nasir
    Adnan Ul-Hasan
    Faisal Shafait
    International Journal on Document Analysis and Recognition (IJDAR), 2022, 25 : 373 - 384
  • [38] Conv-transformer architecture for unconstrained off-line Urdu handwriting recognition
    Riaz, Nauman
    Arbab, Haziq
    Maqsood, Arooba
    Nasir, Khuzaeymah
    Ul-Hasan, Adnan
    Shafait, Faisal
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2022, 25 (04) : 373 - 384
  • [39] CONTEXT-DEPENDENT SEARCH IN INTERCONNECTED HIDDEN MARKOV MODEL FOR UNCONSTRAINED HANDWRITING RECOGNITION
    OH, SC
    HA, JY
    KIM, JH
    PATTERN RECOGNITION, 1995, 28 (11) : 1693 - 1704
  • [40] Committees of RBFN - A novel connectionist model for speech recognition
    Meenakshisundaram, S
    Woo, WL
    Dlay, SS
    Proceedings of the Eighth IASTED International Conference on Artificial Intelligence and Soft Computing, 2004, : 300 - 304