Joint Line Segmentation and Transcription for End-to-End Handwritten Paragraph Recognition

被引:0
|
作者
Bluche, Theodore [1 ]
机构
[1] A2iA SAS, 39 Rue Bienfaisance, F-75008 Paris, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Offline handwriting recognition systems require cropped text line images for both training and recognition. On the one hand, the annotation of position and transcript at line level is costly to obtain. On the other hand, automatic line segmentation algorithms are prone to errors, compromising the subsequent recognition. In this paper, we propose a modification of the popular and efficient Multi-Dimensional Long Short-Term Memory Recurrent Neural Networks (MDLSTM-RNNs) to enable end-to-end processing of handwritten paragraphs. More particularly, we replace the collapse layer transforming the two-dimensional representation into a sequence of predictions by a recurrent version which can select one line at a time. In the proposed model, a neural network performs a kind of implicit line segmentation by computing attention weights on the image representation. The experiments on paragraphs of Rimes and IAM databases yield results that are competitive with those of networks trained at line level, and constitute a significant step towards end-to-end transcription of full documents.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention
    Bluche, Theodore
    Louradour, Jerome
    Messina, Ronaldo
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 1050 - 1055
  • [2] End-to-End Handwritten Paragraph Text Recognition Using a Vertical Attention Network
    Coquenet, Denis
    Chatelain, Clement
    Paquet, Thierry
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 508 - 524
  • [3] End-to-end Handwritten Chinese Paragraph Text Recognition Using Residual Attention Networks
    Wang, Yintong
    Yang, Yingjie
    Chen, Haiyan
    Zheng, Hao
    Chang, Heyou
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 34 (01): : 371 - 388
  • [4] ResneSt-Transformer: Joint attention segmentation-free for end-to-end handwriting paragraph recognition model
    Hamdan, Mohammed
    Cheriet, Mohamed
    ARRAY, 2023, 19
  • [5] Joint Recognition of Handwritten Text and Named Entities with a Neural End-to-end Model
    Carbonell, Manuel
    Villegas, Mauricio
    Fornes, Alicia
    Llados, Josep
    2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS), 2018, : 399 - 404
  • [6] End-to-End Handwritten Text Detection and Transcription in Full Pages
    Carbonell, Manuel
    Mas, Joan
    Villegas, Mauricio
    Fornes, Alicia
    Llados, Josep
    2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDARW), VOL 5, 2019, : 29 - 34
  • [7] End-to-End Optical Character Recognition for Bengali Handwritten Words
    Safir, Farisa Benta
    Ohi, Abu Quwsar
    Mridha, M. F.
    Monowar, Muhammad Mostafa
    Hamid, Md Abdul
    2021 IEEE NATIONAL COMPUTING COLLEGES CONFERENCE (NCCC 2021), 2021, : 1067 - +
  • [8] An end-to-end generative framework for video segmentation and recognition
    Kuehne, Hilde
    Gall, Juergen
    Serre, Thomas
    2016 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2016), 2016,
  • [9] An End-to-End Approach for Recognition of Modern and Historical Handwritten Numeral Strings
    Hochuli, Andre G.
    Britto, Alceu S., Jr.
    Barddal, Jean P.
    Oliveira, Luiz E. S.
    Sabourin, Robert
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [10] A comprehensive comparison of end-to-end approaches for handwritten digit string recognition
    Hochuli, Andre G.
    Britto Jr, Alceu S.
    Saji, David A.
    Saavedra, Jose M.
    Sabourin, Robert
    Oliveira, Luiz S.
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 165 (165)