Are End-to-End Systems Really Necessary for NER on Handwritten Document Images?

被引:7
|
作者
Tueselmann, Oliver [1 ]
Wolf, Fabian [1 ]
Fink, Gernot A. [1 ]
机构
[1] TU Dortmund Univ, Dept Comp Sci, D-44227 Dortmund, Germany
关键词
Named entity recognition; Document image analysis; Information retrieval; Handwritten documents; RECOGNITION;
D O I
10.1007/978-3-030-86331-9_52
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Named entities (NEs) are fundamental in the extraction of information from text. The recognition and classification of these entities into predefined categories is called Named Entity Recognition (NER) and plays a major role in Natural Language Processing. However, only a few works consider this task with respect to the document image domain. The approaches are either based on a two-stage or end-to-end architecture. A two-stage approach transforms the document image into a textual representation and determines the NEs using a textual NER. The end-to-end approach, on the other hand, avoids the explicit recognition step at text level and determines the NEs directly on image level. Current approaches that try to tackle the task of NER on segmented word images use end-to-end architectures. This is motivated by the assumption that handwriting recognition is too erroneous to allow for an effective application of textual NLP methods. In this work, we present a two-stage approach and compare it against state-of-the-art end-to-end approaches. Due to the lack of datasets and evaluation protocols, such a comparison is currently difficult. Therefore, we manually annotated the known IAM and George Washington datasets with NE labels and publish them along with optimized splits and an evaluation protocol. Our experiments show, contrary to the common belief, that a two-stage model can achieve higher scores on all tested datasets.
引用
收藏
页码:808 / 822
页数:15
相关论文
共 50 条
  • [1] End-to-end Piece-wise Unwarping of Document Images
    Das, Sagnik
    Singh, Kunwar Yashraj
    Wu, Jon
    Bas, Erhan
    Mahadevan, Vijay
    Bhotika, Rahul
    Samaras, Dimitris
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 4248 - 4257
  • [2] End-to-end system for printed Amazigh script recognition in document images
    Aharrane, Nabil
    Dahmouni, Abdellatif
    Ensah, Karim El Moutaouakil
    Satori, Khalid
    2017 3RD INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2017, : 313 - 318
  • [3] An end-to-end framework for the detection of mathematical expressions in scientific document images
    Phong, Bui Hai
    Hoang, Thang Manh
    Le, Thi-Lan
    EXPERT SYSTEMS, 2022, 39 (01)
  • [4] Applying End-to-End Trainable Approach on Stroke Extraction in Handwritten Math Expressions Images
    Moussa, Elmokhtar Mohamed
    Lelore, Thibault
    Mouchere, Harold
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT III, 2021, 12823 : 445 - 458
  • [5] End-To-End Deep-Learning-Based Tamil Handwritten Document Recognition and Classification Model
    Vinotheni, C.
    Pandian, S. Lakshmana
    IEEE ACCESS, 2023, 11 : 43195 - 43204
  • [6] Using Large Language Model for End-to-End Chinese ASR and NER
    Li, Yuang
    Yu, Jiawei
    Zhang, Min
    Ren, Mengxin
    Zhao, Yanqing
    Zhao, Xiaofeng
    Tao, Shimin
    Su, Jinsong
    Yang, Hao
    INTERSPEECH 2024, 2024, : 822 - 826
  • [7] An end-to-end administrative document analysis system
    Hamza, Hatem
    Belaid, Yolande
    Belaid, Abdel
    Chaudhuri, Bidyut B.
    PROCEEDINGS OF THE 8TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, 2008, : 175 - 182
  • [8] DocFormer: End-to-End Transformer for Document Understanding
    Appalaraju, Srikar
    Jasani, Bhavan
    Kota, Bhargava Urala
    Xie, Yusheng
    Manmatha, R.
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 973 - 983
  • [9] Multimodal End-to-End Visual Document Parsing
    Lu, Yujiang
    Qiu, Weifeng
    Hong, Yinghua
    Wang, Jiayi
    HEALTH INFORMATION PROCESSING. EVALUATION TRACK PAPERS, 2023, 1773 : 154 - 163
  • [10] External Knowledge Acquisition for End-to-End Document-Oriented Dialog Systems
    Lai, Tuan M.
    Castellucci, Giuseppe
    Kuzi, Saar
    Ji, Heng
    Rokhlenko, Oleg
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 3633 - 3647