Are End-to-End Systems Really Necessary for NER on Handwritten Document Images?

被引：7

作者：

Tueselmann, Oliver ^{[1
]}

Wolf, Fabian ^{[1
]}

Fink, Gernot A. ^{[1
]}

机构：

[1] TU Dortmund Univ, Dept Comp Sci, D-44227 Dortmund, Germany

来源：

DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II | 2021年 / 12822卷

关键词：

Named entity recognition; Document image analysis; Information retrieval; Handwritten documents; RECOGNITION;

D O I：

10.1007/978-3-030-86331-9_52

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Named entities (NEs) are fundamental in the extraction of information from text. The recognition and classification of these entities into predefined categories is called Named Entity Recognition (NER) and plays a major role in Natural Language Processing. However, only a few works consider this task with respect to the document image domain. The approaches are either based on a two-stage or end-to-end architecture. A two-stage approach transforms the document image into a textual representation and determines the NEs using a textual NER. The end-to-end approach, on the other hand, avoids the explicit recognition step at text level and determines the NEs directly on image level. Current approaches that try to tackle the task of NER on segmented word images use end-to-end architectures. This is motivated by the assumption that handwriting recognition is too erroneous to allow for an effective application of textual NLP methods. In this work, we present a two-stage approach and compare it against state-of-the-art end-to-end approaches. Due to the lack of datasets and evaluation protocols, such a comparison is currently difficult. Therefore, we manually annotated the known IAM and George Washington datasets with NE labels and publish them along with optimized splits and an evaluation protocol. Our experiments show, contrary to the common belief, that a two-stage model can achieve higher scores on all tested datasets.

引用

页码：808 / 822

页数：15

共 50 条

[1] End-to-end Piece-wise Unwarping of Document Images
Das, Sagnik
Singh, Kunwar Yashraj
Wu, Jon
Bas, Erhan
Mahadevan, Vijay
Bhotika, Rahul
Samaras, Dimitris
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 4248 - 4257
[2] End-to-end system for printed Amazigh script recognition in document images
Aharrane, Nabil
Dahmouni, Abdellatif
Ensah, Karim El Moutaouakil
Satori, Khalid
2017 3RD INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2017, : 313 - 318
[3] An end-to-end framework for the detection of mathematical expressions in scientific document images
Phong, Bui Hai
Hoang, Thang Manh
Le, Thi-Lan
EXPERT SYSTEMS, 2022, 39 (01)
[4] Applying End-to-End Trainable Approach on Stroke Extraction in Handwritten Math Expressions Images
Moussa, Elmokhtar Mohamed
Lelore, Thibault
Mouchere, Harold
DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT III, 2021, 12823 : 445 - 458
[5] End-To-End Deep-Learning-Based Tamil Handwritten Document Recognition and Classification Model
Vinotheni, C.
Pandian, S. Lakshmana
IEEE ACCESS, 2023, 11 : 43195 - 43204
[6] Using Large Language Model for End-to-End Chinese ASR and NER
Li, Yuang
Yu, Jiawei
Zhang, Min
Ren, Mengxin
Zhao, Yanqing
Zhao, Xiaofeng
Tao, Shimin
Su, Jinsong
Yang, Hao
INTERSPEECH 2024, 2024, : 822 - 826
[7] An end-to-end administrative document analysis system
Hamza, Hatem
Belaid, Yolande
Belaid, Abdel
Chaudhuri, Bidyut B.
PROCEEDINGS OF THE 8TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, 2008, : 175 - 182
[8] DocFormer: End-to-End Transformer for Document Understanding
Appalaraju, Srikar
Jasani, Bhavan
Kota, Bhargava Urala
Xie, Yusheng
Manmatha, R.
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 973 - 983
[9] Multimodal End-to-End Visual Document Parsing
Lu, Yujiang
Qiu, Weifeng
Hong, Yinghua
Wang, Jiayi
HEALTH INFORMATION PROCESSING. EVALUATION TRACK PAPERS, 2023, 1773 : 154 - 163
[10] External Knowledge Acquisition for End-to-End Document-Oriented Dialog Systems
Lai, Tuan M.
Castellucci, Giuseppe
Kuzi, Saar
Ji, Heng
Rokhlenko, Oleg
17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 3633 - 3647

← 1 2 3 4 5 →