A Language Model for Improving the Graph-Based Transcription Approach for Historical Documents

被引:0
|
作者
Lecireth Meza-Lovon, Graciela [1 ]
机构
[1] Univ La Salle, Arequipa, Peru
关键词
Language model; Bigram; Dictionary; Text transcription; Handwriting recognition; Support vector machines; HANDWRITING RECOGNITION;
D O I
10.1007/978-3-319-12027-0_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Language Models (LMs) capture the contextual dependencies of a language and assign higher probabilities to well-formed sequences of words. For that reason, LMs have been commonly used in generic handwriting recognition, improving recognition results. In this paper, we present the integration of a Language Model along with a dictionary into a graph-based recognizer, which aims at transcribing handwritten historical documents. The results of such integration show a significant improvement on word accuracy when applied to our corpora.
引用
收藏
页码:229 / 241
页数:13
相关论文
共 50 条
  • [31] An Innovative Graph-Based Approach to Advance Feature Selection from Multiple Textual Documents
    Giarelis, Nikolaos
    Kanakaris, Nikos
    Karacapilidis, Nikos
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2020, PT I, 2020, 583 : 96 - 106
  • [32] Graph-Based Keyword Spotting in Historical Documents Using Context-Aware Hausdorff Edit Distance
    Stauffer, Michael
    Fischer, Andreas
    Riesen, Kaspar
    2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS), 2018, : 49 - 54
  • [33] Shapley Flow: A Graph-based Approach to Interpreting Model Predictions
    Wang, Jiaxuan
    Wiens, Jenna
    Lundberg, Scott
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130 : 721 - +
  • [34] Improving Graph-based Document-Level Relation Extraction Model with Novel Graph Structure
    Park, Seongsik
    Yoon, Dongkeun
    Kim, Harksoo
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 4379 - 4383
  • [35] A human in the loop approach to historical handwritten documents transcription
    Santoro, Adolfo
    Parziale, Antonio
    Marcelli, Angelo
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 222 - 227
  • [36] Improving the graph-based image segmentation method
    Zhang, Ming
    Alhajj, Reda
    ICTAI-2006: EIGHTEENTH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, : 617 - +
  • [37] Implementing hyperlog, a graph-based database language
    Hild, S
    Poulovassilis, A
    JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 1996, 7 (03): : 267 - 289
  • [38] GRAPH-BASED IMPLEMENTATION OF A FUNCTIONAL LOGIC LANGUAGE
    KUCHEN, H
    LOOGEN, R
    MORENONAVARRO, JJ
    RODRIGUEZARTALEJO, M
    LECTURE NOTES IN COMPUTER SCIENCE, 1990, 432 : 271 - 290
  • [39] Graph-based traceability: a comprehensive approach
    Hannes Schwarz
    Jürgen Ebert
    Andreas Winter
    Software & Systems Modeling, 2010, 9 : 473 - 492
  • [40] A graph-based approach to feature selection
    Zhang Z.
    Hancock E.R.
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2011, 6658 LNCS : 205 - 214