A Language Model for Improving the Graph-Based Transcription Approach for Historical Documents

被引:0
|
作者
Lecireth Meza-Lovon, Graciela [1 ]
机构
[1] Univ La Salle, Arequipa, Peru
关键词
Language model; Bigram; Dictionary; Text transcription; Handwriting recognition; Support vector machines; HANDWRITING RECOGNITION;
D O I
10.1007/978-3-319-12027-0_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Language Models (LMs) capture the contextual dependencies of a language and assign higher probabilities to well-formed sequences of words. For that reason, LMs have been commonly used in generic handwriting recognition, improving recognition results. In this paper, we present the integration of a Language Model along with a dictionary into a graph-based recognizer, which aims at transcribing handwritten historical documents. The results of such integration show a significant improvement on word accuracy when applied to our corpora.
引用
收藏
页码:229 / 241
页数:13
相关论文
共 50 条
  • [1] A graph-based approach for segmenting touching lines in historical handwritten documents
    Fernandez-Mota, David
    Llados, Josep
    Fornes, Alicia
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2014, 17 (03) : 293 - 312
  • [2] A graph-based approach for segmenting touching lines in historical handwritten documents
    David Fernández-Mota
    Josep Lladós
    Alicia Fornés
    International Journal on Document Analysis and Recognition (IJDAR), 2014, 17 : 293 - 312
  • [3] Graph-Based Keyword Spotting in Historical Handwritten Documents
    Stauffer, Michael
    Fischer, Andreas
    Riesen, Kaspar
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2016, 2016, 10029 : 564 - 573
  • [4] A Graph-Based Approach for Transcribing Ancient Documents
    Lecireth Meza-Lovon, Graciela
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2012, 2012, 7637 : 210 - 220
  • [5] A graph-based approach to transform XML documents
    Taentzer, G
    Carughi, GT
    FUNDAMENTAL APPROACHES TO SOFTWARE ENGINEERING, PROCEEDINGS, 2006, 3922 : 48 - 62
  • [6] An approach to graph-based analysis of textual documents
    Bronselaer, Antoon
    Pasi, Gabriella
    PROCEEDINGS OF THE 8TH CONFERENCE OF THE EUROPEAN SOCIETY FOR FUZZY LOGIC AND TECHNOLOGY (EUSFLAT-13), 2013, 32 : 634 - 641
  • [7] Filters for graph-based keyword spotting in historical handwritten documents
    Stauffer, Michael
    Fischer, Andreas
    Riesen, Kaspar
    PATTERN RECOGNITION LETTERS, 2020, 134 : 125 - 134
  • [8] Ensembles for Graph-based Keyword Spotting in Historical Handwritten Documents
    Stauffer, Michael
    Fischer, Andreas
    Riesen, Kaspar
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 714 - 720
  • [9] Graph-based Statistical Language Model for Code
    Anh Tuan Nguyen
    Nguyen, Tien N.
    2015 IEEE/ACM 37TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, VOL 1, 2015, : 858 - 868
  • [10] A Graph-based Approach at Passage Level to Investigate the Cohesiveness of Documents
    Sarwar, Ghulam
    O'Riordan, Colm
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, TECHNOLOGY AND APPLICATIONS (DATA), 2021, : 115 - 123