Data Augmentation and Text Recognition on Khmer Historical Manuscripts

被引:6
|
作者
Valy, Dona [1 ]
Verleysen, Michel [2 ]
Chhun, Sophea [1 ]
机构
[1] Inst Technol Cambodia, Dept Informat & Commun Engn, Phnom Penh, Cambodia
[2] Catholic Univ Louvain, ICTEAM Inst, Ottignies, Belgium
关键词
historical document analysis; palm leaf manuscript; neural network; data augmentation; CHARACTER;
D O I
10.1109/ICFHR2020.2020.00024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Analysis and recognition of historical documents faces many challenges, one of which is the scarcity of the ground truth data needed for most machine learning techniques, deep learning in particular. In this paper, we present a novel approach which significantly augments the word image samples generated from an existing dataset of Khmer ancient palm leaf manuscripts. Instead of segmenting real Khmer words, we combine the annotated glyphs into groups called sub-syllables. A new text recognition method is also proposed to take into account the spatially complex structure of Khmer writing. The proposed method is composed of two main modules: a feature generator and a decoder. The generator utilizes convolutional blocks, inception blocks, and also a bi-directional LSTM to encode information extracted from the input image so that it can be decoded by the attention-based decoder to predict the final text transcription. Experiments are conducted on a new dataset of groups of sub-syllables constructed from annotated glyphs of the SleukRith Set.
引用
收藏
页码:73 / 78
页数:6
相关论文
共 50 条
  • [21] Named Entity Recognition in Chinese Rice Breeding Questions Based on Text Data Augmentation
    Niu, Peiyu
    Hou, Chen
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2024, 55 (08): : 333 - 343
  • [22] Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition
    Eltay, Mohamed
    Zidouri, Abdelmalek
    Ahmad, Irfan
    Elarian, Yousef
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [23] Few-shot dysarthric speech recognition with text-to-speech data augmentation
    Hermann, Enno
    Magimai-Doss, Mathew
    INTERSPEECH 2023, 2023, : 156 - 160
  • [24] Improving Handwritten Arabic Text Recognition Using an Adaptive Data-Augmentation Algorithm
    Eltay, Mohamed
    Zidouri, Abdelmalek
    Ahmad, Irfan
    Elarian, Yousef
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021 WORKSHOPS, PT I, 2021, 12916 : 322 - 335
  • [25] Entity-to-Text based Data Augmentation for various Named Entity Recognition Tasks
    Hu, Xuming
    Jiang, Yong
    Liu, Aiwei
    Huang, Zhongqiang
    Xie, Pengjun
    Huang, Fei
    Wen, Lijie
    Yu, Philip S.
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 9072 - 9087
  • [26] On the Effectiveness of Neural Text Generation Based Data Augmentation for Recognition of Morphologically Rich Speech
    Tarjan, Balazs
    Szaszak, Gyorgy
    Fegyo, Tibor
    Mihajlik, Peter
    TEXT, SPEECH, AND DIALOGUE (TSD 2020), 2020, 12284 : 437 - 445
  • [27] Text Line Extraction using DMLP Classifiers for Historical Manuscripts
    Baechler, Micheal
    Liwicki, Marcus
    Ingold, Rolf
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1029 - 1033
  • [28] iForal: Automated Handwritten Text Transcription for Historical Medieval Manuscripts
    Matos, Alexandre
    Almeida, Pedro
    Correia, Paulo L.
    Pacheco, Osvaldo
    JOURNAL OF IMAGING, 2025, 11 (02)
  • [29] Text Data Augmentation for Deep Learning
    Shorten, Connor
    Khoshgoftaar, Taghi M.
    Furht, Borko
    JOURNAL OF BIG DATA, 2021, 8 (01)
  • [30] Text Data Augmentation for Deep Learning
    Connor Shorten
    Taghi M. Khoshgoftaar
    Borko Furht
    Journal of Big Data, 8