Training an Arabic handwriting recognizer without a handwritten training data set

被引:0
|
作者
Ahmad, Irfan [1 ,2 ]
Fink, Gernot A. [2 ]
机构
[1] KFUPM, Informat & Comp Sci Dept, Dhahran, Saudi Arabia
[2] TU Dortmund Univ, Dept Comp Sci, Dortmund, Germany
关键词
Handwritten text recognition; hidden Markov models; training data; efficient training; HMM adaptation; OCR; ADAPTATION; MODELS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Handwritten text recognition is an active research area in pattern recognition. One of the prerequisites of setting up a handwritten text recognizer is to train them using, mostly, large amounts of labeled training data. In the current paper we report our work on handwritten text recognition using no handwritten training set. We investigate different approaches including, computer generated text in different typefaces as training data, unsupervised adaptation, and using recognition hypothesis on the test sets as training data. Results from handwritten Arabic word recognition task show that the approach is promising with good recognition rates.
引用
收藏
页码:476 / 480
页数:5
相关论文
共 50 条
  • [1] New Hybrid Arabic Handwriting Recognizer
    Chergui, Leila
    Kef, Maamar
    Chikhi, Salim
    2012 6TH INTERNATIONAL CONFERENCE ON SCIENCES OF ELECTRONICS, TECHNOLOGIES OF INFORMATION AND TELECOMMUNICATIONS (SETIT), 2012, : 319 - 325
  • [2] Equalizing the training set for neural network recognizer
    Wang, YH
    Liu, GS
    Wang, YD
    AUTOMATIC TARGET RECOGNITION VII, 1997, 3069 : 494 - 502
  • [3] Training of an on-line handwritten Japanese character recognizer by artificial patterns
    Chen, Bin
    Zhu, Bilan
    Nakagawa, Masaki
    PATTERN RECOGNITION LETTERS, 2014, 35 : 178 - 185
  • [4] Training a Whole-Book LSTM-Based Recognizer with an Optimal Training Set
    Soheili, Mohammad Reza
    Yousefi, Mohammad Reza
    Kabir, Ehsanollah
    Stricker, Didier
    TENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2017), 2018, 10696
  • [5] PROPER AND EFFECTIVE TRAINING OF A PATTERN RECOGNIZER FOR CYCLIC DATA
    HWARNG, HB
    IIE TRANSACTIONS, 1995, 27 (06) : 746 - 756
  • [6] Synthesizing Training Data for Handwritten Music Recognition
    Mayer, Jiri
    Pecina, Pavel
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT III, 2021, 12823 : 626 - 641
  • [7] Optimizing the number of states, training iterations and Gaussians in an HMM-based handwritten word recognizer
    Günter, S
    Bunke, H
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 472 - 476
  • [8] Experiences in data collection for the training of an automatic speech recognizer in Sepedi
    Manamela, MJD
    Botha, EC
    2002 IEEE AFRICON, VOLS 1 AND 2: ELECTROTECHNOLOGICAL SERVICES FOR AFRICA, 2002, : 377 - 381
  • [9] Operon prediction without a training set
    Westover, BP
    Buhler, JD
    Sonnenburg, JL
    Gordon, JI
    BIOINFORMATICS, 2005, 21 (07) : 880 - 888
  • [10] Residual Recurrent Neural Network with Sparse Training for Offline Arabic Handwriting Recognition
    Yan, Ruijie
    Peng, Liangrui
    Bin, GuangXiang
    Wang, Shengjin
    Cheng, Yao
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 1031 - 1037