A deep learning model for Ottoman OCR

被引:5
|
作者
Dolek, Ishak [1 ]
Kurt, Atakan [1 ]
机构
[1] Istanbul Univ Cerrahpasa, Engn Sch, Comp Engn Dept, Istanbul, Turkey
来源
关键词
CNN; CTC; deep neural networks; LSTM; OCR; Ottoman; printed naksh font; RNN; NEURAL-NETWORK; RECOGNITION; SEGMENTATION; RETRIEVAL;
D O I
10.1002/cpe.6937
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The Ottoman OCR is an open problem because the OCR models for Arabic do not perform well on Ottoman. The models specifically trained with Ottoman documents have not produced satisfactory results either. We present a deep learning model and an OCR tool using that model for the OCR of printed Ottoman documents in the naksh font. We propose an end-to-end trainable CRNN architecture consisting of CNN, RNN (LSTM), and CTC layers for the Ottoman OCR problem. An experimental comparison of this model, called , with the Tesseract Arabic, the Tesseract Persian, Abby Finereader, Miletos, and Google Docs OCR tools or models was performed using a test data set of 21 pages of original documents. With 88.86% raw text, 96.12% normalized text, and 97.37% joined text character recognition accuracy, the Hybrid model outperforms the others with a marked difference. Our model outperforms the next best model by a clear margin of 4% which is a significant improvement considering the difficulty of the Ottoman OCR problem, and the huge size of the Ottoman archives to be processed. The hybrid model also achieves 58% word recognition accuracy on normalized text which is the only rate above 50%.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Towards a ptolemaic model for OCR
    Veeramachaneni, S
    Nagy, G
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 1060 - 1064
  • [32] Deep multiscale model learning
    Wang, Yating
    Cheung, Siu Wun
    Chung, Eric T.
    Efendiev, Yalchin
    Wang, Min
    JOURNAL OF COMPUTATIONAL PHYSICS, 2020, 406
  • [33] A Statistical Learning Model with Deep Learning Characteristics
    Liao, Lei
    Huang, Zhiqiu
    Wang, Wengjie
    51ST ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN-W 2021), 2021, : 137 - 140
  • [34] Model-Based Deep Learning: On the Intersection of Deep Learning and Optimization
    Shlezinger, Nir
    Eldar, Yonina C.
    Boyd, Stephen P.
    IEEE ACCESS, 2022, 10 : 115384 - 115398
  • [35] Can Deep Learning Model Perceptual Learning?
    Bakhtiari, Shahab
    JOURNAL OF NEUROSCIENCE, 2019, 39 (02): : 194 - 196
  • [36] Decoding multiculturalism through linguistic landscapes: a deep learning–based OCR analysis of street view images
    Hyebin Kim
    Eunseon Seong
    Harim Lee
    Dong-Kyu Chae
    Sugie Lee
    Urban Informatics, 4 (1):
  • [37] DHM-OCR: A Deep Hybrid Model for Online Course Recommendation and Sustainable Development of Education
    Mekala, Sagar
    Padma, T. N. S.
    Tandu, Rama Rao
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2024, 15 (04) : 345 - 354
  • [38] OCR for Data Retrieval :An analysis and Machine Learning Application model for NGO social volunteering
    Sharma, Ruchi
    Dave, Parv
    Chaudhary, Jay
    PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 422 - 427
  • [39] Assessing the Relationship Between Binarization and OCR in the Context of Deep Learning-Based ID Document Analysis
    Sanchez-Rivero, Ruben
    Bezmaternykh, Pavel
    Morales-Gonzalez, Annette
    Jose Silva-Mata, Francisco
    Bulatov, Konstantin
    PROGRESS IN ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION, 2021, 13055 : 134 - 144
  • [40] Deep Packet: Deep Learning Model for Intrusion Detection
    Kiet Nguyen Tuan
    Nguyen Duc Thai
    INTELLIGENCE OF THINGS: TECHNOLOGIES AND APPLICATIONS, ICIT 2024, VOL 2, 2025, 230 : 339 - 348