Script identification in handwritten and printed documents using convolutional recurrent connection

被引:0
|
作者
Jindal A. [1 ]
机构
[1] School of Computer Science, UPES, Bidholi, Uttarakhand, Dehradun
关键词
Bayesian optimization; CNN-LSTM; Deep learning; Script identification;
D O I
10.1007/s11042-024-19106-x
中图分类号
学科分类号
摘要
Identification of the script in multi-script handwritten or printed documents is one of the essential component to recognize the text. The script identification module helps Optical Character Recognition (OCR) to digitize the text present in the multi-script handwritten or printed documents. The similarity of characters between two or more scripts create this task tedious. The factors such as noise and writing style creates identification of the script more tedious. The present research work has proposed a deep learning method having a set of optimized convolutional layers followed by recurrently connected layers to identify the script of any word sample present in the handwritten or printed document. The proposed method has two components to extract deep hierarchical features and identify the temporal features. The experiments have been carried out on MDIW-13 and PHDIndic_11 datasets having handwritten and printed documents. The experimental results from the proposed method has improved the performance over existing methods in this regard. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
引用
收藏
页码:5549 / 5563
页数:14
相关论文
共 50 条
  • [31] Writer Identification from Handwritten Devanagari Script
    Halder, Chayan
    Thakur, Kishore
    Phadikar, Santanu
    Roy, Kaushik
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 2, 2015, 340 : 497 - 505
  • [32] Radon and Wavelet Transforms for Handwritten Script Identification
    Veershetty, C.
    Pardeshi, Rajmohan
    Hangarge, Mallikarjun
    Dhawale, Chitra
    AMBIENT COMMUNICATIONS AND COMPUTER SYSTEMS, RACCCS 2017, 2018, 696 : 755 - 765
  • [33] Improved word-level handwritten Indic script identification by integrating small convolutional neural networks
    Ukil, Soumya
    Ghosh, Swarnendu
    Obaidullah, Sk Md
    Santosh, K. C.
    Roy, Kaushik
    Das, Nibaran
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (07): : 2829 - 2844
  • [34] Improved word-level handwritten Indic script identification by integrating small convolutional neural networks
    Soumya Ukil
    Swarnendu Ghosh
    Sk Md Obaidullah
    K. C. Santosh
    Kaushik Roy
    Nibaran Das
    Neural Computing and Applications, 2020, 32 : 2829 - 2844
  • [35] Script identification from Indian documents
    Joshi, GD
    Carg, S
    Sivaswamy, J
    DOCUMENT ANALYSIS SYSTEMS VII, PROCEEDINGS, 2006, 3872 : 255 - 267
  • [36] Writer Identification in Noisy Handwritten Documents
    Ni, Karl
    Callier, Patrick
    Hatch, Bradley
    2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, : 1177 - 1186
  • [37] Script-independent text line segmentation in freestyle handwritten documents
    Li, Yi
    Zheng, Yefeng
    Doermann, David
    Jaeger, Stefan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (08) : 1313 - 1329
  • [38] Language Identification from Handwritten Documents
    Mioulet, Luc
    Garain, Utpal
    Chatelain, Clement
    Barlas, Philippine
    Paquet, Thierry
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 676 - 680
  • [39] A Novel Technique for Line Segmentation in Offline Handwritten Gurmukhi Script Documents
    Kumar, Munish
    Jindal, M. K.
    Sharma, R. K.
    NATIONAL ACADEMY SCIENCE LETTERS-INDIA, 2017, 40 (04): : 273 - 277
  • [40] A Comparison of Recognition Strategies for Printed/Handwritten Composite Documents
    Moysset, Bastien
    Messina, Ronaldo
    Kermorvant, Christopher
    2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 158 - 163