Development of an effective character segmentation and efficient feature extraction technique for Malayalam character recognition from palm leaf manuscripts

Cited by: 3
Authors
Sudarsan, Dhanya [1 ]
Sankar, Deepa [1 ]
Affiliations
[1] Cochin Univ Sci & Technol, Sch Engn Elect & Commun, Kochi, India
Keywords
Character segmentation; character recognition; base classifiers; KNN; Bayesian; decision tree; feature extraction; Malayalam palm leaf manuscripts; handwritten Bangla character; neural network; classification
DOI
10.1007/s12046-023-02181-5
Chinese Library Classification
T [Industrial Technology]
Subject classification code
08
Abstract
This paper develops a novel character segmentation and feature extraction technique for old Malayalam palm-leaf manuscripts. The generic segmentation algorithm is fine-tuned to address the language-specific properties of Malayalam characters written in old palm-leaf manuscripts. Because no major work has been reported on character recognition from old Malayalam palm-leaf manuscripts, the paper examines how various feature extractors perform in recognizing Malayalam characters, which is a prerequisite for analyzing the performance of deep learning neural networks on the same task. To this end, it presents an in-depth analysis of existing feature extraction techniques applied to the base classifiers, and it aims to identify the feature extractor and classifier pair best suited to character recognition from old Malayalam palm-leaf manuscript images. The color manuscript image is first preprocessed with a linear block-by-block transformation, Niblack's technique, and morphological operations for noise removal and binarization. The proposed feature extraction technique combines two descriptors: Log-Gabor, which encodes natural images efficiently and can address the properties of handwritten characters (similarity, overlapping characters, uneven background color, and foreground-background contrast), and uniform rotation-invariant LBP, which compensates for Log-Gabor's deficiency in invariant text analysis. The combination of Log-Gabor and uniform rotation-invariant LBP proved to be the best feature extractor for the purpose, with an accuracy of 95.57%. A stacked ResNet (convolutional neural network) architecture combined with a Long Short-Term Memory (LSTM) architecture is used to classify the characters present in the manuscript.
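The abstract names uniform rotation-invariant LBP as one half of the proposed descriptor. As a rough illustration of that idea only, and not the authors' implementation, the sketch below computes the standard 10-bin uniform rotation-invariant LBP histogram for a small grayscale image given as a list of lists. The assumptions are: 8 neighbours at radius 1 with no interpolation, a `>=` comparison against the center pixel, and the hypothetical function names `lbp_riu2_code` and `lbp_riu2_histogram`. The Log-Gabor half of the descriptor is not shown.

```python
from collections import Counter

# Assumption: 8 neighbours at radius 1, listed in circular order
# (clockwise, starting from the top-left pixel).
OFFSETS = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
           (1, 1), (1, 0), (1, -1), (0, -1)]


def lbp_riu2_code(img, r, c):
    """Uniform rotation-invariant LBP code of the interior pixel (r, c)."""
    center = img[r][c]
    # Threshold each neighbour against the center pixel.
    bits = [1 if img[r + dr][c + dc] >= center else 0 for dr, dc in OFFSETS]
    # Count 0/1 transitions around the circle, including the wrap-around.
    transitions = sum(bits[i] != bits[(i + 1) % 8] for i in range(8))
    # Uniform patterns (at most 2 transitions) are labeled by their number
    # of 1-bits (0..8), which does not change when the pattern is rotated.
    # All non-uniform patterns share the single label 9.
    return sum(bits) if transitions <= 2 else 9


def lbp_riu2_histogram(img):
    """Normalized 10-bin histogram of codes over all interior pixels."""
    rows, cols = len(img), len(img[0])
    counts = Counter(
        lbp_riu2_code(img, r, c)
        for r in range(1, rows - 1)
        for c in range(1, cols - 1)
    )
    total = sum(counts.values())
    return [counts.get(k, 0) / total for k in range(10)]


# A vertical edge and the same edge rotated by 90 degrees get the same code.
edge = [[0, 0, 9], [0, 5, 9], [0, 0, 9]]
edge_rotated = [[9, 9, 9], [0, 5, 0], [0, 0, 0]]
same_code = lbp_riu2_code(edge, 1, 1) == lbp_riu2_code(edge_rotated, 1, 1)
```

In a pipeline like the one the abstract describes, a histogram of this kind would be concatenated with the other descriptor's features before being passed to a classifier; how the paper actually combines them is not stated in this record.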
Pages: 21
Related papers
35 in total
  • [1] Development of an effective character segmentation and efficient feature extraction technique for Malayalam character recognition from palm leaf manuscripts
    Dhanya Sudarsan
    Deepa Sankar
    Sādhanā, 48
  • [2] A Character Segmentation Algorithm for the Palm Leaf Manuscripts
    Peng, Ge
    Yu, PengFei
    Li, HaiYan
    Li, HongSong
    Zhu, XuDong
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND APPLICATIONS (ICCIA), 2017, : 354 - 358
  • [3] Enhancing Malayalam Palm Leaf Character Segmentation: An Improved Simplified Approach
    Sudarsan D.
    Sankar D.
    SN Computer Science, 5 (5)
  • [4] A Hybrid Approach for Feature Extraction in Malayalam Handwritten Character Recognition
    Sujala, K.
    James, Ajay
    Saravanan, C.
    PROCEEDINGS OF THE 2017 IEEE SECOND INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES (ICECCT), 2017,
  • [5] Character and Text Recognition of Khmer Historical Palm Leaf Manuscripts
    Valy, Dona
    Verleysen, Michel
    Chhun, Sophea
    Burie, Jean-Christophe
    PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, : 13 - 18
  • [6] Character Recognition on Palm-Leaf Manuscripts-A Survey
    Sagar, B.
    Minavathi
    EMERGING RESEARCH IN ELECTRONICS, COMPUTER SCIENCE AND TECHNOLOGY, ICERECT 2018, 2019, 545 : 669 - 685
  • [7] A Comparative Study of Different Feature Extraction Techniques for Offline Malayalam Character Recognition
    Chacko, Anitha Mary M. O.
    Dhanya, P. M.
    COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 2, 2015, 32 : 9 - 18
  • [8] Feature Extraction Using Geometrical Features for Malayalam Handwritten Character Recognition System
    Thushara, K.
    James, Ajay
    Saravanan, C.
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2017, : 477 - 482
  • [9] Study on Feature Extraction Methods for Character Recognition of Balinese Script on Palm Leaf Manuscript Images
    Kesiman, Made Windu Antara
    Prum, Sophea
    Burie, Jean-Christophe
    Ogier, Jean-Marc
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 4017 - 4022
  • [10] Effective feature extraction for character recognition from low resolution images
    Department of Information Science, Faculty of Engineering, Gifu University, 1-1 Yanagido, Gifu-shi, Gifu 501-1193, Japan
    IEEJ Trans. Electron. Inf. Syst., 2009, 5 (963-969+26):