Development of an effective character segmentation and efficient feature extraction technique for Malayalam character recognition from palm leaf manuscripts

Cited by: 3
Authors
Sudarsan, Dhanya [1]
Sankar, Deepa [1]
Affiliations
[1] Cochin Univ Sci & Technol, Sch Engn Elect & Commun, Kochi, India
Keywords
Character segmentation; character recognition; base classifiers; KNN; Bayesian; decision tree; feature extraction; Malayalam palm leaf manuscripts; HANDWRITTEN BANGLA CHARACTER; NEURAL-NETWORK; CLASSIFICATION
DOI
10.1007/s12046-023-02181-5
Chinese Library Classification
T [Industrial Technology]
Discipline classification code
08
Abstract
This paper develops a novel character segmentation and feature extraction technique for old Malayalam palm leaf manuscripts. The generic segmentation algorithm is fine-tuned to address the language-specific properties of Malayalam characters written on old palm leaf manuscripts. Because no major work has been reported on character recognition from old Malayalam palm leaf manuscripts, the paper examines how various feature extractors perform in recognizing Malayalam characters, which is necessary when analyzing the performance of deep neural networks on the same task. To this end, it presents an in-depth analysis of existing feature extraction techniques paired with base classifiers, and it aims to identify the feature extractor and classifier pair best suited to character recognition from old Malayalam palm leaf manuscript images. The color palm leaf manuscript is first preprocessed using a linear block-by-block transformation, Niblack's technique, and morphological operations for noise removal and binarization. The proposed feature extraction technique combines two descriptors. Log-Gabor encodes natural images well and efficiently handles the properties of handwritten characters (similarity, overlapping characters, uneven background color, and foreground-background contrast). Uniform rotation-invariant local binary patterns (LBP) address Log-Gabor's deficiency in invariant text analysis. The combination of Log-Gabor and uniform rotation-invariant LBP proved to be the best feature extractor for the purpose, with an accuracy of 95.57%.
A stacked ResNet (convolutional neural network) and Long Short-Term Memory (LSTM) architecture is used to classify the characters present in the manuscript.
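Two of the building blocks named in the abstract, Niblack's local thresholding and the uniform rotation-invariant LBP descriptor, can be illustrated with a short NumPy sketch. This is not the authors' implementation. The function names, the window size of 15, and k = -0.2 are assumptions chosen for illustration, and the LBP uses the eight integer-offset neighbours of each pixel rather than an interpolated circle. The Log-Gabor part of the combined descriptor is not shown.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def niblack_binarize(gray, window=15, k=-0.2):
    """Niblack thresholding: T = local_mean + k * local_std.

    Returns a boolean mask that is True where a pixel is darker than its
    local threshold (candidate ink). `window` must be odd; both `window`
    and `k` are illustrative defaults, not values from the paper.
    """
    gray = np.asarray(gray, dtype=np.float64)
    pad = window // 2
    padded = np.pad(gray, pad, mode="reflect")
    # One (window x window) patch per pixel of the original image.
    patches = sliding_window_view(padded, (window, window))
    mean = patches.mean(axis=(-2, -1))
    std = patches.std(axis=(-2, -1))
    return gray < mean + k * std


def lbp_riu2_histogram(gray):
    """Normalized 10-bin histogram of uniform rotation-invariant LBP codes.

    Simplification (assumption): P = 8 neighbours at integer offsets.
    Codes 0..8 count the set bits of uniform patterns (at most two 0/1
    transitions around the ring); code 9 collects non-uniform patterns.
    """
    gray = np.asarray(gray, dtype=np.float64)
    h, w = gray.shape
    center = gray[1:-1, 1:-1]
    # Neighbours listed in circular order around the centre pixel.
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    bits = np.stack([
        gray[1 + dy:h - 1 + dy, 1 + dx:w - 1 + dx] >= center
        for dy, dx in offsets
    ]).astype(np.int64)
    # Number of 0/1 transitions when walking once around the ring.
    transitions = np.abs(bits - np.roll(bits, 1, axis=0)).sum(axis=0)
    codes = np.where(transitions <= 2, bits.sum(axis=0), 9)
    hist = np.bincount(codes.ravel(), minlength=10).astype(np.float64)
    return hist / hist.sum()
```

In a pipeline like the one the abstract describes, such a histogram would be concatenated with the Log-Gabor responses before being passed to a classifier. How the authors combine the two descriptors is not stated in this record.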
Pages: 21
Related papers
35 in total
  • [21] Efficient Feature Extraction Techniques for Offline Handwritten Gurmukhi Character Recognition
    Kumar, Munish
    Sharma, R. K.
    Jindal, M. K.
    NATIONAL ACADEMY SCIENCE LETTERS-INDIA, 2014, 37 (04): 381-391
  • [22] A Multilevel CNN Architecture for Character Recognition from Palm Leaf Images
    Jyothi, R. L.
    Rahiman, M. Abdul
    INTELLIGENT COMPUTING AND COMMUNICATION, ICICC 2019, 2020, 1034: 185-193
  • [23] The development of the feature extraction algorithms for Thai handwritten character recognition system
    Mitrpanont, JL
    Kiwprasopsak, S
    DEVELOPMENTS IN APPLIED ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, 2358: 536-546
  • [24] Combined Horizontal and Vertical Projection Feature Extraction Technique for Gurmukhi Handwritten Character Recognition
    Mahto, Manoj Kumar
    Bhatia, Karamjit
    Sharma, R. K.
    2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER ENGINEERING AND APPLICATIONS (ICACEA), 2015: 59-65
  • [25] An efficient feature extraction and dimensionality reduction scheme for isolated Greek handwritten character recognition
    Vamvakas, G.
    Gatos, B.
    Petridis, S.
    Stamatopoulos, N.
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007: 1073-1077
  • [26] A contour character extraction approach in conjunction with a neural confidence fusion technique for the segmentation of handwriting recognition
    Verma, B
    ICONIP'02: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING: COMPUTATIONAL INTELLIGENCE FOR THE E-AGE, 2002: 2459-2463
  • [27] Hindi Character Recognition Using RBF Neural Network and Directional Group Feature Extraction Technique
    Singh, Dayashankar
    Saini, J. P.
    Chauhan, D. S.
    2015 INTERNATIONAL CONFERENCE ON COGNITIVE COMPUTING AND INFORMATION PROCESSING (CCIP), 2015
  • [28] A New Scheme for Text Line and Character Segmentation from Gray Scale Images of Palm Leaf Manuscript
    Kesiman, Made Windu Antara
    Burie, Jean-Christophe
    Ogier, Jean-Marc
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016: 325-330
  • [29] An effective geometrical feature extraction method for scale and rotational invariant multi-lingual character recognition
    Mohammed, Sharfuddin Waseem
    Murugan, Brindha
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2025, 22 (02)
  • [30] A Novel Geometrical Scale and Rotation Independent Feature Extraction Technique for Multi-lingual Character Recognition
    Soora, Narasimha Reddy
    Mohammed, Ehsan Ur Rahman
    Mohammed, Sharfuddin Waseem
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (11): 231-239