A Hybrid Swarm and Gravitation-based feature selection algorithm for handwritten Indic script classification problem

Cited by: 8
Authors
Guha, Ritam [1]
Ghosh, Manosij [1]
Singh, Pawan Kumar [2]
Sarkar, Ram [1]
Nasipuri, Mita [1]
Affiliations
[1] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata 700032, W Bengal, India
[2] Jadavpur Univ, Dept Informat Technol, Kolkata 700032, W Bengal, India
Keywords
Feature selection; Hybrid Swarm and Gravitation-based Feature Selection; Particle swarm optimization; Gravitational search algorithm; Handwritten script classification; Indic script; IDENTIFICATION; OPTIMIZATION; SEARCH
DOI
10.1007/s40747-020-00237-1
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In any multi-script environment, handwritten script classification is an unavoidable prerequisite before document images are fed to their respective Optical Character Recognition (OCR) engines. Over the years, researchers have tackled this complex pattern classification problem by proposing various feature vectors, most of them high-dimensional, which increases the computational complexity of the whole classification model. Feature Selection (FS) can serve as an intermediate step that reduces the size of the feature vectors by restricting them to the essential and relevant features. In the present work, we address this issue by introducing a new FS algorithm called Hybrid Swarm and Gravitation-based FS (HSGFS). The algorithm is applied to three feature vectors introduced recently in the literature: Distance-Hough Transform (DHT), Histogram of Oriented Gradients (HOG), and Modified log-Gabor (MLG) filter Transform. Three state-of-the-art classifiers, namely Multi-Layer Perceptron (MLP), K-Nearest Neighbour (KNN), and Support Vector Machine (SVM), are used to evaluate the optimal subset of features generated by the proposed FS model. Handwritten datasets at block, text-line, and word level, covering the 12 officially recognized Indic scripts, are prepared for experimentation. On all three datasets, classification accuracy improves by 2-5% on average while using only about 75-80% of the original feature vectors. The proposed method also performs better than some popularly used FS models. The code used to implement HSGFS is available on GitHub: https://github.com/Ritam-Guha/HSGFS.
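The abstract describes a wrapper-style setup: a search procedure proposes feature subsets and a classifier scores them. The sketch below illustrates only the scoring side and is not the authors' HSGFS implementation. It assumes a binary mask per candidate subset, a leave-one-out 1-nearest-neighbour classifier as the evaluator, and a hypothetical weight `alpha` that trades accuracy against subset size; the toy data, function names, and weighting are all illustrative assumptions rather than details taken from the paper.

```python
import random


def loo_1nn_accuracy(X, y, mask):
    """Leave-one-out 1-NN accuracy using only features where mask[j] == 1."""
    idx = [j for j, bit in enumerate(mask) if bit]
    if not idx:
        return 0.0  # an empty subset cannot classify anything
    correct = 0
    for i, xi in enumerate(X):
        best_d, best_label = float("inf"), None
        for k, xk in enumerate(X):
            if k == i:
                continue  # hold out the sample being classified
            d = sum((xi[j] - xk[j]) ** 2 for j in idx)
            if d < best_d:
                best_d, best_label = d, y[k]
        correct += best_label == y[i]
    return correct / len(X)


def fitness(X, y, mask, alpha=0.9):
    """Hypothetical wrapper fitness: reward accuracy, penalise subset size."""
    acc = loo_1nn_accuracy(X, y, mask)
    kept = sum(mask) / len(mask)
    return alpha * acc + (1 - alpha) * (1 - kept)


def make_toy_data(n=40, seed=0):
    """Toy data (assumption): feature 0 is informative, features 1-3 are noise."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = i % 2
        informative = label * 4.0 + rng.gauss(0, 0.3)
        noise = [rng.gauss(0, 3.0) for _ in range(3)]
        X.append([informative] + noise)
        y.append(label)
    return X, y


X, y = make_toy_data()
full_mask = [1, 1, 1, 1]   # keep every feature
small_mask = [1, 0, 0, 0]  # keep only the informative feature
print("full :", fitness(X, y, full_mask))
print("small:", fitness(X, y, small_mask))
```

In a full wrapper method, a search strategy (in the paper, a hybrid of particle swarm optimization and gravitational search) would generate and update candidate masks, calling a scoring function of this kind on each one.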
Pages: 823-839
Page count: 17