A Hybrid Swarm and Gravitation-based feature selection algorithm for handwritten Indic script classification problem

被引:8
|
作者
Guha, Ritam [1 ]
Ghosh, Manosij [1 ]
Singh, Pawan Kumar [2 ]
Sarkar, Ram [1 ]
Nasipuri, Mita [1 ]
机构
[1] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata 700032, W Bengal, India
[2] Jadavpur Univ, Dept Informat Technol, Kolkata 700032, W Bengal, India
关键词
Feature selection; Hybrid Swarm and Gravitation-based Feature Selection; Particle swarm optimization; Gravitational search algorithm; Handwritten script classification; Indic script; IDENTIFICATION; OPTIMIZATION; SEARCH;
D O I
10.1007/s40747-020-00237-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In any multi-script environment, handwritten script classification is an unavoidable pre-requisite before the document images are fed to their respective Optical Character Recognition (OCR) engines. Over the years, this complex pattern classification problem has been solved by researchers proposing various feature vectors mostly having large dimensions, thereby increasing the computation complexity of the whole classification model. Feature Selection (FS) can serve as an intermediate step to reduce the size of the feature vectors by restricting them only to the essential and relevant features. In the present work, we have addressed this issue by introducing a new FS algorithm, called Hybrid Swarm and Gravitation-based FS (HSGFS). This algorithm has been applied over three feature vectors introduced in the literature recently-Distance-Hough Transform (DHT), Histogram of Oriented Gradients (HOG), and Modified log-Gabor (MLG) filter Transform. Three state-of-the-art classifiers, namely, Multi-Layer Perceptron (MLP), K-Nearest Neighbour (KNN), and Support Vector Machine (SVM), are used to evaluate the optimal subset of features generated by the proposed FS model. Handwritten datasets at block, text line, and word level, consisting of officially recognized 12 Indic scripts, are prepared for experimentation. An average improvement in the range of 2-5% is achieved in the classification accuracy by utilizing only about 75-80% of the original feature vectors on all three datasets. The proposed method also shows better performance when compared to some popularly used FS models. The codes used for implementing HSGFS can be found in the following Github link: https://github.com/Ritam-Guha/HSGFS.
引用
收藏
页码:823 / 839
页数:17
相关论文
共 50 条
  • [41] A quantum feature selection algorithm for multi-classification problem
    Chen, Junxiu
    Liu, Wenjie
    Gao, Peipei
    Wang, Haibin
    2019 INTERNATIONAL CONFERENCE ON INTERNET OF THINGS (ITHINGS) AND IEEE GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) AND IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING (CPSCOM) AND IEEE SMART DATA (SMARTDATA), 2019, : 519 - 525
  • [42] A Hybrid Feature Selection Algorithm For Classification Unbalanced Data Processsing
    Zhang, Xue
    Shi, Zhiguo
    Liu, Xuan
    Li, Xueni
    2018 IEEE INTERNATIONAL CONFERENCE ON SMART INTERNET OF THINGS (SMARTIOT 2018), 2018, : 269 - 275
  • [43] FEATURE EXTRACTION AND SELECTION HYBRID ALGORITHM FOR HYPERSPECTRAL IMAGERY CLASSIFICATION
    Jia, Sen
    Qian, Yuntao
    Li, Jiming
    Liu, Weixiang
    Ji, Zhen
    2010 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2010, : 72 - 75
  • [44] A new hybrid filter/wrapper algorithm for feature selection in classification
    Zhang, Jixiong
    Xiong, Yanmei
    Min, Shungeng
    ANALYTICA CHIMICA ACTA, 2019, 1080 : 43 - 54
  • [45] Set based particle swarm optimization for the feature selection problem
    Engelbrecht, Andries P.
    Grobler, Jacomine
    Langeveld, Joost
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 85 : 324 - 336
  • [46] Feature Selection for Data Classification in the Semiconductor Industry by a Hybrid of Simplified Swarm Optimization
    Yeh, Wei-Chang
    Chu, Chia-Li
    ELECTRONICS, 2024, 13 (12)
  • [47] Intrusion Feature Selection Algorithm Based on Particle Swarm Optimization
    Tong, Lihong
    Wu, Qingtao
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2014, 14 (12): : 40 - 44
  • [48] A Novel Feature Selection Method Based on Salp Swarm Algorithm
    Yan, Chaokun
    Suo, Zhihao
    Guan, Xinyu
    Luo, Huimin
    2021 IEEE INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SOFTWARE ENGINEERING (ICICSE 2021), 2021, : 126 - 130
  • [49] A Forward Search Inspired Particle Swarm Optimization Algorithm for Feature Selection in Classification
    Li, An-Da
    Xue, Bing
    Zhang, Mengjie
    2021 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC 2021), 2021, : 786 - 793
  • [50] A Hybrid Approach for Feature Selection Based on Correlation Feature Selection and Genetic Algorithm
    Rani, Pooja
    Kumar, Rajneesh
    Jain, Anurag
    INTERNATIONAL JOURNAL OF SOFTWARE INNOVATION, 2022, 10 (01)