A novel feature selection approach based on clustering algorithm

被引:7
|
作者
Moslehi, Fateme [1 ]
Haeri, Abdorrahman [2 ]
机构
[1] Iran Univ Sci & Technol, Informat Technol Engn, Tehran, Iran
[2] Iran Univ Sci & Technol, Sch Ind Engn, Tehran, Iran
关键词
Data mining; clustering; K-means algorithm; feature selection; FEATURE SUBSET-SELECTION; GRAVITATIONAL SEARCH ALGORITHM; PARTICLE SWARM OPTIMIZATION; MUTUAL INFORMATION; CLASSIFICATION; HYBRID; REDUCTION;
D O I
10.1080/00949655.2020.1822358
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Clustering is one of the main methods of data mining. K-means algorithm is one of the most common clustering algorithms due to its efficiency and ease of use. In many data mining issues, the dataset contains a large number of fields and, therefore, the identification of the effective fields is an important issue. Appling the proposed algorithm, the important variables of the dataset would be identified. In the proposed method, the dataset is clustered in several stages and in each step the characteristics of the created clusters are examined and the features that transform the structure of clusters are introduced as effective features of the dataset. The proposed method was examined on 4 datasets and the results of this method were compared with other similar work and demonstrated that using this algorithm would eliminate redundant and unrelated features of the dataset and improve classification accuracy.
引用
收藏
页码:581 / 604
页数:24
相关论文
共 50 条
  • [31] A novel filter feature selection algorithm based on relief
    Xueting Cui
    Ying Li
    Jiahao Fan
    Tan Wang
    Applied Intelligence, 2022, 52 : 5063 - 5081
  • [32] Feature Selection Approach Based on Whale Optimization Algorithm
    Sharawi, Marwa
    Zawbaa, Hossam M.
    Emary, E.
    Zawbaa, Hossam M.
    2017 NINTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2017, : 163 - 168
  • [33] A Novel Crowding Clustering Algorithm for Unsupervised and Supervised Filter Feature Selection Problem
    Ghanem, Khadoudja
    Layeb, Abdesslem
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024,
  • [34] A feature selection bayesian approach for extracting classification rules with a clustering genetic algorithm
    Hruschka, ER
    Hruschka, ER
    Ebecken, NFF
    APPLIED ARTIFICIAL INTELLIGENCE, 2003, 17 (5-6) : 489 - 506
  • [35] A Clustering Strategy-Based Evolutionary Algorithm for Feature Selection in Classification
    Zhang, Baohang
    Wang, Zigian
    Lei, Zhenyu
    Yu, Jiatianyi
    Jin, Ting
    Gao, Shangce
    ADVANCES AND TRENDS IN ARTIFICIAL INTELLIGENCE. THEORY AND APPLICATIONS, IEA/AIE 2023, PT I, 2023, 13925 : 49 - 59
  • [36] A harmony search algorithm for clustering with feature selection
    Cobos, Carlos
    Leon, Elizabeth
    Mendoza, Martha
    REVISTA FACULTAD DE INGENIERIA-UNIVERSIDAD DE ANTIOQUIA, 2010, (55): : 153 - 164
  • [37] A Novel Hybrid Algorithm for Feature Selection Based on Whale Optimization Algorithm
    Zheng, Yuefeng
    Li, Ying
    Wang, Gang
    Chen, Yupeng
    Xu, Qian
    Fan, Jiahao
    Cui, Xueting
    IEEE ACCESS, 2019, 7 : 14908 - 14923
  • [38] A Novel Approach for Feature Selection Method TF-IDF in Document Clustering
    Patil, Leena. H.
    Atique, Mohammed
    PROCEEDINGS OF THE 2013 3RD IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2013, : 858 - 862
  • [39] A Novel Feature Selection Approach Based on FODPSO and SVM
    Ghamisi, Pedram
    Couceiro, Micael S.
    Benediktsson, Jon Atli
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2015, 53 (05): : 2935 - 2947
  • [40] A Novel Entropy-Based Approach to Feature Selection
    Tu, Chia-Hao
    Li, Chunshien
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2017, PT I, 2017, 10191 : 445 - 454