A novel feature selection approach based on clustering algorithm

被引:7
|
作者
Moslehi, Fateme [1 ]
Haeri, Abdorrahman [2 ]
机构
[1] Iran Univ Sci & Technol, Informat Technol Engn, Tehran, Iran
[2] Iran Univ Sci & Technol, Sch Ind Engn, Tehran, Iran
关键词
Data mining; clustering; K-means algorithm; feature selection; FEATURE SUBSET-SELECTION; GRAVITATIONAL SEARCH ALGORITHM; PARTICLE SWARM OPTIMIZATION; MUTUAL INFORMATION; CLASSIFICATION; HYBRID; REDUCTION;
D O I
10.1080/00949655.2020.1822358
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Clustering is one of the main methods of data mining. K-means algorithm is one of the most common clustering algorithms due to its efficiency and ease of use. In many data mining issues, the dataset contains a large number of fields and, therefore, the identification of the effective fields is an important issue. Appling the proposed algorithm, the important variables of the dataset would be identified. In the proposed method, the dataset is clustered in several stages and in each step the characteristics of the created clusters are examined and the features that transform the structure of clusters are introduced as effective features of the dataset. The proposed method was examined on 4 datasets and the results of this method were compared with other similar work and demonstrated that using this algorithm would eliminate redundant and unrelated features of the dataset and improve classification accuracy.
引用
收藏
页码:581 / 604
页数:24
相关论文
共 50 条
  • [21] A Novel Clustering-Based Hybrid Feature Selection Approach Using Ant Colony Optimization
    Rajesh Dwivedi
    Aruna Tiwari
    Neha Bharill
    Milind Ratnaparkhe
    Arabian Journal for Science and Engineering, 2023, 48 : 10727 - 10744
  • [22] A Novel Clustering-Based Hybrid Feature Selection Approach Using Ant Colony Optimization
    Dwivedi, Rajesh
    Tiwari, Aruna
    Bharill, Neha
    Ratnaparkhe, Milind
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (08) : 10727 - 10744
  • [23] A Local Feature Selection Approach for Clustering
    Gui, Bing
    FOUNDATIONS OF INTELLIGENT SYSTEMS (ISKE 2011), 2011, 122 : 55 - 62
  • [24] An approach to feature selection based on fuzzy clustering and statistic theory
    Gao, XB
    Ji, HB
    Xie, WX
    ICEMI'99: FOURTH INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS, VOLS 1 AND 2, CONFERENCE PROCEEDINGS, 1999, : 988 - 992
  • [25] Adaptive Data Clustering Ensemble Algorithm Based on Stability Feature Selection and Spectral Clustering
    Li, Zuhong
    Ma, Zhixin
    Ma, Zhicheng
    Yang, Shibo
    2019 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD 2019), 2019, : 277 - 281
  • [26] A new unsupervised feature selection algorithm using similarity-based feature clustering
    Zhu, Xiaoyan
    Wang, Yu
    Li, Yingbin
    Tan, Yonghui
    Wang, Guangtao
    Song, Qinbao
    COMPUTATIONAL INTELLIGENCE, 2019, 35 (01) : 2 - 22
  • [27] A novel peer to peer traffic identification approach based on hybrid feature selection algorithm
    Wang, Z. (wangzling@163.com), 1600, Advanced Institute of Convergence Information Technology (06):
  • [28] Parasitism - Predation algorithm (PPA): A novel approach for feature selection
    Mohamed, Al-Attar A.
    Hassan, S. A.
    Hemeida, A. M.
    Alkhalaf, Salem
    Mahmoud, M. M. M.
    Eldin, Ayman M. Baha
    AIN SHAMS ENGINEERING JOURNAL, 2020, 11 (02) : 293 - 308
  • [29] Binary Peacock Algorithm: A Novel Metaheuristic Approach for Feature Selection
    Banati, Hema
    Sharma, Richa
    Yadav, Asha
    JOURNAL OF CLASSIFICATION, 2024, 41 (02) : 216 - 244
  • [30] A novel filter feature selection algorithm based on relief
    Cui, Xueting
    Li, Ying
    Fan, Jiahao
    Wang, Tan
    APPLIED INTELLIGENCE, 2022, 52 (05) : 5063 - 5081