Feature and Subfeature Selection for Classification Using Correlation Coefficient and Fuzzy Model

被引:15
|
作者
Bhuyan, Hemanta Kumar [1 ]
Chakraborty, Chinmay [2 ]
Pani, Subhendu Kumar [3 ]
Ravi, Vinayakumar [4 ]
机构
[1] Technol & Res Deemed Univ, Vignans Fdn Sci, Dept Informat Technol, Vejendla 522213, Andhra Pradesh, India
[2] Birla Inst Technol Mesra, Elect & Commun Engn, Jharhand 835215, India
[3] Biju Patnaik Univ Technol, Orissa Engn Coll, Dept Comp Sci & Engn, Rourkela 769004, Odisha, India
[4] Prince Mohammad Bin Fahd Univ, Ctr Artificial Intelligence, Khobar 34754, Saudi Arabia
关键词
Feature extraction; Correlation; Redundancy; Databases; Data models; Data mining; Task analysis; Classification; correlation coefficient; data mining; feature selection; fuzzy model; UNSUPERVISED FEATURE-SELECTION; SUB-FEATURE SELECTION;
D O I
10.1109/TEM.2021.3065699
中图分类号
F [经济];
学科分类号
02 ;
摘要
This article presents an analysis of data extraction for classification using correlation coefficient and fuzzy model. Several traditional methods of data extraction are used for classification that could not provide sufficient information for further step of data analysis on class. It needs refinement of features data to distinguish a class that differs from a traditional class. Thus, it proposes the feature tiny data (subfeature data) to find distinguish class from a traditional class using two methods such as correlation coefficient and fuzzy model to select features as well as subfeature for distinguishing class. In the first approach, the correlation coefficient methods with gradient descent technique are used to select features from the dataset and in the second approach, the fuzzy model with supreme of minimum value is considered to get subfeature data. As per the proposed model, some features (i.e., three features from the acoustic dataset, two features from the QCM dataset, and eight features from the audit dataset, etc.) and subfeatures (as per threshold value like 20 for acoustic; 10 for QCM, and 20 for audit, etc.) are selected based on correlation coefficient as well as fuzzy methods, respectively. Further, the probability approach is used to find the association and availability of subfeature data from the dimensional reduced database. The experimental results show the proposed framework identifies and selects both feature and subfeature data with the effectiveness of the new class. The comparison results of several classifiers on several datasets are explained in the experimental section.
引用
收藏
页码:1655 / 1669
页数:15
相关论文
共 50 条
  • [31] A fuzzy neural network for pattern classification and feature selection
    Li, RP
    Mukaidono, M
    Turksen, IB
    FUZZY SETS AND SYSTEMS, 2002, 130 (01) : 101 - 108
  • [32] Relative Fuzzy Rough Approximations for Feature Selection and Classification
    An, Shuang
    Zhao, Enhui
    Wang, Changzhong
    Guo, Ge
    Zhao, Suyun
    Li, Piyu
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (04) : 2200 - 2210
  • [33] Feature Selection for Text Classification Based on Gini Coefficient of Inequality
    Singh, Sanasam Ranbir
    Murthy, Hema A.
    Gonsalves, Timothy A.
    PROCEEDINGS OF THE FOURTH INTERNATIONAL WORKSHOP ON FEATURE SELECTION IN DATA MINING, 2010, 10 : 76 - 85
  • [34] Feature Selection Using Maximum Feature Tree Embedded with Mutual Information and Coefficient of Variation for Bird Sound Classification
    Xu, Haifeng
    Zhang, Yan
    Liu, Jiang
    Lv, Danjv
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [35] Design of Reinforced Fuzzy Model Driven to Feature Selection Through Univariable-Based Correlation and Multivariable-Based Determination Coefficient Analysis
    Kim, Eun-Hu
    Oh, Sung-Kwun
    Pedrycz, Witold
    Fu, Zunwei
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2022, 30 (10) : 4224 - 4238
  • [36] On the benefit of feature selection and ensemble feature selection for fuzzy k-nearest neighbor classification
    Lohrmann, Christoph
    Lohrmann, Alena
    Kumbure, Mahinda Mailagaha
    APPLIED SOFT COMPUTING, 2025, 171
  • [37] Text Classification Using Correlation Based Feature Selection on Multi-layer ELM Feature Space
    Roul, Rajendra Kumar
    Sahoo, Jajati Keshari
    Satyanath, Gaurav
    DISTRIBUTED COMPUTING AND INTELLIGENT TECHNOLOGY, ICDCIT 2023, 2023, 13776 : 355 - 361
  • [38] Classification using feature interval selection
    Chiu, DKY
    Buczynski, BJ
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2003, : 137 - 141
  • [39] Hybrid Classification Model of Correlation-based Feature Selection and Support Vector Machine
    Dubey, Vimal Kumar
    Saxena, Amit Kumar
    2016 IEEE INTERNATIONAL CONFERENCE ON CURRENT TRENDS IN ADVANCED COMPUTING (ICCTAC), 2016,
  • [40] Feature Subset Selection Using Information Energy and Correlation Coefficients of Hesitant Fuzzy Sets
    Ebrahimpour, Mohammad Kazem
    Eftekhari, Mahdi
    2015 7TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2015,