A sub-concept-based feature selection method for one-class classification

Times cited: 2
Authors
Liu, Zhen [1, 2, 5]
Japkowicz, Nathalie [2]
Wang, Ruoyu [3, 4]
Liu, Li [2, 6]
Affiliations
[1] Guangdong Pharmaceut Univ, Sch Med Informat Engn, Guangzhou 510006, Peoples R China
[2] Amer Univ, Dept Comp Sci, Washington, DC 20016 USA
[3] South China Univ Technol, Informat & Network Engn & Res Ctr, Guangzhou 510041, Peoples R China
[4] Commun & Comp Network Lab Guangdong, Guangzhou 510041, Peoples R China
[5] Guangdong Prov Precise Med & Big Data Engn Techno, Guangzhou 510006, Peoples R China
[6] Huizhou Univ, Dept Comp Sci, Huizhou 516007, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
One-class classification; Filter-based feature selection; Sub-concept; Multimodal data; Outlier detection; Cyber security; Data complexity
DOI
10.1007/s00500-020-04828-5
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Like binary classification methods, one-class classification methods can benefit from feature selection. However, feature selection algorithms designed for binary or multi-class problems are not applicable to one-class classification, since only one class of instances is available for training. Few feature selection techniques have been proposed so far for one-class classification. This paper focuses on designing a filter-based feature selection method for one-class classification. Our approach is based on the observation that, for some tasks such as outlier detection and anomaly detection, the training data (normal data) may contain multiple sub-concepts. Sub-concepts are a source of data complexity. Our approach searches for the features that make the instances of each sub-concept more compact, so as to reduce the data complexity. It first finds the sub-concepts using a clustering algorithm with a fixed number of clusters and then applies combined feature measures to evaluate the relevance between each feature and the sub-concepts. A fixed number of features, those with the highest relevance scores, are selected as a feature subset. During the search, the Davies-Bouldin Index (DBI) is used to assess the data complexity of the sub-concepts obtained with different numbers of clusters. The feature subset with the lowest DBI is selected as the final feature subset. Experiments on UCI benchmark and cyber security datasets demonstrate that our feature selection algorithm can select relevant features and improve the performance of one-class classification on multimodal data.
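The search loop described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not specify the clustering algorithm or the "combined feature measures", so plain k-means and a simple between-cluster over within-cluster variation ratio are used here as stand-ins, and the function names (`kmeans`, `relevance`, `select_features`) and the toy data are hypothetical.

```python
import random
from math import dist

def kmeans(X, k, iters=50, seed=0):
    """Plain Lloyd's k-means (an assumed clustering choice); returns one label per row."""
    centers = random.Random(seed).sample(X, k)
    labels = [0] * len(X)
    for _ in range(iters):
        labels = [min(range(k), key=lambda c: dist(x, centers[c])) for x in X]
        for c in range(k):
            members = [x for x, l in zip(X, labels) if l == c]
            if members:  # an empty cluster keeps its previous center
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def group_by_label(rows, labels):
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(label, []).append(row)
    return groups

def davies_bouldin(X, labels):
    """Davies-Bouldin Index; lower means more compact, better separated clusters."""
    groups = group_by_label(X, labels)
    if len(groups) < 2:
        return float("inf")
    cents = {l: [sum(col) / len(g) for col in zip(*g)] for l, g in groups.items()}
    scatter = {l: sum(dist(x, cents[l]) for x in g) / len(g) for l, g in groups.items()}
    total = 0.0
    for i in groups:
        ratios = []
        for j in groups:
            if j != i:
                d = dist(cents[i], cents[j])
                ratios.append((scatter[i] + scatter[j]) / d if d > 0 else float("inf"))
        total += max(ratios)
    return total / len(groups)

def relevance(X, labels, f):
    """Assumed stand-in score: between-cluster over within-cluster variation of feature f."""
    values = [x[f] for x in X]
    overall = sum(values) / len(values)
    between = within = 0.0
    for g in group_by_label(values, labels).values():
        mean = sum(g) / len(g)
        between += len(g) * (mean - overall) ** 2
        within += sum((v - mean) ** 2 for v in g)
    return between / (within + 1e-12)

def select_features(X, n_features, cluster_numbers=(2, 3, 4)):
    """Return (dbi, feature_indices, k) for the candidate subset with the lowest DBI."""
    best = None
    for k in cluster_numbers:
        labels = kmeans(X, k)  # sub-concepts for this cluster number
        ranked = sorted(range(len(X[0])), key=lambda f: relevance(X, labels, f), reverse=True)
        subset = sorted(ranked[:n_features])  # fixed number of top-scoring features
        X_sub = [[x[f] for f in subset] for x in X]
        dbi = davies_bouldin(X_sub, labels)  # complexity of sub-concepts on the subset
        if best is None or dbi < best[0]:
            best = (dbi, subset, k)
    return best

# Toy "normal" data (hypothetical): features 0 and 1 carry three sub-concepts,
# features 2 and 3 are uniform noise.
rng = random.Random(1)
X = []
for center in (0.0, 5.0, 10.0):
    for _ in range(40):
        X.append([rng.gauss(center, 0.3), rng.gauss(center, 0.3), rng.random(), rng.random()])

dbi, subset, k = select_features(X, n_features=2)
print(subset, k, round(dbi, 3))
```

In this sketch the DBI is computed on the data restricted to each candidate subset, using the cluster labels found on the full feature space; whether the paper evaluates it exactly this way is an assumption.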
Pages: 7047-7062
Number of pages: 16