A sub-concept-based feature selection method for one-class classification

Cited by: 2
Authors
Liu, Zhen [1, 2, 5]
Japkowicz, Nathalie [2]
Wang, Ruoyu [3, 4]
Liu, Li [2, 6]
Affiliations
[1] Guangdong Pharmaceut Univ, Sch Med Informat Engn, Guangzhou 510006, Peoples R China
[2] Amer Univ, Dept Comp Sci, Washington, DC 20016 USA
[3] South China Univ Technol, Informat & Network Engn & Res Ctr, Guangzhou 510041, Peoples R China
[4] Commun & Comp Network Lab Guangdong, Guangzhou 510041, Peoples R China
[5] Guangdong Prov Precise Med & Big Data Engn Techno, Guangzhou 510006, Peoples R China
[6] Huizhou Univ, Dept Comp Sci, Huizhou 516007, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
One-class classification; Filter-based feature selection; Sub-concept; Multimodal data; Outlier detection; Cyber security; Data complexity
DOI
10.1007/s00500-020-04828-5
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Like binary classification methods, one-class classification methods can benefit from feature selection. However, feature selection algorithms designed for binary or multi-class classification are not applicable to one-class classification, since only one class of instances is available for training. Few techniques have been proposed so far for feature selection in one-class classification. This paper focuses on designing a filter-based feature selection method for one-class classification. Our approach is based on the observation that for some tasks, such as outlier detection and anomaly detection, the training data (normal data) may contain multiple sub-concepts. These sub-concepts are a source of data complexity. Our approach searches for the features that make the instances of each sub-concept more compact, so as to reduce the data complexity. It first finds the sub-concepts using a clustering algorithm with a fixed number of clusters and then applies combined feature measures to evaluate the relevance between each feature and the sub-concepts. A fixed number of features, those with the highest relevance scores, are selected as a feature subset. During the search, the Davies-Bouldin Index (DBI) is used to assess the data complexity of the sub-concepts obtained with different numbers of clusters. The feature subset with the lowest DBI is selected as the final feature subset. Experiments on UCI benchmark and cyber security datasets demonstrate that our feature selection algorithm can select relevant features and improve the performance of one-class classification on multimodal data.
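The search loop described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not specify the clustering algorithm or the "combined feature measures", so the sketch substitutes a plain k-means with farthest-first seeding and a between/within variance ratio per feature. The function names (`kmeans_labels`, `relevance`, `davies_bouldin`, `select_features`, `make_demo_data`) and the candidate cluster counts are hypothetical, and computing the DBI in the selected feature space is one possible reading of the abstract.

```python
import numpy as np


def kmeans_labels(X, k, n_iter=100):
    """Plain Lloyd's k-means (assumed clustering step); returns a label per row."""
    # Farthest-first seeding: deterministic, spreads the k seeds over the data.
    centers = [X[0]]
    for _ in range(k - 1):
        d = np.min([np.linalg.norm(X - c, axis=1) for c in centers], axis=0)
        centers.append(X[d.argmax()])
    centers = np.array(centers)
    for _ in range(n_iter):
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Keep the old center if a cluster happens to become empty.
        new_centers = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels


def relevance(X, labels):
    """Stand-in relevance score: between- over within-cluster variance per feature."""
    overall = X.mean(axis=0)
    between = np.zeros(X.shape[1])
    within = np.zeros(X.shape[1])
    for j in np.unique(labels):
        part = X[labels == j]
        between += len(part) * (part.mean(axis=0) - overall) ** 2
        within += ((part - part.mean(axis=0)) ** 2).sum(axis=0)
    return between / (within + 1e-12)


def davies_bouldin(X, labels):
    """Davies-Bouldin Index: lower values mean more compact, better separated clusters."""
    ids = np.unique(labels)
    centers = np.array([X[labels == j].mean(axis=0) for j in ids])
    scatter = np.array([
        np.linalg.norm(X[labels == j] - c, axis=1).mean()
        for j, c in zip(ids, centers)
    ])
    sep = np.linalg.norm(centers[:, None, :] - centers[None, :, :], axis=2)
    np.fill_diagonal(sep, np.inf)  # a cluster is never compared with itself
    ratios = (scatter[:, None] + scatter[None, :]) / sep
    return ratios.max(axis=1).mean()


def select_features(X, n_features, cluster_counts=(2, 3, 4, 5)):
    """Try each cluster count and keep the feature subset with the lowest DBI."""
    best = None
    for k in cluster_counts:
        labels = kmeans_labels(X, k)                      # sub-concepts
        scores = relevance(X, labels)                     # feature relevance
        top = np.sort(np.argsort(scores)[::-1][:n_features])
        dbi = davies_bouldin(X[:, top], labels)           # complexity on the subset
        if best is None or dbi < best[0]:
            best = (dbi, k, top)
    return best  # (dbi, number of clusters, selected feature indices)


def make_demo_data(seed=0, n=100):
    """Hypothetical data: two sub-concepts in features 0-1, three noise features."""
    rng = np.random.default_rng(seed)
    centers = np.array([[0.0, 0.0], [20.0, 20.0]])
    informative = np.vstack(
        [c + rng.normal(0.0, 0.5, size=(n, 2)) for c in centers]
    )
    noise = rng.uniform(-1.0, 1.0, size=(2 * n, 3))
    return np.hstack([informative, noise])
```

Calling `select_features(make_demo_data(), n_features=2)` returns the DBI, the chosen number of clusters, and the indices of the selected features. The selected columns would then be passed to whichever one-class classifier is being trained.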
Pages: 7047-7062
Number of pages: 16
Related papers
  • [1] A sub-concept-based feature selection method for one-class classification
    Zhen Liu
    Nathalie Japkowicz
    Ruoyu Wang
    Li Liu
    Soft Computing, 2020, 24 : 7047 - 7062
  • [2] A New Feature Selection Method for One-Class Classification Problems
    Jeong, Young-Seon
    Kang, In-Ho
    Jeong, Myong-Kee
    Kong, Dongjoon
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06) : 1500 - 1509
  • [3] Filter Feature Selection for One-Class Classification
    Luiz H N Lorena
    André C P L F Carvalho
    Ana C Lorena
    Journal of Intelligent & Robotic Systems, 2015, 80 : 227 - 243
  • [4] Filter Feature Selection for One-Class Classification
    Lorena, Luiz H. N.
    Carvalho, Andre C. P. L. F.
    Lorena, Ana C.
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2015, 80 : S227 - S243
  • [5] A Novel One-Class Classification Method Based on Feature Analysis and Prototype Reduction
    Cabral, George Gomes
    Inacio de Oliveira, Adriano Lorena
    2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011 : 983 - 988
  • [6] Feature extraction for one-class classification
    Tax, DMJ
    Müller, KR
    ARTIFICIAL NEURAL NETWORKS AND NEURAL INFORMATION PROCESSING - ICANN/ICONIP 2003, 2003, 2714 : 342 - 349
  • [7] Feature variance regularization method for autoencoder-based one-class classification
    Kim, Boeun
    Ryu, Kyung Hwan
    Kim, Ji Hee
    Heo, Seongmin
    COMPUTERS & CHEMICAL ENGINEERING, 2022, 161
  • [8] One-Class Oriented Feature Selection and Classification of Heterogeneous Remote Sensing Images
    Hossain, Md. Ali
    Jia, Xiuping
    Benediktsson, Jon Atli
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2016, 9 (04) : 1606 - 1612
  • [9] A consistency-based model selection for one-class classification
    Tax, DMJ
    Müller, KR
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, 2004 : 363 - 366
  • [10] Dynamic classifier selection for one-class classification
    Krawczyk, Bartosz
    Wozniak, Michal
    KNOWLEDGE-BASED SYSTEMS, 2016, 107 : 43 - 53