l2,1 norm regularized multi-kernel based joint nonlinear feature selection and over-sampling for imbalanced data classification

被引:28
|
作者
Cao, Peng [1 ]
Liu, Xiaoli [1 ]
Zhang, Jian [3 ]
Zhao, Dazhe [4 ]
Huang, Min [2 ]
Zaiane, Osmar [5 ]
机构
[1] Northeastern Univ, Coll Comp Sci & Engn, Shenyang, Peoples R China
[2] Northeastern Univ, Coll Informat Sci & Engn, Shenyang, Peoples R China
[3] Nanjing Univ Informat Sci Technol, Sch Comp Software, Nanjing, Peoples R China
[4] Northeastern Univ, Minist Educ, Key Lab Med Image Comp, Shenyang, Peoples R China
[5] Univ Alberta, Comp Sci, Edmonton, AB, Canada
基金
中国国家自然科学基金; 国家高技术研究发展计划(863计划); 美国国家科学基金会;
关键词
Imbalanced data learning; Feature selection; Classification; Multi-kernel learning; Proximal method;
D O I
10.1016/j.neucom.2016.12.036
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High dimensionality and classification of imbalanced data sets are two of the most interesting machine learning challenges. Both issues have been independently studied in the literature. In order to simultaneously explore the both issues of feature selection and oversampling, we efficiently combine two different methodological approaches in an unified kernel framework. Specifically, we proposed a novel l(2,1) norm balanced multiple kernel feature selection (l(2,1) MKFS), and designed a proximal based optimization algorithm for efficiently learning the model. Moreover, multiple kernel oversampling (MKOS) was developed to generate synthetic instances in the optimal kernel space induced by l(2,1) MKFS, so as to compensate for the class imbalanced distribution. Our experimental results on multiple UCI data and two real medical application demonstrate that jointly operating nonlinear feature selection and oversampling with l(2,1) norm multi-kernel learning framework (l(2,1) MKFSOS) can lead to a promising classification performance.
引用
收藏
页码:38 / 57
页数:20
相关论文
共 19 条
  • [1] A l2,1 norm regularized multi-kernel learning for false positive reduction in Lung nodule CAD
    Cao, Peng
    Liu, Xiaoli
    Zhang, Jian
    Li, Wei
    Zhao, Dazhe
    Huang, Min
    Zaiane, Osmar
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2017, 140 : 211 - 231
  • [2] l2,1 Norm regularized fisher criterion for optimal feature selection
    Zhang, Jian
    Yu, Jun
    Wan, Jian
    Zeng, Zhiqiang
    NEUROCOMPUTING, 2015, 166 : 455 - 463
  • [3] An Approach to Imbalanced Data Classification Based on Instance Selection and Over-Sampling
    Czarnowski, Ireneusz
    Jedrzejowicz, Piotr
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, PT I, 2019, 11683 : 601 - 610
  • [4] A multi-kernel based framework for heterogeneous feature selection and over-sampling for computer-aided detection of pulmonary nodules
    Cao, Peng
    Liu, Xiaoli
    Yang, Jinzhu
    Zhao, Dazhe
    Li, Wei
    Huang, Min
    Zaiane, Osmar
    PATTERN RECOGNITION, 2017, 64 : 327 - 346
  • [5] Feature selection and its combination with data over-sampling for multi-class imbalanced datasets
    Tsai, Chih-Fong
    Chen, Kuan-Chen
    Lin, Wei -Chao
    APPLIED SOFT COMPUTING, 2024, 153
  • [6] Preprocessing of Imbalanced Breast Cancer Data using Feature Selection Combined with Over-Sampling Technique for classification
    Jojan, Janjira
    Srivihok, Anongnart
    2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2013, : 407 - 412
  • [7] Robust Feature Selection Method Based on Joint L2,1 Norm Minimization for Sparse Regression
    Yang, Libo
    Zhu, Dawei
    Liu, Xuemei
    Cui, Pei
    ELECTRONICS, 2023, 12 (21)
  • [8] Multi-feature hyperspectral image classification with L2,1 norm constrained joint sparse representation
    Zhang, Chengkun
    Han, Min
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2021, 42 (12) : 4789 - 4808
  • [9] AFNFS: Adaptive fuzzy neighborhood-based feature selection with adaptive synthetic over-sampling for imbalanced data
    Sun, Lin
    Li, Mengmeng
    Ding, Weiping
    Zhang, En
    Mu, Xiaoxia
    Xu, Jiucheng
    INFORMATION SCIENCES, 2022, 612 : 724 - 744
  • [10] Joint L2,1 Norm and Fisher Discrimination Constrained Feature Selection for Rational Synthesis of Microporous Aluminophosphates
    Qi, Miao
    Wang, Ting
    Yi, Yugen
    Gao, Na
    Kong, Jun
    Wang, Jianzhong
    MOLECULAR INFORMATICS, 2017, 36 (04)