A local information-based feature-selection algorithm for data regression

被引:18
|
作者
Peng, Xinjun [1 ,2 ]
Xu, Dong [1 ]
机构
[1] Shanghai Normal Univ, Dept Math, Shanghai 200234, Peoples R China
[2] Sci Comp Key Lab Shanghai Univ, Shanghai 200234, Peoples R China
关键词
Feature selection; Local information; Irrelevant feature; Least squares loss; Gradient descent; Data regression; FEATURE SUBSET-SELECTION; GENE SELECTION; CLASSIFICATION; RELEVANCE;
D O I
10.1016/j.patcog.2013.02.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel feature-selection algorithm for data regression with a lot of irrelevant features. The proposed method is based on well-established machine-learning technique without any assumption about the underlying data distribution. The key idea in this method is to decompose an arbitrarily complex nonlinear problem into a set of locally linear ones through local information, and to learn globally feature relevance within the least squares loss framework. In contrast to other feature-selection algorithms for data regression, the learning of this method is efficient since the solution can be readily found through gradient descent with a simple update rule. Experiments on some synthetic and real-world data sets demonstrate the viability of our formulation of the feature-selection problem and the effectiveness of our algorithm. Crown Copyright (C) 2013 Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:2519 / 2530
页数:12
相关论文
共 50 条
  • [31] Study on mutual information-based feature selection for text categorization
    Xu, Yan
    Jones, Gareth
    Li, Jintao
    Wang, Bin
    Sun, Chunming
    Journal of Computational Information Systems, 2007, 3 (03): : 1007 - 1012
  • [32] Mutual information-based feature selection for intrusion detection systems
    Amiri, Fatemeh
    Yousefi, MohammadMahdi Rezaei
    Lucas, Caro
    Shakery, Azadeh
    Yazdani, Nasser
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2011, 34 (04) : 1184 - 1199
  • [33] Mutual Information-Based Feature Selection and Ensemble Learning for Classification
    Qi, Chengming
    Zhou, Zhangbing
    Wang, Qun
    Hu, Lishuan
    2016 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION AND KNOWLEDGE IN THE INTERNET OF THINGS (IIKI), 2016, : 116 - 121
  • [34] FULL VALIDATION PROCEDURES FOR FEATURE-SELECTION IN CLASSIFICATION AND REGRESSION PROBLEMS
    LANTERI, S
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1992, 15 (2-3) : 159 - 169
  • [35] Heterogeneous feature subset selection using mutual information-based feature transformation
    Wei, Min
    Chow, Tommy W. S.
    Chan, Rosa H. M.
    NEUROCOMPUTING, 2015, 168 : 706 - 718
  • [36] Privileged Information-based Conditional Regression Forest for Facial Feature Detection
    Yang, Heng
    Patras, Ioannis
    2013 10TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG), 2013,
  • [37] Directed-information based feature-selection for tissue-specific sequences
    Rao, Arvind
    Hero, Alfred O., III
    States, David J.
    Engel, James Douglas
    2007 IEEE/SP 14TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2007, : 210 - 214
  • [38] FEATURE-SELECTION BASED ON THE STRUCTURAL INDEXES OF CATEGORIES
    KUDO, M
    SHIMBO, M
    PATTERN RECOGNITION, 1993, 26 (06) : 891 - 901
  • [39] Some results about mutual information-based feature selection and fuzzy discretization of vague data
    Sanchez, Luciano
    Suarez, M. Rosario
    Villar, J. R.
    Couso, Ines
    2007 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-4, 2007, : 1963 - +
  • [40] Feature selection algorithm based on mutual information and lasso for microarray data
    Zhongxin W.
    Gang S.
    Jing Z.
    Jia Z.
    Gang, Sun (ahfysungang@163.com), 1600, Bentham Science Publishers B.V., P.O. Box 294, Bussum, 1400 AG, Netherlands (10): : 278 - 286