Elastic net-based high dimensional data selection for regression

被引:10
|
作者
Chamlal, Hasna [1 ]
Benzmane, Asmaa [1 ]
Ouaderhman, Tayeb [1 ]
机构
[1] Hassan II Univ, Fac Sci Ain Chock, Dept Math & Informat, Fundamental & Appl Math Lab, Casablanca, Morocco
关键词
Feature screening; Regression; Rank correlation; High-dimensional data; Elastic net; VIEW; REGULARIZATION; ALGORITHM;
D O I
10.1016/j.eswa.2023.122958
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High -dimensional feature selection is of particular interest to researchers. In some domains, such as microarray data, it is quite common for a group of highly correlated explanatory variables to be of equal importance for inclusion in the predictive model. This paper proposes a new hybrid feature selection approach that integrates feature screening based on Kendall's tau and Elastic Net regularized regression (K -EN). K -EN as an approach that embeds the Elastic Net, has the advantage of the grouping effect, which automatically includes all the highly correlated variables in the group. The K -EN approach offers insightful solutions to high -dimensional regression problems and improves Elastic Net performance since the screening phase is preceded by a step that further reduces the number of explanatory variables by removing those that disagree with the target based on Kendall's tau. The use of Kendall's tau further enhances Elastic Net performance, as it is robust enough to handle heavy-tailed distributions, non-parametric models, outliers, and non-normal data with greater ease. K -EN is therefore a time-saving approach. The proposed algorithm is evaluated on four simulation scenarios and four publicly available datasets, including riboflavin, eyedata, Longley, and Boston Housing, and achieves 0.2528, 0.0098, 0.1007, and 0.4121 respectively as the Mean Squared Error (MSE). K-EN's MSEs are the best compared to those achieved by the state-of-the-art approaches reviewed in this paper. In addition, K -EN selects up to 100% of relevant features when run on simulated data.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Net-based knowledge communication in groups
    Hesse, Friedrich W.
    ZEITSCHRIFT FUR PSYCHOLOGIE-JOURNAL OF PSYCHOLOGY, 2007, 215 (04): : 207 - 208
  • [43] Petri net-based decision nets
    Simoes, MAS
    Barretto, MRP
    INTELLIGENT MANUFACTURING SYSTEMS 1998 (IMS'98), 1999, : 251 - 256
  • [44] Elastic Net Regression and Empirical Mode Decomposition for Enhancing the Accuracy of the Model Selection
    Al-Jawarneh, Abdullah S.
    Ismail, Mohd Tahir
    Awajan, Ahmad M.
    INTERNATIONAL JOURNAL OF MATHEMATICAL ENGINEERING AND MANAGEMENT SCIENCES, 2021, 6 (02) : 564 - 583
  • [45] ELASTIC NET VARIABLE SELECTION REGULARIZATION METHOD FOR REGRESSION DISCONTINUITY DESIGNS WITH APPLICATION
    Mohammed, Bahr kadhim
    Kadhim, Ashwaq Abdul Sada
    INTERNATIONAL JOURNAL OF AGRICULTURAL AND STATISTICAL SCIENCES, 2021, 17 : 1423 - 1430
  • [46] Multinomial Regression with Elastic Net Penalty and Its Grouping Effect in Gene Selection
    Chen, Liuyuan
    Yang, Jie
    Li, Juntao
    Wang, Xiaoyu
    ABSTRACT AND APPLIED ANALYSIS, 2014,
  • [47] A Petri net-based workflow system
    Li, Xiaofang
    Wang, Congming
    Liang, Y.
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13E : 3726 - 3729
  • [48] Combination of Ensembles of Regularized Regression Models with Resampling-Based Lasso Feature Selection in High Dimensional Data
    Patil, Abhijeet R.
    Kim, Sangjin
    MATHEMATICS, 2020, 8 (01)
  • [49] Data integration by multi-tuning parameter elastic net regression
    Jie Liu
    Gangning Liang
    Kimberly D Siegmund
    Juan Pablo Lewinger
    BMC Bioinformatics, 19
  • [50] Genomic selection using regularized linear regression models: ridge regression, lasso, elastic net and their extensions
    Joseph O Ogutu
    Torben Schulz-Streeck
    Hans-Peter Piepho
    BMC Proceedings, 6 (Suppl 2)