Regularized (bridge) logistic regression for variable selection based on ROC criterion

被引:0
|
作者
Tian, Guo-Liang [1 ]
Fang, Hong-Bin [2 ]
Liu, Zhenqiu [2 ]
Tan, Ming T. [2 ]
机构
[1] Univ Hong Kong, Dept Stat & Actuarial Sci, Hong Kong, Hong Kong, Peoples R China
[2] Univ Maryland, Greenebaum Canc Ctr, Div Biostat, Baltimore, MD 21201 USA
关键词
AUC; EM algorithm; Lasso regression; Logistic regression; MM algorithm; ROC; Variable/feature selection;
D O I
暂无
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
It is well known that the bridge regression (with tuning parameter less or equal to 1) gives asymptotically unbiased estimates of the nonzero regression parameters while shrinking smaller regression parameters to zero to achieve variable selection. Despite advances in the last several decades in developing such regularized regression models, issues regarding the choice of penalty parameter and the computational methods for models fitting with parameter constraints even for bridge linear regression are still not resolved. In this article, we first propose a new criterion based on an area under the receiver operating characteristic (ROC) curve (AUC) to choose the appropriate penalty parameter as opposed to the conventional generalized cross-validation criterion. The model selected by the AUC criterion is shown to have better predictive accuracy while achieving sparsity simultaneously. We then approach the problem from a constrained parameter model and develop a fast minorization-maximization (MM) algorithm for non-linear optimization with positivity constraints for model fitting. This algorithm is further applied to bridge regression where the regression coefficients are constrained with l(p)-norm with the level of p selected by data for binary responses. Examples of prognostic factors and gene selection are presented to illustrate the proposed method.
引用
收藏
页码:493 / 502
页数:10
相关论文
共 50 条
  • [31] A Forward Variable Selection Method for Fuzzy Logistic Regression
    Fatemeh Salmani
    Seyed Mahmoud Taheri
    Alireza Abadi
    International Journal of Fuzzy Systems, 2019, 21 : 1259 - 1269
  • [32] Variable Selection for Sparse Logistic Regression with Grouped Variables
    Zhong, Mingrui
    Yin, Zanhua
    Wang, Zhichao
    MATHEMATICS, 2023, 11 (24)
  • [33] Variable selection in Logistic regression model with genetic algorithm
    Zhang, Zhongheng
    Trevino, Victor
    Hoseini, Sayed Shahabuddin
    Belciug, Smaranda
    Boopathi, Arumugam Manivanna
    Zhang, Ping
    Gorunescu, Florin
    Subha, Velappan
    Dai, Songshi
    ANNALS OF TRANSLATIONAL MEDICINE, 2018, 6 (03)
  • [34] A Forward Variable Selection Method for Fuzzy Logistic Regression
    Salmani, Fatemeh
    Taheri, Seyed Mahmoud
    Abadi, Alireza
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2019, 21 (04) : 1259 - 1269
  • [35] Application of Bayesian variable selection in logistic regression model
    Bangchang, Kannat Na
    AIMS MATHEMATICS, 2024, 9 (05): : 13336 - 13345
  • [36] AN EXPLORATORY CRITERION FOR VARIABLE SELECTION IN LP-LOGISTIC DISCRIMINATION
    KRUSINSKA, E
    APPLIED MATHEMATICS LETTERS, 1993, 6 (01) : 39 - 42
  • [37] VARIABLE SELECTION AND COEFFICIENT ESTIMATION VIA REGULARIZED RANK REGRESSION
    Leng, Chenlei
    STATISTICA SINICA, 2010, 20 (01) : 167 - 181
  • [38] A joint regression variable and autoregressive order selection criterion
    Shi, PD
    Tsai, CL
    JOURNAL OF TIME SERIES ANALYSIS, 2004, 25 (06) : 923 - 941
  • [39] Variable Selection for Varying Coefficient Models Via Kernel Based Regularized Rank Regression
    Wang, Kang-ning
    Lin, Lu
    ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2020, 36 (02): : 458 - 470
  • [40] Variable Selection for Varying Coefficient Models Via Kernel Based Regularized Rank Regression
    Kang-ning Wang
    Lu Lin
    Acta Mathematicae Applicatae Sinica, English Series, 2020, 36 : 458 - 470