Regularized (bridge) logistic regression for variable selection based on ROC criterion

被引:0
|
作者
Tian, Guo-Liang [1 ]
Fang, Hong-Bin [2 ]
Liu, Zhenqiu [2 ]
Tan, Ming T. [2 ]
机构
[1] Univ Hong Kong, Dept Stat & Actuarial Sci, Hong Kong, Hong Kong, Peoples R China
[2] Univ Maryland, Greenebaum Canc Ctr, Div Biostat, Baltimore, MD 21201 USA
关键词
AUC; EM algorithm; Lasso regression; Logistic regression; MM algorithm; ROC; Variable/feature selection;
D O I
暂无
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
It is well known that the bridge regression (with tuning parameter less or equal to 1) gives asymptotically unbiased estimates of the nonzero regression parameters while shrinking smaller regression parameters to zero to achieve variable selection. Despite advances in the last several decades in developing such regularized regression models, issues regarding the choice of penalty parameter and the computational methods for models fitting with parameter constraints even for bridge linear regression are still not resolved. In this article, we first propose a new criterion based on an area under the receiver operating characteristic (ROC) curve (AUC) to choose the appropriate penalty parameter as opposed to the conventional generalized cross-validation criterion. The model selected by the AUC criterion is shown to have better predictive accuracy while achieving sparsity simultaneously. We then approach the problem from a constrained parameter model and develop a fast minorization-maximization (MM) algorithm for non-linear optimization with positivity constraints for model fitting. This algorithm is further applied to bridge regression where the regression coefficients are constrained with l(p)-norm with the level of p selected by data for binary responses. Examples of prognostic factors and gene selection are presented to illustrate the proposed method.
引用
收藏
页码:493 / 502
页数:10
相关论文
共 50 条
  • [1] Regularized receiver operating characteristic-based logistic regression for grouped variable selection with composite criterion
    Li, Yang
    Yu, Chenqun
    Qin, Yichen
    Wang, Limin
    Chen, Jiaxu
    Yi, Danhui
    Shia, Ben-Chang
    Ma, Shuangge
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2015, 85 (13) : 2582 - 2595
  • [2] Regularized logistic regression and multiobjective variable selection for classifying MEG data
    Roberto Santana
    Concha Bielza
    Pedro Larrañaga
    Biological Cybernetics, 2012, 106 : 389 - 405
  • [3] Variable Selection in ROC Regression
    Wang, Binhuan
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2013, 2013
  • [4] Regularized logistic regression and multiobjective variable selection for classifying MEG data
    Santana, Roberto
    Bielza, Concha
    Larranaga, Pedro
    BIOLOGICAL CYBERNETICS, 2012, 106 (6-7) : 389 - 405
  • [5] Variable selection for logistic regression using a prediction-focused information criterion
    Claeskens, Gerda
    Croux, Christophe
    Van Kerckhoven, Johan
    BIOMETRICS, 2006, 62 (04) : 972 - 979
  • [6] Comparative study of L1 regularized logistic regression methods for variable selection
    El Guide, M.
    Jbilou, K.
    Koukouvinos, C.
    Lappa, A.
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2022, 51 (09) : 4957 - 4972
  • [7] Variable selection in logistic regression models
    Zellner, D
    Keller, F
    Zellner, GE
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2004, 33 (03) : 787 - 805
  • [8] Bayesian variable selection for logistic regression
    Tian, Yiqing
    Bondell, Howard D.
    Wilson, Alyson
    STATISTICAL ANALYSIS AND DATA MINING, 2019, 12 (05) : 378 - 393
  • [9] Variable selection for sparse logistic regression
    Zanhua Yin
    Metrika, 2020, 83 : 821 - 836
  • [10] Variable Selection in Logistic Regression Model
    Zhang Shangli
    Zhang Lili
    Qiu Kuanmin
    Lu Ying
    Cai Baigen
    CHINESE JOURNAL OF ELECTRONICS, 2015, 24 (04) : 813 - 817