Robust and sparse logistic regression

被引:0
|
作者
Cornilly, Dries [1 ,3 ]
Tubex, Lise [2 ]
Van Aelst, Stefan [1 ]
Verdonck, Tim [1 ,2 ]
机构
[1] Katholieke Univ Leuven, Dept Math, Celestijnenlaan 200B, B-3001 Leuven, Belgium
[2] Univ Antwerp, imec, Dept Math, Middelheimlaan 1, B-2020 Antwerp, Belgium
[3] Asteria IM, Rue Lausanne 15, CH-1202 Geneva, Switzerland
关键词
Elastic net; gamma-divergence; Logistic regression; Robustness; Sparsity; VARIABLE SELECTION; REGULARIZATION; MODEL;
D O I
10.1007/s11634-023-00572-4
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Logistic regression is one of the most popular statistical techniques for solving (binary) classification problems in various applications (e.g. credit scoring, cancer detection, ad click predictions and churn classification). Typically, the maximum likelihood estimator is used, which is very sensitive to outlying observations. In this paper, we propose a robust and sparse logistic regression estimator where robustness is achieved by means of the gamma-divergence. An elastic net penalty ensures sparsity in the regression coefficients such that the model is more stable and interpretable. We show that the influence function is bounded and demonstrate its robustness properties in simulations. The good performance of the proposed estimator is also illustrated in an empirical application that deals with classifying the type of fuel used by cars.
引用
收藏
页码:663 / 679
页数:17
相关论文
共 50 条
  • [21] Robust 1-bit Compressed Sensing and Sparse Logistic Regression: A Convex Programming Approach
    Plan, Yaniv
    Vershynin, Roman
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2013, 59 (01) : 482 - 494
  • [22] Large-Scale Sparse Logistic Regression
    Liu, Jun
    Chen, Jianhui
    Ye, Jieping
    KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 547 - 555
  • [23] Leukemia Prediction Using Sparse Logistic Regression
    Manninen, Tapio
    Huttunen, Heikki
    Ruusuvuori, Pekka
    Nykter, Matti
    PLOS ONE, 2013, 8 (08):
  • [24] Differentially Private Logistic Regression with Sparse Solutions
    Khanna, Amol
    Lu, Fred
    Raff, Edward
    Testa, Brian
    PROCEEDINGS OF THE 16TH ACM WORKSHOP ON ARTIFICIAL INTELLIGENCE AND SECURITY, AISEC 2023, 2023, : 1 - 9
  • [25] A Safe Screening Rule for Sparse Logistic Regression
    Wang, Jie
    Zhou, Jiayu
    Liu, Jun
    Wonka, Peter
    Ye, Jieping
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [26] Approximate Sparse Multinomial Logistic Regression for Classification
    Kayabol, Koray
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (02) : 490 - 493
  • [27] Logistic Regression Under Sparse Data Conditions
    Walker, David A.
    Smith, Thomas J.
    JOURNAL OF MODERN APPLIED STATISTICAL METHODS, 2019, 18 (02)
  • [28] Distributed Parallel Sparse Multinomial Logistic Regression
    Lei, Dajiang
    Du, Meng
    Chen, Hao
    Li, Zhixing
    Wu, Yu
    IEEE ACCESS, 2019, 7 : 55496 - 55508
  • [29] Logistic regression with sparse common and distinctive covariates
    S. Park
    E. Ceulemans
    K. Van Deun
    Behavior Research Methods, 2023, 55 : 4143 - 4174
  • [30] Multiclass Classification by Sparse Multinomial Logistic Regression
    Abramovich, Felix
    Grinshtein, Vadim
    Levy, Tomer
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2021, 67 (07) : 4637 - 4646