A mixed-integer exponential cone programming formulation for feature subset selection in logistic regression

被引:1
|
作者
Ahari, Sahand Asgharieh [1 ]
Kocuk, Burak [2 ]
机构
[1] Univ Groningen, Fac Econ & Business, Groningen, Netherlands
[2] Sabanci Univ, Ind Engn Program, Istanbul, Turkiye
关键词
Mixed-integer exponential cone programming; Machine learning; Sparse logistic regression; Classification; VARIABLE SELECTION;
D O I
10.1016/j.ejco.2023.100069
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Logistic regression is one of the widely-used classification tools to construct prediction models. For datasets with a large number of features, feature subset selection methods are considered to obtain accurate and interpretable prediction models, in which irrelevant and redundant features are removed. In this paper, we address the problem of feature subset selection in logistic regression using modern optimization techniques. To this end, we formulate this problem as a mixed-integer exponential cone program (MIEXP). To the best of our knowledge, this is the first time both nonlinear and discrete aspects of the underlying problem are fully considered within an exact optimization framework. We derive different versions of the MIEXP model by the means of regularization and goodness of fit measures including Akaike Information Criterion and Bayesian Information Criterion. Finally, we solve our MIEXP models using the solver MOSEK and evaluate the performance of our different versions over a set of toy examples and benchmark datasets. The results show that our approach is quite successful in obtaining accurate and interpretable pre-diction models compared to other methods from the literature.(c) 2023 The Author(s). Published by Elsevier Ltd on behalf of Association of European Operational Research Societies (EURO). This is an open access article under the CC BY-NC-ND license (http:// creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页数:31
相关论文
共 50 条
  • [1] Feature subset selection for logistic regression via mixed integer optimization
    Toshiki Sato
    Yuichi Takano
    Ryuhei Miyashiro
    Akiko Yoshise
    Computational Optimization and Applications, 2016, 64 : 865 - 880
  • [2] Feature subset selection for logistic regression via mixed integer optimization
    Sato, Toshiki
    Takano, Yuichi
    Miyashiro, Ryuhei
    Yoshise, Akiko
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2016, 64 (03) : 865 - 880
  • [3] Cost-Sensitive Best Subset Selection for Logistic Regression: A Mixed-Integer Conic Optimization Perspective
    Knauer, Ricardo
    Rodner, Erik
    ADVANCES IN ARTIFICIAL INTELLIGENCE, KI 2023, 2023, 14236 : 114 - 129
  • [4] Feature selection for multiclass discrimination via mixed-integer linear programming
    Iannarilli, FJ
    Rubin, PA
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (06) : 779 - 783
  • [5] Feature subset selection for the multinomial logit model via mixed-integer optimization
    Kamiya, Shunsuke
    Miyashiro, Ryuhei
    Takano, Yuichi
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [6] A New Mixed-Integer Linear Programming Formulation for Multiple Responses Regression Clustering
    Angun, Ebru
    Altinoy, Alper
    2019 6TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT 2019), 2019, : 1634 - 1639
  • [7] A nonlinear mixed-integer programming approach for variable selection in linear regression model
    Roozbeh, Mahdi
    Babaie-Kafaki, Saman
    Aminifard, Zohre
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (11) : 5434 - 5445
  • [8] Selection of vendors - A mixed-integer programming approach
    Kasilingam, RG
    Lee, CP
    COMPUTERS & INDUSTRIAL ENGINEERING, 1996, 31 (1-2) : 347 - 350
  • [9] Combining regression and mixed-integer programming to model counterinsurgency
    Marvin L. King
    David R. Galbreath
    Alexandra M. Newman
    Amanda S. Hering
    Annals of Operations Research, 2020, 292 : 287 - 320
  • [10] Combining regression and mixed-integer programming to model counterinsurgency
    King, Marvin L.
    Galbreath, David R.
    Newman, Alexandra M.
    Hering, Amanda S.
    ANNALS OF OPERATIONS RESEARCH, 2020, 292 (01) : 287 - 320