A Splicing Approach to Best Subset of Groups Selection

被引:10
|
作者
Zhang, Yanhang [1 ,2 ]
Zhu, Junxian [3 ]
Zhu, Jin [2 ]
Wang, Xueqin [4 ]
机构
[1] Renmin Univ China, Sch Stat, Beijing 100872, Peoples R China
[2] Sun Yat Sen Univ, Southern China Ctr Stat Sci, Sch Math, Dept Stat Sci, Guangzhou 510275, Peoples R China
[3] Natl Univ Singapore, Saw Swee Hock Sch Publ Hlth, Singapore 117549, Singapore
[4] Univ Sci & Technol China, Int Inst Finance, Sch Management, Dept Stat & Finance, Hefei 230026, Peoples R China
基金
中国国家自然科学基金;
关键词
best subset of groups selection; group splicing; group information criterion; selection consistency of subset of groups; polynomial computational complexity; VARIABLE SELECTION; GENE-EXPRESSION; MODEL SELECTION; REGRESSION; SPARSITY; LASSO; RECOVERY; SIGNALS; UNION;
D O I
10.1287/ijoc.2022.1241
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Best subset of groups selection (BSGS) is the process of selecting a small part of nonoverlapping groups to achieve the best interpretability on the response variable. It has attracted increasing attention and has far-reaching applications in practice. However, due to the computational intractability of BSGS in high-dimensional settings, developing efficient algorithms for solving BSGS remains a research hotspot. In this paper, we propose a group -splicing algorithm that iteratively detects the relevant groups and excludes the irrelevant ones. Moreover, coupled with a novel group information criterion, we develop an adaptive algorithm to determine the optimal model size. Under certain conditions, it is certifiable that our algorithm can identify the optimal subset of groups in polynomial time with high probability. Finally, we demonstrate the efficiency and accuracy of our methods by compar-ing them with several state-of-the-art algorithms on both synthetic and real-world data sets.
引用
收藏
页码:104 / 119
页数:17
相关论文
共 50 条
  • [21] A Heuristic Approach for Selecting Best-Subset Including Ranking Within the Subset
    Choi, Seon Han
    Kim, Tag Gon
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (10): : 3852 - 3862
  • [22] Best subset selection via cross-validation criterion
    Takano, Yuichi
    Miyashiro, Ryuhei
    TOP, 2020, 28 (02) : 475 - 488
  • [23] Best subset selection via cross-validation criterion
    Yuichi Takano
    Ryuhei Miyashiro
    TOP, 2020, 28 : 475 - 488
  • [24] Variable selection in linear regression models: Choosing the best subset is not always the best choice
    Hanke, Moritz
    Dijkstra, Louis
    Foraita, Ronja
    Didelez, Vanessa
    BIOMETRICAL JOURNAL, 2023,
  • [25] Variable selection in linear regression models: Choosing the best subset is not always the best choice
    Hanke, Moritz
    Dijkstra, Louis
    Foraita, Ronja
    Didelez, Vanessa
    BIOMETRICAL JOURNAL, 2024, 66 (01)
  • [26] SIMULTANEOUS SELECTION OF EXTREME POPULATIONS - A SUBSET-SELECTION APPROACH
    MISHRA, SN
    DUDEWICZ, EJ
    BIOMETRICAL JOURNAL, 1987, 29 (04) : 471 - 483
  • [27] SIMULTANEOUS SELECTION OF EXTREME POPULATIONS - A SUBSET-SELECTION APPROACH
    MISHRA, SN
    DUDEWICZ, EJ
    BIOMETRICS, 1983, 39 (03) : 807 - 807
  • [28] Best Subset Selection for Double-Threshold-Variable Autoregressive Moving-Average Models: The Bayesian Approach
    Zheng, Xiaobing
    Liang, Kun
    Xia, Qiang
    Zhang, Dabin
    COMPUTATIONAL ECONOMICS, 2022, 59 (03) : 1175 - 1201
  • [29] Best Subset Selection for Double-Threshold-Variable Autoregressive Moving-Average Models: The Bayesian Approach
    Xiaobing Zheng
    Kun Liang
    Qiang Xia
    Dabin Zhang
    Computational Economics, 2022, 59 : 1175 - 1201
  • [30] A Bayesian Approach for Subset Selection in Contextual Bandits
    Li, Jialian
    Du, Chao
    Zhu, Jun
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 8384 - 8391