A Splicing Approach to Best Subset of Groups Selection

被引:10
|
作者
Zhang, Yanhang [1 ,2 ]
Zhu, Junxian [3 ]
Zhu, Jin [2 ]
Wang, Xueqin [4 ]
机构
[1] Renmin Univ China, Sch Stat, Beijing 100872, Peoples R China
[2] Sun Yat Sen Univ, Southern China Ctr Stat Sci, Sch Math, Dept Stat Sci, Guangzhou 510275, Peoples R China
[3] Natl Univ Singapore, Saw Swee Hock Sch Publ Hlth, Singapore 117549, Singapore
[4] Univ Sci & Technol China, Int Inst Finance, Sch Management, Dept Stat & Finance, Hefei 230026, Peoples R China
基金
中国国家自然科学基金;
关键词
best subset of groups selection; group splicing; group information criterion; selection consistency of subset of groups; polynomial computational complexity; VARIABLE SELECTION; GENE-EXPRESSION; MODEL SELECTION; REGRESSION; SPARSITY; LASSO; RECOVERY; SIGNALS; UNION;
D O I
10.1287/ijoc.2022.1241
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Best subset of groups selection (BSGS) is the process of selecting a small part of nonoverlapping groups to achieve the best interpretability on the response variable. It has attracted increasing attention and has far-reaching applications in practice. However, due to the computational intractability of BSGS in high-dimensional settings, developing efficient algorithms for solving BSGS remains a research hotspot. In this paper, we propose a group -splicing algorithm that iteratively detects the relevant groups and excludes the irrelevant ones. Moreover, coupled with a novel group information criterion, we develop an adaptive algorithm to determine the optimal model size. Under certain conditions, it is certifiable that our algorithm can identify the optimal subset of groups in polynomial time with high probability. Finally, we demonstrate the efficiency and accuracy of our methods by compar-ing them with several state-of-the-art algorithms on both synthetic and real-world data sets.
引用
收藏
页码:104 / 119
页数:17
相关论文
共 50 条
  • [1] An approach to direct selection of best subset AR model
    Sarkar, A
    Sharma, KMS
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 1997, 56 (03) : 273 - 291
  • [2] A Mixed-Integer Fractional Optimization Approach to Best Subset Selection
    Gomez, Andres
    Prokopyev, Oleg A.
    INFORMS JOURNAL ON COMPUTING, 2021, 33 (02) : 551 - 565
  • [3] SELECTION OF BEST SUBSET IN REGRESSION ANALYSIS
    HOCKING, RR
    LESLIE, RN
    TECHNOMETRICS, 1967, 9 (04) : 531 - &
  • [4] BEST-SUBSET SELECTION PROCEDURE
    Wang, Yu
    Luangkesorn, Louis
    Shuman, Larry J.
    PROCEEDINGS OF THE 2011 WINTER SIMULATION CONFERENCE (WSC), 2011, : 4310 - 4318
  • [5] On Sure Early Selection of the Best Subset
    Zhu, Ziwei
    Wu, Shihao
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2024, 70 (12) : 8870 - 8891
  • [6] SELECTION OF BEST SUBSET IN REGRESSION ANALYSIS
    HOCKING, RR
    TECHNOMETRICS, 1967, 9 (01) : 188 - &
  • [7] Subset selection of best simulated systems
    Alrefaei, Mahmoud H.
    Almomani, Mohammad
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2007, 344 (05): : 495 - 506
  • [8] Assessing Variable Importance for Best Subset Selection
    Seedorff, Jacob
    Cavanaugh, Joseph E.
    ENTROPY, 2024, 26 (09)
  • [9] Variable selection with stepwise and best subset approaches
    Zhang, Zhongheng
    ANNALS OF TRANSLATIONAL MEDICINE, 2016, 4 (07)
  • [10] SUBSET-SELECTION OF AN ALMOST BEST TREATMENT
    VANDERLAAN, P
    BIOMETRICAL JOURNAL, 1992, 34 (06) : 647 - 656