A Splicing Approach to Best Subset of Groups Selection

被引:10
|
作者
Zhang, Yanhang [1 ,2 ]
Zhu, Junxian [3 ]
Zhu, Jin [2 ]
Wang, Xueqin [4 ]
机构
[1] Renmin Univ China, Sch Stat, Beijing 100872, Peoples R China
[2] Sun Yat Sen Univ, Southern China Ctr Stat Sci, Sch Math, Dept Stat Sci, Guangzhou 510275, Peoples R China
[3] Natl Univ Singapore, Saw Swee Hock Sch Publ Hlth, Singapore 117549, Singapore
[4] Univ Sci & Technol China, Int Inst Finance, Sch Management, Dept Stat & Finance, Hefei 230026, Peoples R China
基金
中国国家自然科学基金;
关键词
best subset of groups selection; group splicing; group information criterion; selection consistency of subset of groups; polynomial computational complexity; VARIABLE SELECTION; GENE-EXPRESSION; MODEL SELECTION; REGRESSION; SPARSITY; LASSO; RECOVERY; SIGNALS; UNION;
D O I
10.1287/ijoc.2022.1241
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Best subset of groups selection (BSGS) is the process of selecting a small part of nonoverlapping groups to achieve the best interpretability on the response variable. It has attracted increasing attention and has far-reaching applications in practice. However, due to the computational intractability of BSGS in high-dimensional settings, developing efficient algorithms for solving BSGS remains a research hotspot. In this paper, we propose a group -splicing algorithm that iteratively detects the relevant groups and excludes the irrelevant ones. Moreover, coupled with a novel group information criterion, we develop an adaptive algorithm to determine the optimal model size. Under certain conditions, it is certifiable that our algorithm can identify the optimal subset of groups in polynomial time with high probability. Finally, we demonstrate the efficiency and accuracy of our methods by compar-ing them with several state-of-the-art algorithms on both synthetic and real-world data sets.
引用
收藏
页码:104 / 119
页数:17
相关论文
共 50 条
  • [41] Analyzing distributed Spark MLlib regression algorithms for accuracy, execution efficiency and scalability using best subset selection approach
    Piyush Sewal
    Hari Singh
    Multimedia Tools and Applications, 2024, 83 : 44047 - 44066
  • [42] Bayesian approach to best basis selection
    Pesquet, JC
    Krim, H
    Leporini, D
    Hamman, E
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 2634 - 2637
  • [43] A Feature Subset Selection Approach For Predicting Smoking Behaviours
    Long TonThat
    Vu Truong Son Dao
    Huynh Tran Minh Tri
    Minh Tuan Le
    2023 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP, SSP, 2023, : 145 - 149
  • [44] Axiomatic approach to feature subset selection based on relevance
    Wang, H
    Bell, D
    Murtagh, F
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1999, 21 (03) : 271 - 277
  • [45] Modified MMS: Minimization Approach for Model Subset Selection
    Rajathi, C.
    Rukmani, P.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 77 (01): : 733 - 756
  • [46] A FIXED SUBSET-SIZE APPROACH TO SELECTION PROBLEM
    DESU, MM
    SOBEL, M
    BIOMETRIKA, 1968, 55 (02) : 401 - &
  • [47] Numerical evaluation of the good subset approach to variable selection
    Smit, Neill
    de Jongh, Riaan
    Venter, Hennie
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2025,
  • [48] Bayesian subset selection approach to ranking normal means
    Hamilton, Cody
    Bratcher, Tom L.
    Stamey, James D.
    JOURNAL OF APPLIED STATISTICS, 2008, 35 (08) : 847 - 851
  • [49] An efficient feature subset selection approach for machine learning
    Rincy, N. Thomas
    Gupta, Roopam
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (08) : 12737 - 12830
  • [50] Wrapper approach for feature subset selection using GA
    Zhou, Huilin
    Wu, Jianbin
    Wang, Yuhao
    Tian, Mao
    2007 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, VOLS 1 AND 2, 2007, : 224 - +