A prototype knockoff filter for group selection with FDR control

Cited by: 4
Authors
Chen, Jiajie [1]
Hou, Anthony [2]
Hou, Thomas Y. [3]
Affiliations
[1] Peking Univ, Sch Math Sci, Beijing 100871, Peoples R China
[2] Harvard Univ, Dept Stat, Cambridge, MA 02138 USA
[3] CALTECH, Appl & Computat Math, Pasadena, CA 91125 USA
Funding
National Science Foundation (USA)
Keywords
variable selection; false discovery rate (FDR); group variable selection; knockoff filter; linear regression
DOI
10.1093/imaiai/iaz012
Chinese Library Classification number
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
In many applications, we need to study a linear regression model that consists of a response variable and a large number of potential explanatory variables, and determine which variables are truly associated with the response. In Foygel Barber & Candes (2015, Ann. Statist., 43, 2055-2085), the authors introduced a new variable selection procedure called the knockoff filter to control the false discovery rate (FDR) and proved that this method achieves exact FDR control. In this paper, we propose a prototype knockoff filter for group selection by extending the Reid-Tibshirani (2016, Biostatistics, 17, 364-376) prototype method. Our prototype knockoff filter improves the computational efficiency and statistical power of the Reid-Tibshirani prototype method when it is applied for group selection. In some cases when the group features are spanned by one or a few hidden factors, we demonstrate that the Principal Component Analysis (PCA) prototype knockoff filter outperforms the Dai-Foygel Barber (2016, 33rd International Conference on Machine Learning (ICML 2016)) group knockoff filter. We present several numerical experiments to compare our prototype knockoff filter with the Reid-Tibshirani prototype method and the group knockoff filter. We have also conducted some analysis of the knockoff filter. Our analysis reveals that some knockoff path method statistics, including the Lasso path statistic, may lead to loss of power for certain design matrices and a specially designed response even if their signal strengths are still relatively strong.
Pages: 271-288
Page count: 18