Model-Free Conditional Feature Screening with FDR Control

被引:10
|
作者
Tong, Zhaoxue [1 ]
Cai, Zhanrui [2 ]
Yang, Songshan [3 ]
Li, Runze [1 ]
机构
[1] Penn State Univ, University Pk, PA USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Renmin Univ China, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
False discovery rate control; Ranking consistency; Sure screening; Ultra-high dimensional data analysis; FEATURE-SELECTION; FILTER; RATES;
D O I
10.1080/01621459.2022.2063130
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this article, we propose a model-free conditional feature screening method with false discovery rate (FDR) control for ultra-high dimensional data. The proposed method is built upon a new measure of conditional independence. Thus, the new method does not require a specific functional form of the regression function and is robust to heavy-tailed responses and predictors. The variables to be conditional on are allowed to be multivariate. The proposed method enjoys sure screening and ranking consistency properties under mild regularity conditions. To control the FDR, we apply the Reflection via Data Splitting method and prove its theoretical guarantee using martingale theory and empirical process techniques. Simulated examples and real data analysis show that the proposed method performs very well compared with existing works. Supplementary materials for this article are available online.
引用
收藏
页码:2575 / 2587
页数:13
相关论文
共 50 条
  • [21] A model-free conditional screening approach via sufficient dimension reduction
    Huo, Lei
    Wen, Xuerong Meggie
    Yu, Zhou
    JOURNAL OF NONPARAMETRIC STATISTICS, 2020, 32 (04) : 970 - 988
  • [22] A model-free feature screening approach based on kernel density estimation
    Li, Xiangjie
    Wang, Lei
    Zhang, Jingxiao
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2017, 87 (12) : 2450 - 2468
  • [23] Model-free feature screening for high-dimensional survival data
    Lin, Yuanyuan
    Liu, Xianhui
    Hao, Meiling
    SCIENCE CHINA-MATHEMATICS, 2018, 61 (09) : 1617 - 1636
  • [24] Robust model-free feature screening for ultrahigh dimensional surrogate data
    Lai, Peng
    Chen, Yuanxing
    Zhang, Jie
    Dai, Bingying
    Zhang, Qingzhao
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2020, 90 (03) : 550 - 569
  • [25] Model-free feature screening for high-dimensional survival data
    Yuanyuan Lin
    Xianhui Liu
    Meiling Hao
    Science China Mathematics, 2018, 61 : 1617 - 1636
  • [26] Model-free feature screening via a modified composite quantile correlation
    Xu, Kai
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2017, 188 : 22 - 35
  • [27] An iterative model-free feature screening procedure: Forward recursive selection
    Xia, Siwei
    Yang, Yuehan
    KNOWLEDGE-BASED SYSTEMS, 2022, 246
  • [28] Model-free feature screening for high-dimensional survival data
    Yuanyuan Lin
    Xianhui Liu
    Meiling Hao
    ScienceChina(Mathematics), 2018, 61 (09) : 79 - 98
  • [29] Model-free conditional screening for ultrahigh-dimensional survival data via conditional distance correlation
    Cui, Hengjian
    Liu, Yanyan
    Mao, Guangcai
    Zhang, Jing
    BIOMETRICAL JOURNAL, 2023, 65 (03)
  • [30] Model-free feature screening for categorical outcomes: Nonlinear effect detection and false discovery rate control
    Zhang, Qingyang
    Du, Yuchun
    PLOS ONE, 2019, 14 (05):