Model-Free Conditional Feature Screening with FDR Control

被引:10
|
作者
Tong, Zhaoxue [1 ]
Cai, Zhanrui [2 ]
Yang, Songshan [3 ]
Li, Runze [1 ]
机构
[1] Penn State Univ, University Pk, PA USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Renmin Univ China, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
False discovery rate control; Ranking consistency; Sure screening; Ultra-high dimensional data analysis; FEATURE-SELECTION; FILTER; RATES;
D O I
10.1080/01621459.2022.2063130
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this article, we propose a model-free conditional feature screening method with false discovery rate (FDR) control for ultra-high dimensional data. The proposed method is built upon a new measure of conditional independence. Thus, the new method does not require a specific functional form of the regression function and is robust to heavy-tailed responses and predictors. The variables to be conditional on are allowed to be multivariate. The proposed method enjoys sure screening and ranking consistency properties under mild regularity conditions. To control the FDR, we apply the Reflection via Data Splitting method and prove its theoretical guarantee using martingale theory and empirical process techniques. Simulated examples and real data analysis show that the proposed method performs very well compared with existing works. Supplementary materials for this article are available online.
引用
收藏
页码:2575 / 2587
页数:13
相关论文
共 50 条
  • [41] Model-free feature screening via distance correlation for ultrahigh dimensional survival data
    Zhang, Jing
    Liu, Yanyan
    Cui, Hengjian
    STATISTICAL PAPERS, 2021, 62 (06) : 2711 - 2738
  • [42] A Revisit to Model-Free Control
    Li, Wanrong
    Yuan, Huawei
    Li, Sinan
    Zhu, Jianguo
    IEEE TRANSACTIONS ON POWER ELECTRONICS, 2022, 37 (12) : 14408 - 14421
  • [43] Distribution-free and model-free multivariate feature screening via multivariate rank distance correlation
    Zhao, Shaofei
    Fu, Guifang
    JOURNAL OF MULTIVARIATE ANALYSIS, 2022, 192
  • [44] An efficient model-free estimation of multiclass conditional probability
    Xu, Tu
    Wang, Junhui
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2013, 143 (12) : 2079 - 2088
  • [45] Model-free network control
    Shulman, Jason
    Malatino, Frank
    Gunaratne, Gemunu H.
    PHYSICA D-NONLINEAR PHENOMENA, 2020, 408
  • [46] Quantile feature screening for infinite dimensional data under FDR control
    Tian, Zhentao
    Zhang, Zhongzhan
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2025, 206
  • [47] Adaptive model-free sure independence screening
    Wen, Canhong
    Zhu, Shan
    Chen, Xin
    Wang, Xueqin
    STATISTICS AND ITS INTERFACE, 2017, 10 (03) : 399 - 406
  • [48] Model-free screening for variables with treatment interaction
    Bizuayehu, Shiferaw B.
    Xu, Jin
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2022, 31 (10) : 1845 - 1859
  • [49] Entropy-based model-free feature screening for ultrahigh-dimensional multiclass classification
    Ni, Lyu
    Fang, Fang
    JOURNAL OF NONPARAMETRIC STATISTICS, 2016, 28 (03) : 515 - 530
  • [50] Model-free, monotone invariant and computationally efficient feature screening with data-adaptive threshold
    Deng, Linsui
    Zhang, Yilin
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2024, 228 : 23 - 33