Making the cut: improved ranking and selection for large-scale inference

被引:11
|
作者
Henderson, Nicholas C. [1 ]
Newton, Michael A. [1 ]
机构
[1] Univ Wisconsin, Madison, WI 53706 USA
基金
美国国家卫生研究院;
关键词
Empirical Bayes; Posterior expected rank; r-value; 2-STAGE; MODEL;
D O I
10.1111/rssb.12131
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Identifying leading measurement units from a large collection is a common inference task in various domains of large-scale inference. Testing approaches, which measure evidence against a null hypothesis rather than effect magnitude, tend to overpopulate lists of leading units with those associated with low measurement error. By contrast, local maximum likelihood approaches tend to favour units with high measurement error. Available Bayesian and empirical Bayesian approaches rely on specialized loss functions that result in similar deficiencies. We describe and evaluate a generic empirical Bayesian ranking procedure that populates the list of top units in a way that maximizes the expected overlap between the true and reported top lists for all list sizes. The procedure relates unit-specific posterior upper tail probabilities with their empirical distribution to yield a ranking variable. It discounts high variance units less than popular non-maximum-likelihood methods and thus achieves improved operating characteristics in the models considered.
引用
收藏
页码:781 / 804
页数:24
相关论文
共 50 条
  • [21] Algorithm of OMA for large-scale orthology inference
    Alexander CJ Roth
    Gaston H Gonnet
    Christophe Dessimoz
    BMC Bioinformatics, 9
  • [22] Large-scale simultaneous inference under dependence
    Tian, Jinjin
    Chen, Xu
    Katsevich, Eugene
    Goeman, Jelle
    Ramdas, Aaditya
    SCANDINAVIAN JOURNAL OF STATISTICS, 2023, 50 (02) : 750 - 796
  • [23] Multivariate Hawkes Processes for Large-Scale Inference
    Lemonnier, Remi
    Scaman, Kevin
    Kalogeratos, Argyris
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2168 - 2174
  • [24] Large-scale inference of human genetic data
    Rivas, M. A.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2019, 27 : 1064 - 1064
  • [25] Statistical inference with large-scale trait imputation
    Ren, Jingchen
    Pan, Wei
    STATISTICS IN MEDICINE, 2024, 43 (04) : 625 - 641
  • [26] Contextual Ranking of Behaviors for Large-scale Multiagent Simulations
    Parikh, Nidhi
    Marathe, Madhav V.
    Swarup, Samarth
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 1676 - 1678
  • [27] Ranking of closeness centrality for large-scale social networks
    Okamoto, Kazuya
    Chen, Wei
    Li, Xiang-Yang
    FRONTIERS IN ALGORITHMICS, 2008, 5059 : 186 - +
  • [28] RankRC: Large-Scale Nonlinear Rare Class Ranking
    Tayal, Aditya
    Coleman, Thomas F.
    Li, Yuying
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (12) : 3347 - 3359
  • [29] Unsupervised Domain Ranking in Large-Scale Web Crawls
    Cui, Yi
    Sparkman, Clint
    Lee, Hsin-Tsang
    Loguinov, Dmitri
    ACM TRANSACTIONS ON THE WEB, 2018, 12 (04)
  • [30] Large-scale microarray data based feature selection for improved molecular classification
    Lu, Liangqun
    Daigle, Bernie J., Jr.
    BMC BIOINFORMATICS, 2017, 18