Making the cut: improved ranking and selection for large-scale inference

被引:11
|
作者
Henderson, Nicholas C. [1 ]
Newton, Michael A. [1 ]
机构
[1] Univ Wisconsin, Madison, WI 53706 USA
基金
美国国家卫生研究院;
关键词
Empirical Bayes; Posterior expected rank; r-value; 2-STAGE; MODEL;
D O I
10.1111/rssb.12131
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Identifying leading measurement units from a large collection is a common inference task in various domains of large-scale inference. Testing approaches, which measure evidence against a null hypothesis rather than effect magnitude, tend to overpopulate lists of leading units with those associated with low measurement error. By contrast, local maximum likelihood approaches tend to favour units with high measurement error. Available Bayesian and empirical Bayesian approaches rely on specialized loss functions that result in similar deficiencies. We describe and evaluate a generic empirical Bayesian ranking procedure that populates the list of top units in a way that maximizes the expected overlap between the true and reported top lists for all list sizes. The procedure relates unit-specific posterior upper tail probabilities with their empirical distribution to yield a ranking variable. It discounts high variance units less than popular non-maximum-likelihood methods and thus achieves improved operating characteristics in the models considered.
引用
收藏
页码:781 / 804
页数:24
相关论文
共 50 条
  • [41] LABIN: Balanced Min Cut for Large-Scale Data
    Chen, Xiaojun
    Chen, Renjie
    Wu, Qingyao
    Fang, Yixiang
    Nie, Feiping
    Huang, Joshua Zhexue
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (03) : 725 - 736
  • [42] Seismic design of large-scale cut and cover tunnel
    Adachi, Y
    Fujii, Y
    Ishizaki, H
    MODERN TUNNELING SCIENCE AND TECHNOLOGY, VOLS I AND II, 2001, : 947 - 951
  • [43] A CUT OF LARGE-SCALE STRUCTURE ACROSS THE COMA SUPERCLUSTER
    KARACHENTSEV, ID
    KOPYLOV, AI
    IAU SYMPOSIA, 1988, (130): : 139 - 150
  • [44] Erratum to: Algorithm of OMA for large-scale orthology inference
    Alexander CJ Roth
    Gaston H Gonnet
    Christophe Dessimoz
    BMC Bioinformatics, 10
  • [45] Accelerating Large-Scale Inference with Anisotropic Vector Quantization
    Guo, Ruiqi
    Sun, Philip
    Lindgren, Erik
    Geng, Quan
    Simcha, David
    Chern, Felix
    Kumar, Sanjiv
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [46] Distributed Bayesian Inference for Large-Scale IoT Systems
    Vlachou, Eleni
    Karras, Aristeidis
    Karras, Christos
    Theodorakopoulos, Leonidas
    Halkiopoulos, Constantinos
    Sioutas, Spyros
    BIG DATA AND COGNITIVE COMPUTING, 2024, 8 (01)
  • [47] Fast Hamiltonian sampling for large-scale structure inference
    Jasche, Jens
    Kitaura, Francisco S.
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2010, 407 (01) : 29 - 42
  • [48] RANK: Large-Scale Inference With Graphical Nonlinear Knockoffs
    Fan, Yingying
    Demirkaya, Emre
    Li, Gaorong
    Lv, Jinchi
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (529) : 362 - 379
  • [49] Halo detection via large-scale Bayesian inference
    Merson, Alexander I.
    Jasche, Jens
    Abdalla, Filipe B.
    Lahav, Ofer
    Wandelt, Benjamin
    Jones, D. Heath
    Colless, Matthew
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2016, 460 (02) : 1340 - 1355
  • [50] RANK: Large-scale inference with graphical nonlinear knockoffs
    Fan, Yingying
    Demirkaya, Emre
    Li, Gaorong
    Lv, Jinchi
    arXiv, 2017,