Rank Tests from Partially Ordered Data Using Importance and MCMC Sampling Methods

被引:0
|
作者
Mondal, Debashis [1 ]
Hinrichs, Nina [2 ]
机构
[1] Oregon State Univ, Dept Stat, Corvallis, OR 97330 USA
[2] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
关键词
Exact tests; fuzzy p-values; Gibbs sampling; iterval; censoring; linear extensions; linear rank statistics; perfect MCMC; proportional hazard model; topological sorting; PROPORTIONAL HAZARDS MODEL; INTERVAL-CENSORED-DATA; FAILURE TIME DATA; LINEAR EXTENSIONS; MARKOV-CHAINS; P-VALUES; DISTRIBUTIONS; SUBSEQUENCES; HYPOTHESES; STATISTICS;
D O I
10.1214/16-STS549
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We discuss distribution-free exact rank tests from partially ordered data that arise in various biological and other applications where the primary objective is to conduct testing of significance to assess the linear dependence or to compare different groups. The tests here are obtained by treating the usual rank statistics, based on the completely ordered data as "latent" or missing, and conceptualizing the "latent" p-value as the random probability under the null hypothesis of a test statistic that is as extreme, or more extreme, than the latent test statistics based on the completely ordered data. The latent p-value is then predicted by sampling linear extensions or the complete orderings that are consistent with the observed partially ordered data. The sampling methods explored here include importance sampling methods based on randomized topological sorting algorithms, Gibbs sampling methods, random-walk based Metropolis-Hasting sampling methods and random-walk based modern perfect Markov chain Monte Carlo sampling methods. We discuss running times of these sampling methods and their strength and weaknesses. A simulation experiment and three data examples are given. The simulation experiment illustrates how the exact rank tests from partially ordered data work when the desired result is known. The first data example concerns the light preference behavior of fruit flies and tests whether heterogeneity observed in average light-preference behavior can be explained by manipulations in serotonin signaling. The second one is a reanalysis of the lead absorption data in children of employees who worked in a lead battery factory and consolidates the results reported in Rosenbaum [Ann. Statist. 19 (1991) 1091-1097]. The third one reexamines the breast cosmesis data from Finkelstein
引用
收藏
页码:325 / 347
页数:23
相关论文
共 50 条
  • [31] Computing highly accurate confidence limits from discrete data using importance sampling
    Lloyd, Chris J.
    Li, Degui
    STATISTICS AND COMPUTING, 2014, 24 (04) : 663 - 673
  • [32] Bayesian analysis of longitudinal ordered data with flexible random effects using McMC: application to diabetic macular Edema data
    Mansourian, Marjan
    Kazemnejad, Anoshirvan
    Kazemi, Iraj
    Zayeri, Farid
    Soheilian, Masoud
    JOURNAL OF APPLIED STATISTICS, 2012, 39 (05) : 1087 - 1100
  • [33] Sampling and recovery of MRI data using low rank tensor models
    Banco, Daniel
    Aeron, Shuchin
    Hoge, W. Scott
    2016 38TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2016, : 448 - 452
  • [34] The Importance of Importance Sampling: Exploring Methods of Sampling from Alternatives in Discrete Choice Models of Crime Location Choice
    Sophie Curtis-Ham
    Wim Bernasco
    Oleg N. Medvedev
    Devon L. L. Polaschek
    Journal of Quantitative Criminology, 2022, 38 : 1003 - 1031
  • [35] The Importance of Importance Sampling: Exploring Methods of Sampling from Alternatives in Discrete Choice Models of Crime Location Choice
    Curtis-Ham, Sophie
    Bernasco, Wim
    Medvedev, Oleg N.
    Polaschek, Devon L. L.
    JOURNAL OF QUANTITATIVE CRIMINOLOGY, 2022, 38 (04) : 1003 - 1031
  • [36] A study on sampling methods for chloride profiles: simulations using data from EPMA
    Wall, Henrik
    Nilsson, Lars-Olof
    MATERIALS AND STRUCTURES, 2008, 41 (07) : 1275 - 1281
  • [37] A study on sampling methods for chloride profiles: simulations using data from EPMA
    Henrik Wall
    Lars-Olof Nilsson
    Materials and Structures, 2008, 41 : 1275 - 1281
  • [38] Statistical Monitoring of Time-to-Failure Data Using Rank Tests
    Li, Zhiguo
    Zhou, Shiyu
    Sievenpiper, Crispian
    Choubey, Suresh
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2012, 28 (03) : 321 - 333
  • [39] Reweighting for nonequilibrium Markov processes using sequential importance sampling methods
    Lee, HK
    Okabe, Y
    PHYSICAL REVIEW E, 2005, 71 (01):
  • [40] OBTAINING PAIRED COMPARISONS DATA FROM MULTIPLE RANK ORDERS USING PARTIALLY BALANCED INCOMPLETE BLOCK DESIGNS
    STRATON, RG
    AUSTRALIAN PSYCHOLOGIST, 1972, 7 (03) : 269 - 270