Rank Tests from Partially Ordered Data Using Importance and MCMC Sampling Methods

被引：0

作者：

Mondal, Debashis ^{[1
]}

Hinrichs, Nina ^{[2
]}

机构：

[1] Oregon State Univ, Dept Stat, Corvallis, OR 97330 USA

[2] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA

来源：

STATISTICAL SCIENCE | 2016年 / 31卷 / 03期

关键词：

Exact tests; fuzzy p-values; Gibbs sampling; iterval; censoring; linear extensions; linear rank statistics; perfect MCMC; proportional hazard model; topological sorting; PROPORTIONAL HAZARDS MODEL; INTERVAL-CENSORED-DATA; FAILURE TIME DATA; LINEAR EXTENSIONS; MARKOV-CHAINS; P-VALUES; DISTRIBUTIONS; SUBSEQUENCES; HYPOTHESES; STATISTICS;

D O I：

10.1214/16-STS549

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

We discuss distribution-free exact rank tests from partially ordered data that arise in various biological and other applications where the primary objective is to conduct testing of significance to assess the linear dependence or to compare different groups. The tests here are obtained by treating the usual rank statistics, based on the completely ordered data as "latent" or missing, and conceptualizing the "latent" p-value as the random probability under the null hypothesis of a test statistic that is as extreme, or more extreme, than the latent test statistics based on the completely ordered data. The latent p-value is then predicted by sampling linear extensions or the complete orderings that are consistent with the observed partially ordered data. The sampling methods explored here include importance sampling methods based on randomized topological sorting algorithms, Gibbs sampling methods, random-walk based Metropolis-Hasting sampling methods and random-walk based modern perfect Markov chain Monte Carlo sampling methods. We discuss running times of these sampling methods and their strength and weaknesses. A simulation experiment and three data examples are given. The simulation experiment illustrates how the exact rank tests from partially ordered data work when the desired result is known. The first data example concerns the light preference behavior of fruit flies and tests whether heterogeneity observed in average light-preference behavior can be explained by manipulations in serotonin signaling. The second one is a reanalysis of the lead absorption data in children of employees who worked in a lead battery factory and consolidates the results reported in Rosenbaum [Ann. Statist. 19 (1991) 1091-1097]. The third one reexamines the breast cosmesis data from Finkelstein

引用

页码：325 / 347

页数：23

共 50 条

[21] Extracting partially ordered clusters from ordinal polytomous data
de Chiusole, Debora
Spoto, Andrea
Stefanutti, Luca
BEHAVIOR RESEARCH METHODS, 2020, 52 (02) : 503 - 520
[22] Visualizing Trace Variants from Partially Ordered Event Data
Schuster, Daniel
Schade, Lukas
van Zelst, Sebastiaan J.
van der Aalst, Wil M. P.
PROCESS MINING WORKSHOPS, ICPM 2021, 2022, 433 : 34 - 46
[23] Extracting partially ordered clusters from ordinal polytomous data
Debora de Chiusole
Andrea Spoto
Luca Stefanutti
Behavior Research Methods, 2020, 52 : 503 - 520
[24] A NOTE ON CHI-SQUARED APPROXIMATIONS FOR TESTS OF RANK INTERACTION WITH ORDERED CATEGORICAL-DATA
THOMAS, GE
COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 1995, 24 (02) : 439 - 459
[25] SAND RIPPLE CHARACTERIZATION USING AN EXTENDED SYNTHETIC APERTURE SONAR MODEL AND MCMC SAMPLING METHODS
Zare, Alina
Cobb, James T.
2013 OCEANS - SAN DIEGO, 2013,
[26] On distribution function estimation with partially rank-ordered set samples: estimating mercury level in fish using length frequency data
Nazari, Sahar
Jozani, Mohammad Jafari
Kharrati-Kopaei, Mahmood
STATISTICS, 2016, 50 (06) : 1387 - 1410
[27] A Boosting Approach for Learning to Rank Using SVD with Partially Labeled Data
Lin, Yuan
Lin, Hongfei
Yang, Zhihao
Su, Sui
INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2009, 5839 : 330 - +
[28] Mixture Model Analysis of Partially Rank-Ordered Set Samples: Age Groups of Fish from Length-Frequency Data
Hatefi, Armin
Jozani, Mohammad Jafari
Ozturk, Omer
SCANDINAVIAN JOURNAL OF STATISTICS, 2015, 42 (03) : 848 - 871
[29] Bayesian Network Reconstruction Using Systems Genetics Data: Comparison of MCMC Methods
Tasaki, Shinya
Ben Sauerwine
Hoff, Bruce
Toyoshiba, Hiroyoshi
Gaiteri, Chris
Chaibub Neto, Elias
GENETICS, 2015, 199 (04) : 973 - U128
[30] Computing highly accurate confidence limits from discrete data using importance sampling
Chris J. Lloyd
Degui Li
Statistics and Computing, 2014, 24 : 663 - 673

← 1 2 3 4 5 →