Valid statistical inference methods for a case-control study with missing data

被引:5
|
作者
Tian, Guo-Liang [1 ]
Zhang, Chi [2 ]
Jiang, Xuejun [1 ]
机构
[1] South Univ Sci & Technol China, Dept Math, Shenzhen, Guangdong, Peoples R China
[2] Univ Hong Kong, Dept Stat & Actuarial Sci, Pokfulam Rd, Hong Kong, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Bootstrap methods; case-control study; missing at random; the mechanism augmentation method; Wald test; CONFIDENCE-INTERVAL CONSTRUCTION; INCOMPLETE DATA; CONTINGENCY-TABLES; PROPORTIONS; TESTS;
D O I
10.1177/0962280216649619
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
The main objective of this paper is to derive the valid sampling distribution of the observed counts in a case-control study with missing data under the assumption of missing at random by employing the conditional sampling method and the mechanism augmentation method. The proposed sampling distribution, called the case-control sampling distribution, can be used to calculate the standard errors of the maximum likelihood estimates of parameters via the Fisher information matrix and to generate independent samples for constructing small-sample bootstrap confidence intervals. Theoretical comparisons of the new case-control sampling distribution with two existing sampling distributions exhibit a large difference. Simulations are conducted to investigate the influence of the three different sampling distributions on statistical inferences. One finding is that the conclusion by the Wald test for testing independency under the two existing sampling distributions could be completely different (even contradictory) from the Wald test for testing the equality of the success probabilities in control/case groups under the proposed distribution. A real cervical cancer data set is used to illustrate the proposed statistical methods.
引用
收藏
页码:1001 / 1023
页数:23
相关论文
共 50 条
  • [31] Handling Missing Data in Matched Case-Control Studies Using Multiple Imputation
    Seaman, Shaun R.
    Keogh, Ruth H.
    BIOMETRICS, 2015, 71 (04) : 1150 - 1159
  • [32] WHEN IS A CASE-CONTROL STUDY A CASE-CONTROL STUDY?
    Mayo, Nancy E.
    Goldberg, Mark S.
    JOURNAL OF REHABILITATION MEDICINE, 2009, 41 (04) : 217 - 222
  • [33] Estimation and inference for semi-competing risks based on data from a nested case-control study
    Jazic, Ina
    Lee, Stephanie
    Haneuse, Sebastien
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2020, 29 (11) : 3326 - 3339
  • [34] WHEN IS A CASE-CONTROL STUDY NOT A CASE-CONTROL STUDY?
    Mayo, Nancy E.
    Goldberg, Mark S.
    JOURNAL OF REHABILITATION MEDICINE, 2009, 41 (04) : 209 - 216
  • [35] Inference on haplotype effects in case-control studies using unphased genotype data
    Epstein, MP
    Satten, GA
    AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (06) : 1316 - 1329
  • [36] Statistical Learning Methods Applicable to Genome-Wide Association Studies on Unbalanced Case-Control Disease Data
    Dai, Xiaotian
    Fu, Guifang
    Zhao, Shaofei
    Zeng, Yifei
    GENES, 2021, 12 (05)
  • [37] On the robustness of weighted methods for fitting models to case-control data
    Scott, A
    Wild, C
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2002, 64 : 207 - 219
  • [38] Application of sequential haplotype scan methods to case-control data
    Zhaoxia Yu
    Daniel J Schaid
    BMC Proceedings, 1 (Suppl 1)
  • [39] Enabling network inference methods to handle missing data and outliers
    Folch-Fortuny, Abel
    Villaverde, Alejandro F.
    Ferrer, Alberto
    Banga, Julio R.
    BMC BIOINFORMATICS, 2015, 16
  • [40] Statistical tests of genetic association for case-control study designs
    Wang, Kai
    BIOSTATISTICS, 2012, 13 (04) : 724 - 733