Sample-weighted semiparametric estimation of cause-specific cumulative risk and incidence using left- or interval-censored data from electronic health records

Authors
Hyun, Noorie [1]
Katki, Hormuzd A. [2]
Graubard, Barry I. [2]
Affiliations
[1] Med Coll Wisconsin, Div Biostat, Wauwatosa, WI 53226 USA
[2] NCI, Div Canc Epidemiol & Genet, Rockville, MD USA
Keywords
competing risks; left/interval censoring; nonparametric maximum likelihood estimation; stratified random sample; MAXIMUM-LIKELIHOOD-ESTIMATION; COMPETING RISKS; HUMAN-PAPILLOMAVIRUS; MODEL; INFERENCE
DOI
10.1002/sim.8544
Chinese Library Classification number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Electronic health records (EHRs) can be a cost-effective data source for forming cohorts and developing risk models in the context of disease screening. However, important issues need to be handled: competing outcomes, left-censoring of prevalent disease, interval-censoring of incident disease, and uncertainty of prevalent disease when accurate disease ascertainment is not conducted at baseline. Furthermore, novel tests that are costly and limited in availability can be conducted on stored biospecimens selected as samples from EHRs by using different sampling fractions. We extend sample-weighted semiparametric marginal mixture models to estimating competing risks. For flexible modeling of relative risks, a general transformation of the subdistribution hazard function and regression parameters is used. We propose a numerical algorithm for nonparametrically calculating the maximum likelihood estimates for subdistribution hazard functions and regression parameters. Methods for calculating the consistent confidence intervals for relative and absolute risk estimates are presented. The proposed algorithm and methods show reliable finite sample performance through simulation studies. We apply our methods to a cohort assembled from EHRs at a health maintenance organization where we estimate cumulative risk of cervical precancer/cancer and incidence of infection-clearance by HPV genotype among human papillomavirus (HPV) positive women. There is no significant difference in 3-year HPV-clearance rates across different HPV types, but 3-year cumulative risk of progression-to-precancer/cancer from HPV-16 is relatively higher than the other HPV genotypes.
Pages: 2387-2402
Page count: 16