Use of random forest to estimate population attributable fractions from a case-control study of Salmonella enterica serotype Enteritidis infections

被引:29
|
作者
Gu, W. [1 ]
Vieira, A. R. [1 ]
Hoekstra, R. M. [2 ]
Griffin, P. M. [1 ]
Cole, D. [1 ]
机构
[1] Ctr Dis Control & Prevent, Enter Dis Epidemiol Branch, Atlanta, GA 30333 USA
[2] Ctr Dis Control, Div Foodborne Waterborne & Environm Dis Atlanta, Atlanta, GA 30333 USA
关键词
Causality; counterfactual; foodborne diseases; logistic regression; machine learning; CLASSIFICATION;
D O I
10.1017/S095026881500014X
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
To design effective food safety programmes we need to estimate how many sporadic foodborne illnesses are caused by specific food sources based on case-control studies. Logistic regression has substantive limitations for analysing structured questionnaire data with numerous exposures and missing values. We adapted random forest to analyse data of a case-control study of Salmonella enterica serotype Enteritidis illness for source attribution. For estimation of summary population attributable fractions (PAFs) of exposures grouped into transmission routes, we devised a counterfactual estimator to predict reductions in illness associated with removing grouped exposures. For the purpose of comparison, we fitted the data using logistic regression models with stepwise forward and backward variable selection. Our results show that the forward and backward variable selection of logistic regression models were not consistent for parameter estimation, with different significant exposures identified. By contrast, the random forest model produced estimated PAFs of grouped exposures consistent in rank order with results obtained from outbreak data, with egg-related exposures having the highest estimated PAF (22.1%, 95% confidence interval 8.5-31.8). Random forest might be structurally more coherent and efficient than logistic regression models for attributing Salmonella illnesses to sources involving many causal pathways.
引用
收藏
页码:2786 / 2794
页数:9
相关论文
共 50 条
  • [21] Emergence of Salmonella enteritidis phage type 4 in the Caribbean:: Case-control study in Trinidad and Tobago, West Indies
    Indar-Harrinauth, L
    Daniels, N
    Prabhakar, P
    Brown, C
    Baccus-Taylor, G
    Comissiong, E
    Hospedales, J
    CLINICAL INFECTIOUS DISEASES, 2001, 32 (06) : 890 - 896
  • [22] Population-Attributable Risks for Ischemic Stroke in a Community in South Brazil: A Case-Control Study
    Mallmann, Adroaldo Baseggio
    Fuchs, Sandra Costa
    Gus, Miguel
    Fuchs, Flavio Danni
    Moreira, Leila Beltrami
    PLOS ONE, 2012, 7 (04):
  • [23] Reptiles, amphibians, and human Salmonella infection:: A population-based, case-control study
    Mermin, J
    Hutwagner, L
    Vugia, D
    Shallow, S
    Daily, P
    Bender, J
    Koehler, J
    Marcus, R
    Angulo, FJ
    CLINICAL INFECTIOUS DISEASES, 2004, 38 : S253 - S261
  • [24] On the use of population attributable fraction to determine sample size for case-control studies of gene-environment interaction
    Yang, QH
    Khoury, MJ
    Friedman, JM
    Flanders, WD
    EPIDEMIOLOGY, 2003, 14 (02) : 161 - 167
  • [25] On Estimation of Time-Dependent Attributable Fraction from Population-Based Case-Control Studies
    Zhao, Wei
    Chen, Ying Qing
    Hsu, Li
    BIOMETRICS, 2017, 73 (03) : 866 - 875
  • [26] HCV INFECTION, HBSAG CARRIER STATE AND HCC - RELATIVE RISK AND POPULATION ATTRIBUTABLE RISK FROM A CASE-CONTROL STUDY IN ITALY
    STROFFOLINI, T
    CHIARAMONTE, M
    TIRIBELLI, C
    VILLA, E
    SIMONETTI, RG
    RAPICETTA, M
    STAZI, MA
    BERTIN, T
    CROCE, L
    TRANDE, P
    MAGLIOCCO, A
    CHIONNE, P
    HEPATOLOGY, 1992, 16 (02) : 595 - 595
  • [27] The role of the contaminated environment in the occurrence of sporadic non-typhoidal Salmonella infections in Michigan children: Findings from a population-based case-control study
    Younus, M.
    Wilkins, M.
    Nguyen, C.
    Davies, H. D.
    Rahbar, H.
    Funk, J.
    Siddiqi, A.
    Saeed, A. M.
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2008, 167 (11) : S77 - S77
  • [28] National outbreak of Salmonella Enteritidis phage type 14b in England, September to December 2009: case-control study
    Janmohamed, K.
    Zenner, D.
    Little, C.
    Lane, C.
    Wain, J.
    Charlett, A.
    Adak, B.
    Morgan, D.
    EUROSURVEILLANCE, 2011, 16 (15): : 5 - 10
  • [29] Case-control study of disease determinants for non-typhoidal Salmonella infections among Michigan children
    Younus M.
    Wilkins M.J.
    Davies H.D.
    Rahbar M.H.
    Funk J.
    Nguyen C.
    Siddiqi A.-E.A.
    Cho S.
    Saeed M.
    BMC Research Notes, 3 (1)
  • [30] Prostatitis, other genitourinary infections and prostate cancer: results from a population-based case-control study
    Boehm, Katharina
    Valdivieso, Roger
    Meskawi, Malek
    Larcher, Alessandro
    Schiffmann, Jonas
    Sun, Maxine
    Graefen, Markus
    Saad, Fred
    Parent, Marie-Elise
    Karakiewicz, Pierre I.
    WORLD JOURNAL OF UROLOGY, 2016, 34 (03) : 425 - 430