Use of random forest to estimate population attributable fractions from a case-control study of Salmonella enterica serotype Enteritidis infections

被引:29
|
作者
Gu, W. [1 ]
Vieira, A. R. [1 ]
Hoekstra, R. M. [2 ]
Griffin, P. M. [1 ]
Cole, D. [1 ]
机构
[1] Ctr Dis Control & Prevent, Enter Dis Epidemiol Branch, Atlanta, GA 30333 USA
[2] Ctr Dis Control, Div Foodborne Waterborne & Environm Dis Atlanta, Atlanta, GA 30333 USA
关键词
Causality; counterfactual; foodborne diseases; logistic regression; machine learning; CLASSIFICATION;
D O I
10.1017/S095026881500014X
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
To design effective food safety programmes we need to estimate how many sporadic foodborne illnesses are caused by specific food sources based on case-control studies. Logistic regression has substantive limitations for analysing structured questionnaire data with numerous exposures and missing values. We adapted random forest to analyse data of a case-control study of Salmonella enterica serotype Enteritidis illness for source attribution. For estimation of summary population attributable fractions (PAFs) of exposures grouped into transmission routes, we devised a counterfactual estimator to predict reductions in illness associated with removing grouped exposures. For the purpose of comparison, we fitted the data using logistic regression models with stepwise forward and backward variable selection. Our results show that the forward and backward variable selection of logistic regression models were not consistent for parameter estimation, with different significant exposures identified. By contrast, the random forest model produced estimated PAFs of grouped exposures consistent in rank order with results obtained from outbreak data, with egg-related exposures having the highest estimated PAF (22.1%, 95% confidence interval 8.5-31.8). Random forest might be structurally more coherent and efficient than logistic regression models for attributing Salmonella illnesses to sources involving many causal pathways.
引用
收藏
页码:2786 / 2794
页数:9
相关论文
共 50 条
  • [41] Use of Antidepressants and Induced Abortions: Population Based Case-Control Study from Three Nordic Countries
    Kieler, Helle
    Furu, Kari
    Gissler, Mika
    Noergaard, Mette
    Valdimarsdottir, Unnur
    Haglund, Bengt
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2014, 23 : 167 - 168
  • [42] Autoimmune Diseases, Infections, Use of Antibiotics and the Risk of Acute Myeloid Leukemia: A National Population-Based Case-Control Study
    Ostgard, Lene Sofie Granfeldt
    Norgaard, Mette
    Pedersen, Lars
    Ostgard, Rene
    Overgaard, Ulrik Malthe
    Schollkopf, Claudia
    Severinsen, Marianne Tang
    Marcher, Claus Werenberg
    Jensen, Morten Krogh
    BLOOD, 2017, 130
  • [43] Autoimmune diseases, infections, use of antibiotics and the risk of acute myeloid leukaemia: a national population-based case-control study
    Ostgard, Lene S. G.
    Norgaard, Mette
    Pedersen, Lars
    Ostgard, Rene D.
    Medeiros, Bruno C.
    Overgaard, Ulrik M.
    Schollkopf, Claudia
    Severinsen, Marianne
    Marcher, Claus W.
    Jensen, Morten K.
    BRITISH JOURNAL OF HAEMATOLOGY, 2018, 181 (02) : 205 - 214
  • [44] Infections and the risk of incident giant cell arteritis: a population-based, case-control study
    Rhee, Rennie L.
    Grayson, Peter C.
    Merkel, Peter A.
    Tomasson, Gunnar
    ANNALS OF THE RHEUMATIC DISEASES, 2017, 76 (06) : 1031 - 1035
  • [45] Medically diagnosed infections and risk of childhood leukaemia: a population-based case-control study
    Chang, Jeffrey S.
    Tsai, Chia-Rung
    Tsai, Yi-Wen
    Wiemels, Joseph L.
    INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2012, 41 (04) : 1050 - 1059
  • [46] Infections and risk of type I diabetes in childhood: A population-based case-control study
    Altobelli, E
    Petrocelli, R
    Verrotti, A
    Valenti, M
    EUROPEAN JOURNAL OF EPIDEMIOLOGY, 2003, 18 (05) : 425 - 430
  • [47] Infections and the Risk of Incident Giant Cell Arteritis: A Population-Based, Case-Control Study
    Rhee, Rennie L.
    Grayson, Peter C.
    Merkel, Peter A.
    Tomasson, Gunnar
    ARTHRITIS & RHEUMATOLOGY, 2016, 68
  • [48] INFECTIONS AND THE RISK OF INCIDENT GIANT CELL ARTERITIS: A POPULATION-BASED CASE-CONTROL STUDY
    Rhee, Rennie L.
    Grayson, Peter C.
    Merkel, Peter A.
    Tomasson, Gunnar
    RHEUMATOLOGY, 2017, 56 : 52 - 53
  • [49] Epidemiological and Clinical Characteristics of Non-Typhoidal Salmonella Bloodstream Infections in Central Israel: A Case-Control Study
    Israel, Yael
    Muhsen, Khitam
    Rokney, Assaf
    Adler, Amos
    MICROORGANISMS, 2022, 10 (10)
  • [50] Statin use and the risk of kidney cancer: a population-based case-control study
    Chiu, Hui-Fen
    Kuo, Chien-Chun
    Kuo, Hsin-Wei
    Lee, I-Ming
    Lee, Chien-Te
    Yang, Chun-Yuh
    EXPERT OPINION ON DRUG SAFETY, 2012, 11 (04) : 543 - 549