Use of random forest to estimate population attributable fractions from a case-control study of Salmonella enterica serotype Enteritidis infections
Cited by: 29
Authors: Gu, W. [1]; Vieira, A. R. [1]; Hoekstra, R. M. [2]; Griffin, P. M. [1]; Cole, D. [1]
Affiliations:
[1] Ctr Dis Control & Prevent, Enter Dis Epidemiol Branch, Atlanta, GA 30333 USA
[2] Ctr Dis Control, Div Foodborne Waterborne & Environm Dis Atlanta, Atlanta, GA 30333 USA
To design effective food safety programmes, we need to estimate from case-control studies how many sporadic foodborne illnesses are caused by specific food sources. Logistic regression has substantive limitations for analysing structured questionnaire data with numerous exposures and missing values. We adapted random forest to analyse data from a case-control study of Salmonella enterica serotype Enteritidis illness for source attribution. To estimate summary population attributable fractions (PAFs) for exposures grouped into transmission routes, we devised a counterfactual estimator that predicts the reduction in illness associated with removing the grouped exposures. For comparison, we fitted logistic regression models to the data using stepwise forward and backward variable selection. Forward and backward selection gave inconsistent parameter estimates and identified different significant exposures. By contrast, the random forest model produced estimated PAFs of grouped exposures consistent in rank order with results obtained from outbreak data, with egg-related exposures having the highest estimated PAF (22.1%, 95% confidence interval 8.5-31.8). Random forest might be structurally more coherent and efficient than logistic regression models for attributing Salmonella illnesses to sources involving many causal pathways.
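The abstract does not give the form of the counterfactual estimator, so the following is only a minimal sketch of one plausible reading: predict each subject's risk as observed, predict it again with every exposure in a group set to "unexposed", and take the relative drop in total predicted risk as the group's PAF. Everything named here is an assumption for illustration: the function `counterfactual_paf`, the exposure names, the toy data, and `toy_risk`, which is a hand-written stand-in for a fitted model's predicted probability of illness (it is not a random forest and its coefficients are invented).

```python
import math
from typing import Callable, Dict, List, Sequence

Record = Dict[str, int]  # exposure name -> 1 (exposed) or 0 (unexposed)


def counterfactual_paf(
    records: Sequence[Record],
    predict_risk: Callable[[Record], float],
    group: Sequence[str],
) -> float:
    """Relative reduction in total predicted risk when every exposure
    in `group` is set to 0 for every subject (hypothetical estimator)."""
    observed = sum(predict_risk(r) for r in records)
    if observed == 0:
        raise ValueError("total predicted risk is zero; PAF is undefined")
    # Counterfactual copy of each record with the grouped exposures removed.
    removed = {name: 0 for name in group}
    counterfactual = sum(predict_risk({**r, **removed}) for r in records)
    return (observed - counterfactual) / observed


def toy_risk(record: Record) -> float:
    """Stand-in for a fitted model's predicted probability of illness.
    The coefficients are invented for illustration only."""
    logit = (
        -2.0
        + 1.2 * record.get("raw_eggs", 0)
        + 0.8 * record.get("runny_eggs", 0)
        + 0.5 * record.get("chicken", 0)
    )
    return 1.0 / (1.0 + math.exp(-logit))


# Invented questionnaire responses, not data from the study.
records: List[Record] = [
    {"raw_eggs": 1, "runny_eggs": 0, "chicken": 1},
    {"raw_eggs": 0, "runny_eggs": 1, "chicken": 0},
    {"raw_eggs": 0, "runny_eggs": 0, "chicken": 1},
    {"raw_eggs": 0, "runny_eggs": 0, "chicken": 0},
]

# Grouping exposures into a transmission route yields one summary PAF.
egg_paf = counterfactual_paf(records, toy_risk, ["raw_eggs", "runny_eggs"])
print(f"egg-related PAF (toy data): {egg_paf:.3f}")
```

Because the estimator only needs a function that maps a record to a predicted risk, `toy_risk` could in principle be replaced by the predicted probability from any fitted classifier. Sampling in a case-control design would also need to be accounted for, which this sketch ignores.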