Collider bias undermines our understanding of COVID-19 disease risk and severity

被引:539
作者
Griffith, Gareth J. [1 ,2 ]
Morris, Tim T. [1 ,2 ]
Tudball, Matthew J. [1 ,2 ]
Herbert, Annie [1 ,2 ]
Mancano, Giulia [1 ,2 ]
Pike, Lindsey [1 ,2 ]
Sharp, Gemma C. [1 ,2 ]
Sterne, Jonathan [2 ]
Palmer, Tom M. [1 ,2 ]
Smith, George Davey [1 ,2 ]
Tilling, Kate [1 ,2 ]
Zuccolo, Luisa [1 ,2 ]
Davies, Neil M. [1 ,2 ,3 ]
Hemani, Gibran [1 ,2 ]
机构
[1] Univ Bristol, Med Res Council, Integrat Epidemiol Unit, Bristol BS8 2BN, Avon, England
[2] Univ Bristol, Bristol Med Sch, Populat Hlth Sci, Oakfield House, Bristol BS8 2BN, Avon, England
[3] Norwegian Univ Sci & Technol, Dept Publ Hlth & Nursing, KG Jebsen Ctr Genet Epidemiol, NTNU, Trondheim, Norway
基金
英国惠康基金; 英国医学研究理事会;
关键词
RENIN-ANGIOTENSIN SYSTEM; SELECTION BIAS; NEGATIVE CONTROLS; CAUSAL-MODELS; INHIBITORS; PACKAGE; TOOL;
D O I
10.1038/s41467-020-19478-2
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Numerous observational studies have attempted to identify risk factors for infection with SARS-CoV-2 and COVID-19 disease outcomes. Studies have used datasets sampled from patients admitted to hospital, people tested for active infection, or people who volunteered to participate. Here, we highlight the challenge of interpreting observational evidence from such non-representative samples. Collider bias can induce associations between two or more variables which affect the likelihood of an individual being sampled, distorting associations between these variables in the sample. Analysing UK Biobank data, compared to the wider cohort the participants tested for COVID-19 were highly selected for a range of genetic, behavioural, cardiovascular, demographic, and anthropometric traits. We discuss the mechanisms inducing these problems, and approaches that could help mitigate them. While collider bias should be explored in existing studies, the optimal way to mitigate the problem is to use appropriate sampling strategies at the study design stage. Many published studies of the current SARS-CoV-2 pandemic have analysed data from non-representative samples from populations. Here, using UK BioBank samples, Gibran Hemani and colleagues discuss the potential for such studies to suffer from collider bias, and provide suggestions for optimising study design to account for this.
引用
收藏
页数:12
相关论文
共 102 条
[1]   Absence of obesity paradox in patients with chronic heart failure and diabetes mellitus: a propensity-matched study [J].
Adamopoulos, Chris ;
Meyer, Philippe ;
Desai, Ravi V. ;
Karatzidou, Kyparissi ;
Ovalle, Fernando ;
White, Michel ;
Aban, Inmaculada ;
Love, Thomas E. ;
Deedwania, Prakash ;
Anker, Stefan D. ;
Ahmed, Ali .
EUROPEAN JOURNAL OF HEART FAILURE, 2011, 13 (02) :200-206
[2]   Educational Note: Paradoxical collider effect in the analysis of non-communicable disease epidemiological data: a reproducible illustration and web application [J].
Angel Luque-Fernandez, Miguel ;
Schomaker, Michael ;
Redondo-Sanchez, Daniel ;
Sanchez Perez, Maria Jose ;
Vaidya, Anand ;
Schnitzer, Mireille E. .
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2019, 48 (02) :640-653
[3]  
[Anonymous], 2020, CAR HOM DEATHS UNT L
[4]  
[Anonymous], **DATA OBJECT**, DOI DOI 10.5523/BRIS.PNOAT8CXO0U52P6YNFAEKEIGI
[5]  
[Anonymous], 2020, DAILY MAIL
[6]  
[Anonymous], 2020, Coronavirus (COVID-19): Scaling up our testing programmes
[7]  
[Anonymous], 2020, NEW COVID 19 SURV UN
[8]  
[Anonymous], 2020, BBC News
[9]  
[Anonymous], 2020, UK BIOB MAK INF HLTH
[10]  
[Anonymous], 2020, OFFICE NATL STAT