Attacks on genetic privacy via uploads to genealogical databases

被引:25
|
作者
Edge, Michael D. [1 ,2 ,3 ]
Coop, Graham [1 ,2 ]
机构
[1] Univ Calif Davis, Ctr Populat Biol, Davis, CA 95616 USA
[2] Univ Calif Davis, Dept Evolut & Ecol, Davis, CA 95616 USA
[3] Univ Southern Calif, Quantitat & Computat Biol, Dept Biol Sci, Los Angeles, CA 90007 USA
来源
ELIFE | 2020年 / 9卷
基金
美国国家卫生研究院;
关键词
WHOLE-GENOME ASSOCIATION; DESCENT; IDENTITY; INFERENCE; HOMOZYGOSITY; RELATIVES; ANCESTRY; POLICY; FUTURE;
D O I
10.7554/eLife.51810
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Direct-to-consumer (DTC) genetics services are increasingly popular, with tens of millions of customers. Several DTC genealogy services allow users to upload genetic data to search for relatives, identified as people with genomes that share identical by state (IBS) regions. Here, we describe methods by which an adversary can learn database genotypes by uploading multiple datasets. For example, an adversary who uploads approximately 900 genomes could recover at least one allele at SNP sites across up to 82% of the genome of a median person of European ancestries. In databases that detect IBS segments using unphased genotypes, approximately 100 falsified uploads can reveal enough genetic information to allow genome-wide genetic imputation. We provide a proof-of-concept demonstration in the GEDmatch database, and we suggest countermeasures that will prevent the exploits we describe.
引用
收藏
页数:25
相关论文
共 50 条
  • [31] A Privacy-Enhancing Architecture for Databases
    Wahlstrom, Kirsten
    Quirchmayr, Gerald
    JOURNAL OF RESEARCH AND PRACTICE IN INFORMATION TECHNOLOGY, 2008, 40 (03): : 151 - 162
  • [32] Security and privacy for web databases and services
    Ferrari, E
    Thuraisingham, B
    ADVANCES IN DATABASE TECHNOLOGY - EDBT 2004, PROCEEDINGS, 2004, 2992 : 17 - 28
  • [33] Security and privacy issues for sensor databases
    Thuraisingham, B
    SENSOR LETTERS, 2004, 2 (01) : 37 - 47
  • [34] Privacy Protection in Outsourced Spatial Databases
    Kamel, Ibrahim
    Ba-Hutair, Mohammed N.
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2016, 10 (03) : 347 - 363
  • [35] Privacy problems with anonymized transaction databases
    Mielikäinen, T
    DISCOVERY SCIENCE, PROCEEDINGS, 2004, 3245 : 219 - 229
  • [36] Genealogical databases as a tool for extending follow-up in clinical reviews
    Thuy-Van Ho
    Chowdhury, Naweed
    Kandl, Christopher
    Hoover, Cindy
    Robinson, Ann
    Hoover, Larry
    INTERNATIONAL FORUM OF ALLERGY & RHINOLOGY, 2016, 6 (08) : 880 - 882
  • [37] The Sorenson Molecular Genealogical Foundation (SMGF) and the construction of a publicly accessible genetic genealogical database
    Woodward, SR
    Perego, U
    Myres, N
    AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY, 2006, : 189 - 189
  • [38] Genetic diversity and genealogical origins of domestic chicken
    Eltanany, M.
    Distl, O.
    WORLDS POULTRY SCIENCE JOURNAL, 2010, 66 (04) : 715 - 726
  • [39] A Survey on Privacy: Terminology, Mechanisms and Attacks
    Boussada, Rihab
    Elhdhili, Mohamed Elhoucine
    Saidane, Leila Azouz
    2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
  • [40] Online Attacks on Picture Owner Privacy
    Pijani, Bizhan Alipour
    Imine, Abdessamad
    Rusinowitch, Michael
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2020, PT II, 2020, 12392 : 33 - 47