Protecting genomic data analytics in the cloud: state of the art and opportunities

被引:0
|
作者
Haixu Tang
Xiaoqian Jiang
Xiaofeng Wang
Shuang Wang
Heidi Sofia
Dov Fox
Kristin Lauter
Bradley Malin
Amalio Telenti
Li Xiong
Lucila Ohno-Machado
机构
[1] Indiana University,School of Informatics and Computing
[2] University of California San Diego,Department of Biomedical Informatics
[3] National Human Genome Research Institute,School of Law
[4] University of San Diego,Department of Biomedical Informatics, School of Medicine
[5] Microsoft Research,Department of Mathematics and Computer Science
[6] Vanderbilt University,undefined
[7] The J. Craig Venter Institute,undefined
[8] Emory University,undefined
来源
关键词
Edit Distance; Data Owner; Public Cloud; Cryptographic Protocol; Homomorphic Encryption;
D O I
暂无
中图分类号
学科分类号
摘要
The outsourcing of genomic data into public cloud computing settings raises concerns over privacy and security. Significant advancements in secure computation methods have emerged over the past several years, but such techniques need to be rigorously evaluated for their ability to support the analysis of human genomic data in an efficient and cost-effective manner. With respect to public cloud environments, there are concerns about the inadvertent exposure of human genomic data to unauthorized users. In analyses involving multiple institutions, there is additional concern about data being used beyond agreed research scope and being prcoessed in untrused computational environments, which may not satisfy institutional policies. To systematically investigate these issues, the NIH-funded National Center for Biomedical Computing iDASH (integrating Data for Analysis, ‘anonymization’ and SHaring) hosted the second Critical Assessment of Data Privacy and Protection competition to assess the capacity of cryptographic technologies for protecting computation over human genomes in the cloud and promoting cross-institutional collaboration. Data scientists were challenged to design and engineer practical algorithms for secure outsourcing of genome computation tasks in working software, whereby analyses are performed only on encrypted data. They were also challenged to develop approaches to enable secure collaboration on data from genomic studies generated by multiple organizations (e.g., medical centers) to jointly compute aggregate statistics without sharing individual-level records. The results of the competition indicated that secure computation techniques can enable comparative analysis of human genomes, but greater efficiency (in terms of compute time and memory utilization) are needed before they are sufficiently practical for real world environments.
引用
收藏
相关论文
共 50 条
  • [11] Protecting aggregate genomic data
    Zerhouni, Elias A.
    Nabel, Elizabeth G.
    SCIENCE, 2008, 322 (5898) : 44 - 44
  • [12] Big Data Provenance: Challenges, State of the Art and Opportunities
    Wang, Jianwu
    Crawl, Daniel
    Purawat, Shweta
    Nguyen, Mai
    Altintas, Ilkay
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 2509 - 2516
  • [13] State of the Art of Construction Analytics
    Naderpajouh, Nader
    Choi, Juyeong
    Hastak, Makarand
    CONSTRUCTION RESEARCH CONGRESS 2016: OLD AND NEW CONSTRUCTION TECHNOLOGIES CONVERGE IN HISTORIC SAN JUAN, 2016, : 970 - 979
  • [14] Business Analytics — State of the Art
    Peter Chamoni
    Peter Gluchowski
    Controlling & Management Review, 2017, 61 (4) : 8 - 17
  • [15] Learning analytics: state of the art
    Hernandez-de-Menendez, Marcela
    Morales-Menendez, Ruben
    Escobar, Carlos A.
    Ramirez Mendoza, Ricardo A.
    INTERNATIONAL JOURNAL OF INTERACTIVE DESIGN AND MANUFACTURING - IJIDEM, 2022, 16 (03): : 1209 - 1230
  • [16] The State of the Art of Visual Analytics
    Ham, Dong-Han
    EKC 2009 PROCEEDINGS OF EU-KOREA CONFERENCE ON SCIENCE AND TECHNOLOGY, 2010, 135 : 213 - 222
  • [17] Learning analytics: state of the art
    Marcela Hernández-de-Menéndez
    Ruben Morales-Menendez
    Carlos A. Escobar
    Ricardo A. Ramírez Mendoza
    International Journal on Interactive Design and Manufacturing (IJIDeM), 2022, 16 : 1209 - 1230
  • [18] Protecting Sensitive Data in the Information Age: State of the Art and Future Prospects
    Stach, Christoph
    Gritti, Clementine
    Braecker, Julia
    Behringer, Michael
    Mitschang, Bernhard
    FUTURE INTERNET, 2022, 14 (11):
  • [19] State of Art of Data Mining and Learning Analytics Tools in Higher Education
    Salihoun, Mohammed
    INTERNATIONAL JOURNAL OF EMERGING TECHNOLOGIES IN LEARNING, 2020, 15 (21): : 58 - 76
  • [20] The State of the Art in Visual Analytics for 3D Urban Data
    Miranda, Fabio
    Ortner, Thomas
    Moreira, Gustavo
    Hosseini, Maryam
    Vuckovic, Milena
    Biljecki, Filip
    Silva, Claudio T.
    Lage, Marcos
    Ferreira, Nivan
    COMPUTER GRAPHICS FORUM, 2024, 43 (03)