Protecting genomic data analytics in the cloud: state of the art and opportunities

被引:0
|
作者
Haixu Tang
Xiaoqian Jiang
Xiaofeng Wang
Shuang Wang
Heidi Sofia
Dov Fox
Kristin Lauter
Bradley Malin
Amalio Telenti
Li Xiong
Lucila Ohno-Machado
机构
[1] Indiana University,School of Informatics and Computing
[2] University of California San Diego,Department of Biomedical Informatics
[3] National Human Genome Research Institute,School of Law
[4] University of San Diego,Department of Biomedical Informatics, School of Medicine
[5] Microsoft Research,Department of Mathematics and Computer Science
[6] Vanderbilt University,undefined
[7] The J. Craig Venter Institute,undefined
[8] Emory University,undefined
来源
关键词
Edit Distance; Data Owner; Public Cloud; Cryptographic Protocol; Homomorphic Encryption;
D O I
暂无
中图分类号
学科分类号
摘要
The outsourcing of genomic data into public cloud computing settings raises concerns over privacy and security. Significant advancements in secure computation methods have emerged over the past several years, but such techniques need to be rigorously evaluated for their ability to support the analysis of human genomic data in an efficient and cost-effective manner. With respect to public cloud environments, there are concerns about the inadvertent exposure of human genomic data to unauthorized users. In analyses involving multiple institutions, there is additional concern about data being used beyond agreed research scope and being prcoessed in untrused computational environments, which may not satisfy institutional policies. To systematically investigate these issues, the NIH-funded National Center for Biomedical Computing iDASH (integrating Data for Analysis, ‘anonymization’ and SHaring) hosted the second Critical Assessment of Data Privacy and Protection competition to assess the capacity of cryptographic technologies for protecting computation over human genomes in the cloud and promoting cross-institutional collaboration. Data scientists were challenged to design and engineer practical algorithms for secure outsourcing of genome computation tasks in working software, whereby analyses are performed only on encrypted data. They were also challenged to develop approaches to enable secure collaboration on data from genomic studies generated by multiple organizations (e.g., medical centers) to jointly compute aggregate statistics without sharing individual-level records. The results of the competition indicated that secure computation techniques can enable comparative analysis of human genomes, but greater efficiency (in terms of compute time and memory utilization) are needed before they are sufficiently practical for real world environments.
引用
收藏
相关论文
共 50 条
  • [41] Towards Cloud-based Analytics-as-a-Service (CLAaaS) for Big Data Analytics in the Cloud
    Zulkernine, Farhana
    Martin, Patrick
    Zou, Ying
    Bauer, Michael
    Gwadry-Sridhar, Femida
    Aboulnaga, Ashraf
    2013 IEEE INTERNATIONAL CONGRESS ON BIG DATA, 2013, : 62 - 69
  • [42] Cloud Kotta: Enabling Secure and Scalable Data Analytics in the Cloud
    Babuji, Yadu N.
    Chard, Kyle
    Gerow, Aaron
    Duede, Eamon
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 302 - 310
  • [43] The opportunities, challenges and obligations of Fitness Data Analytics
    Bhargava, Yesoda
    Nabi, Javaid
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 1354 - 1362
  • [44] How data analytics is changing entrepreneurial opportunities?
    Sedkaoui, Soraya
    INTERNATIONAL JOURNAL OF INNOVATION SCIENCE, 2018, 10 (02) : 274 - 294
  • [45] Challenges and opportunities of big data analytics in healthcare
    Goyal, Priyanshi
    Malviya, Rishabha
    HEALTH CARE SCIENCE, 2023, 2 (05): : 328 - 338
  • [46] Big Education: Opportunities for Big Data Analytics
    Cen, Ling
    Ruta, Dymitr
    Ng, Jason
    2015 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2015, : 502 - 506
  • [47] Libraries and Institutional Data Analytics: Challenges and Opportunities
    Booth, H. Austin
    Hendrix, Dean
    JOURNAL OF ACADEMIC LIBRARIANSHIP, 2015, 41 (05): : 695 - 699
  • [48] Protecting Genomic Data Privacy with Probabilistic Modeling
    Simmons, Sean
    Berger, Bonnie
    Sahinalp, Cenk
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2019, 2019, : 403 - 414
  • [50] Scalable genomic data exchange and analytics with sBeacon
    Anuradha Wickramarachchi
    Brendan Hosking
    Yatish Jain
    John Grimes
    Mitchell J. O’Brien
    Tracey Wright
    Mark A. Burgess
    Victor San Kho Lin
    Florian Reisinger
    Oliver Hofmann
    Michael Lawley
    Laurence O. W. Wilson
    Natalie A. Twine
    Denis C. Bauer
    Nature Biotechnology, 2023, 41 : 1510 - 1512