Protecting genomic data analytics in the cloud: state of the art and opportunities

被引:0
|
作者
Haixu Tang
Xiaoqian Jiang
Xiaofeng Wang
Shuang Wang
Heidi Sofia
Dov Fox
Kristin Lauter
Bradley Malin
Amalio Telenti
Li Xiong
Lucila Ohno-Machado
机构
[1] Indiana University,School of Informatics and Computing
[2] University of California San Diego,Department of Biomedical Informatics
[3] National Human Genome Research Institute,School of Law
[4] University of San Diego,Department of Biomedical Informatics, School of Medicine
[5] Microsoft Research,Department of Mathematics and Computer Science
[6] Vanderbilt University,undefined
[7] The J. Craig Venter Institute,undefined
[8] Emory University,undefined
来源
关键词
Edit Distance; Data Owner; Public Cloud; Cryptographic Protocol; Homomorphic Encryption;
D O I
暂无
中图分类号
学科分类号
摘要
The outsourcing of genomic data into public cloud computing settings raises concerns over privacy and security. Significant advancements in secure computation methods have emerged over the past several years, but such techniques need to be rigorously evaluated for their ability to support the analysis of human genomic data in an efficient and cost-effective manner. With respect to public cloud environments, there are concerns about the inadvertent exposure of human genomic data to unauthorized users. In analyses involving multiple institutions, there is additional concern about data being used beyond agreed research scope and being prcoessed in untrused computational environments, which may not satisfy institutional policies. To systematically investigate these issues, the NIH-funded National Center for Biomedical Computing iDASH (integrating Data for Analysis, ‘anonymization’ and SHaring) hosted the second Critical Assessment of Data Privacy and Protection competition to assess the capacity of cryptographic technologies for protecting computation over human genomes in the cloud and promoting cross-institutional collaboration. Data scientists were challenged to design and engineer practical algorithms for secure outsourcing of genome computation tasks in working software, whereby analyses are performed only on encrypted data. They were also challenged to develop approaches to enable secure collaboration on data from genomic studies generated by multiple organizations (e.g., medical centers) to jointly compute aggregate statistics without sharing individual-level records. The results of the competition indicated that secure computation techniques can enable comparative analysis of human genomes, but greater efficiency (in terms of compute time and memory utilization) are needed before they are sufficiently practical for real world environments.
引用
收藏
相关论文
共 50 条
  • [31] Cloud Supported Building Data Analytics
    Petri, Ioan
    Rana, Omer
    Rezgui, Yacine
    Li, Haijiang
    Beach, Tom
    Zou, Mengsong
    Diaz-Montes, Javier
    Parashar, Manish
    2014 14TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2014, : 641 - 650
  • [32] Data Analytics using Cloud Computing
    Maheshwari, Prakhar
    Singhal, Alankar
    Qadeer, Mohammed A.
    2017 9TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2017, : 82 - 87
  • [33] Data protection in cloud computing: A Survey of the State-of-Art
    Amamou, Sonia
    Trifa, Zied
    Khmakhem, Maher
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES 2019), 2019, 159 : 155 - 161
  • [34] Big Data Analytics for Sustainable Products: A State-of-the-Art Review and Analysis
    Gholami, Hamed
    Lee, Jocelyn Ke Yin
    Ali, Ahad
    SUSTAINABILITY, 2023, 15 (17)
  • [35] Big data analytics in food industry: a state-of-the-art literature review
    Aftab Siddique
    Ashish Gupta
    Jason T. Sawyer
    Tung-Shi Huang
    Amit Morey
    npj Science of Food, 9 (1)
  • [36] Privacy preserving big data analytics: A critical analysis of state-of-the-art
    Pramanik, M. Ileas
    Lau, Raymond Y. K.
    Hossain, Md Sakir
    Rahoman, Md Mizanur
    Debnath, Sumon Kumar
    Rashed, Md Golam
    Uddin, Md Zasim
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2021, 11 (01)
  • [37] Real Time Analytics - State of the art
    Trinks, Sebastian
    Felden, Carsten
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 4843 - 4845
  • [38] The state of the art and taxonomy of big data analytics: view from new big data framework
    Azlinah Mohamed
    Maryam Khanian Najafabadi
    Yap Bee Wah
    Ezzatul Akmal Kamaru Zaman
    Ruhaila Maskat
    Artificial Intelligence Review, 2020, 53 : 989 - 1037
  • [39] The state of the art and taxonomy of big data analytics: view from new big data framework
    Mohamed, Azlinah
    Najafabadi, Maryam Khanian
    Wah, Yap Bee
    Zaman, Ezzatul Akmal Kamaru
    Maskat, Ruhaila
    ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (02) : 989 - 1037
  • [40] Authenticable Data Analytics Over Encrypted Data in the Cloud
    Chen, Lanxiang
    Mu, Yi
    Zeng, Lingfang
    Rezaeibagha, Fatemeh
    Deng, Robert H.
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 1800 - 1813