Synthetic Data Digital Twins and Data Trusts Control for Privacy in Health Data Sharing

被引:0
|
作者
Lomotey, Richard K. [1 ]
Kumi, Sandra [2 ]
Ray, Madhurima [3 ]
Deters, Ralph [2 ]
机构
[1] Penn State Univ, Informat Sci & Tech, Monaca, PA 15061 USA
[2] Univ Saskatchewan, Dept Comp Sci, Saskatoon, SK, Canada
[3] Penn State Univ, Dept Comp Sci, Monaca, PA USA
关键词
Synthetic Health Data; Digital Twins; Data Trusts; Machine Learning; Artificial Intelligence; Privacy; Middleware; FRAMEWORK;
D O I
10.1145/3643650.3658605
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Health data sharing is very valuable for medical research since it has the propensity to improve diagnostics, policy, medication, and so on. At the same time, sharing health data needs to be done without compromising the privacy of patients and stakeholders. However, recent advances in AI/ML and sophisticated analytics have proven to introduce biases that can easily identify patients based on their healthcare data, which violates privacy. In this work, we sort to address this major issue by exploring two emerging topics that are gaining attention from industry, academia, and governments, i.e., digital twins and data trusts. First, we proposed the use of digital twins (DTs) to generate synthetic records of patient's heart rate data. DTs are virtual replicas of the actual data and were created using two synthetic data generative models - Gaussian Copula (GC) and Tabular Variational Autoencoder (TVAE). The GC and TVAE achieved a maximum data quality score of 88% and 96% respectively. Next, we posit that the DTs should be shared with a data trusts layer. Data trusts are fiduciary frameworks that govern multi-party data sharing. The data trusts enforce access controls (based on metrics such as location, role-based, and policy-based) to the synthetic health data and reports to the data subject. The preliminary evaluations of the work show that merging the two techniques (i.e., synthetic data digital twins and data trusts) enforces better privacy for health data access. The synthetic data ensures more anonymization while the data trusts provide easy auditing, tracking, and efficient reporting to the patient or data subject. The paper also detailed the architectural design of the data trusts and evaluated the efficiency of the access control techniques.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 50 条
  • [21] ENABLING DATA SHARING THROUGH DATA TRUSTS IN LEO SATELLITE INTERNET
    Wang, Ruyan
    Zhang, Shiqi
    Yang, Boran
    Yang, Zhigang
    Zhang, Puning
    Wu, Dapeng
    IEEE WIRELESS COMMUNICATIONS, 2024, 31 (01) : 70 - 76
  • [22] Genetic Data Sharing and Privacy
    Marco D. Sorani
    John K. Yue
    Sourabh Sharma
    Geoffrey T. Manley
    Adam R. Ferguson
    Shelly R. Cooper
    Kristen Dams-O’Connor
    Wayne A. Gordon
    Hester F. Lingsma
    Andrew I. R. Maas
    David K. Menon
    Diane J. Morabito
    Pratik Mukherjee
    David O. Okonkwo
    Ava M. Puccio
    Alex B. Valadka
    Esther L. Yuh
    Neuroinformatics, 2015, 13 : 1 - 6
  • [23] Genetic Data Sharing and Privacy
    Sorani, Marco D.
    Yue, John K.
    Sharma, Sourabh
    Manley, Geoffrey T.
    Ferguson, Adam R.
    Cooper, Shelly R.
    Dams-O'Connor, Kristen
    Gordon, Wayne A.
    Lingsma, Hester F.
    Maas, Andrew I. R.
    Menon, David K.
    Morabito, Diane J.
    Mukherjee, Pratik
    Okonkwo, David O.
    Puccio, Ava M.
    Valadka, Alex B.
    Yuh, Esther L.
    NEUROINFORMATICS, 2015, 13 (01) : 1 - 6
  • [24] Spatial data trusts: an emerging governance framework for sharing spatial data
    Radosevic, Nenad
    Duckham, Matt
    Rahaman, Mohammad Saiedur
    Ho, Serene
    Williams, Katherine
    Hashem, Tanzima
    Tao, Yaguang
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2023, 16 (01) : 1607 - 1639
  • [25] Towards effective data sharing in ophthalmology: data standardization and data privacy
    Halfpenny, William
    Baxter, Sally L.
    CURRENT OPINION IN OPHTHALMOLOGY, 2022, 33 (05) : 418 - 424
  • [26] Mastering data privacy: leveraging K-anonymity for robust health data sharing
    Karagiannis, Stylianos
    Ntantogian, Christoforos
    Magkos, Emmanouil
    Tsohou, Aggeliki
    Ribeiro, Luis Landeiro
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2024, 23 (03) : 2189 - 2201
  • [27] Generation and evaluation of privacy preserving synthetic health data
    Yale, Andrew
    Dash, Saloni
    Dutta, Ritik
    Guyon, Isabelle
    Pavao, Adrien
    Bennett, Kristin P.
    NEUROCOMPUTING, 2020, 416 : 244 - 255
  • [28] Edge Centric Secure Data Sharing with Digital Twins in Smart Ecosystems
    Cathey, Glen
    Benson, James
    Gupta, Maanak
    Sandhu, Ravi
    2021 THIRD IEEE INTERNATIONAL CONFERENCE ON TRUST, PRIVACY AND SECURITY IN INTELLIGENT SYSTEMS AND APPLICATIONS (TPS-ISA 2021), 2021, : 70 - 79
  • [29] Federated Knowledge Recycling: Privacy-preserving synthetic data sharing
    Lomurno, Eugenio
    Matteucci, Matteo
    PATTERN RECOGNITION LETTERS, 2025, 191 : 124 - 130
  • [30] Digital Public Health and Digital Health: twins or Neighbors? A data protection perspective
    Silva, A. Macedo
    Ventura, M.
    EUROPEAN JOURNAL OF PUBLIC HEALTH, 2024, 34