Synthetic Data Digital Twins and Data Trusts Control for Privacy in Health Data Sharing

被引:0
|
作者
Lomotey, Richard K. [1 ]
Kumi, Sandra [2 ]
Ray, Madhurima [3 ]
Deters, Ralph [2 ]
机构
[1] Penn State Univ, Informat Sci & Tech, Monaca, PA 15061 USA
[2] Univ Saskatchewan, Dept Comp Sci, Saskatoon, SK, Canada
[3] Penn State Univ, Dept Comp Sci, Monaca, PA USA
关键词
Synthetic Health Data; Digital Twins; Data Trusts; Machine Learning; Artificial Intelligence; Privacy; Middleware; FRAMEWORK;
D O I
10.1145/3643650.3658605
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Health data sharing is very valuable for medical research since it has the propensity to improve diagnostics, policy, medication, and so on. At the same time, sharing health data needs to be done without compromising the privacy of patients and stakeholders. However, recent advances in AI/ML and sophisticated analytics have proven to introduce biases that can easily identify patients based on their healthcare data, which violates privacy. In this work, we sort to address this major issue by exploring two emerging topics that are gaining attention from industry, academia, and governments, i.e., digital twins and data trusts. First, we proposed the use of digital twins (DTs) to generate synthetic records of patient's heart rate data. DTs are virtual replicas of the actual data and were created using two synthetic data generative models - Gaussian Copula (GC) and Tabular Variational Autoencoder (TVAE). The GC and TVAE achieved a maximum data quality score of 88% and 96% respectively. Next, we posit that the DTs should be shared with a data trusts layer. Data trusts are fiduciary frameworks that govern multi-party data sharing. The data trusts enforce access controls (based on metrics such as location, role-based, and policy-based) to the synthetic health data and reports to the data subject. The preliminary evaluations of the work show that merging the two techniques (i.e., synthetic data digital twins and data trusts) enforces better privacy for health data access. The synthetic data ensures more anonymization while the data trusts provide easy auditing, tracking, and efficient reporting to the patient or data subject. The paper also detailed the architectural design of the data trusts and evaluated the efficiency of the access control techniques.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 50 条
  • [41] Transparent sharing of digital health data: A call to action
    Slotwiner, David J.
    Tarakji, Khaldoun G.
    Al-Khatib, Sana M.
    Passman, Rod S.
    Saxon, Leslie A.
    Peters, Nicholas S.
    McCall, Debbe
    HEART RHYTHM, 2019, 16 (09) : E95 - E106
  • [42] Data Anonymization as a Vector Quantization Problem: Control Over Privacy for Health Data
    Miche, Yoan
    Oliver, Ian
    Holtmanns, Silke
    Kalliola, Aapo
    Akusok, Anton
    Lendasse, Amaury
    Bjork, Kaj-Mikael
    AVAILABILITY, RELIABILITY, AND SECURITY IN INFORMATION SYSTEMS, CD-ARES 2016, PAML 2016, 2016, 9817 : 193 - 203
  • [43] PRIVACY CONSIDERATIONS FOR SHARING GENOMICS DATA
    Oestreich, Marie
    Chen, Dingfan
    Schultze, Joachim L.
    Fritz, Mario
    Becker, Matthias
    EXCLI JOURNAL, 2021, 20 : 1243 - 1260
  • [44] Summary Statistic Privacy in Data Sharing
    Lin Z.
    Wang S.
    Sekar V.
    Fanti G.
    IEEE Journal on Selected Areas in Information Theory, 2024, 5 : 369 - 384
  • [45] Data Sharing and Privacy in Pharmaceutical Studies
    Chen, Rufan
    Zhang, Yi
    Dou, Zuochao
    Chen, Feng
    Xie, Kang
    Wang, Shuang
    CURRENT PHARMACEUTICAL DESIGN, 2021, 27 (07) : 911 - 918
  • [46] Data sharing: guard the privacy of donors
    Shirley Y. Hill
    Nature, 2017, 548 : 281 - 281
  • [47] Data sharing: guard the privacy of donors
    Hill, Shirley Y.
    NATURE, 2017, 548 (7667) : 281 - 281
  • [48] Preserving Privacy While Sharing Data
    Garfinkel, Simson L.
    Bowen, Claire McKay
    MIT SLOAN MANAGEMENT REVIEW, 2022, 63 (04) : 7 - +
  • [49] Dynamics in Data Privacy and Sharing Economics
    Ray, Shubhadip
    Palanivel, Tharangini
    Herman, Norbert
    Li, Yixuan
    IEEE Transactions on Technology and Society, 2021, 2 (03): : 114 - 115
  • [50] Privacy Risk in Cybersecurity Data Sharing
    Bhatia, Jaspreet
    Breaux, Travis D.
    Friedberg, Liora
    Hibshi, Hanan
    Smullen, Daniel
    WISCS'16: PROCEEDINGS OF THE 2016 ACM WORKSHOP ON INFORMATION SHARING AND COLLABORATIVE SECURITY, 2016, : 57 - 64