Privacy-Preserving Techniques in Generative AI and Large Language Models: A Narrative Review

被引:3
|
作者
Feretzakis, Georgios [1 ]
Papaspyridis, Konstantinos [2 ]
Gkoulalas-Divanis, Aris [3 ]
Verykios, Vassilios S. [1 ]
机构
[1] Hellen Open Univ, Sch Sci & Technol, Patras 26335, Greece
[2] Univ Toronto, Comp Sci, Toronto, ON M5S 2E4, Canada
[3] Merat Healthcare, Dublin D02 NY19, Ireland
关键词
privacy-preserving techniques; generative AI; large language models (LLMs); differential privacy; federated learning; homomorphic encryption; secure multi-party computation; model inversion; membership inference; privacy-enhancing technologies; post-quantum cryptography; CHALLENGES;
D O I
10.3390/info15110697
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Generative AI, including large language models (LLMs), has transformed the paradigm of data generation and creative content, but this progress raises critical privacy concerns, especially when models are trained on sensitive data. This review provides a comprehensive overview of privacy-preserving techniques aimed at safeguarding data privacy in generative AI, such as differential privacy (DP), federated learning (FL), homomorphic encryption (HE), and secure multi-party computation (SMPC). These techniques mitigate risks like model inversion, data leakage, and membership inference attacks, which are particularly relevant to LLMs. Additionally, the review explores emerging solutions, including privacy-enhancing technologies and post-quantum cryptography, as future directions for enhancing privacy in generative AI systems. Recognizing that achieving absolute privacy is mathematically impossible, the review emphasizes the necessity of aligning technical safeguards with legal and regulatory frameworks to ensure compliance with data protection laws. By discussing the ethical and legal implications of privacy risks in generative AI, the review underscores the need for a balanced approach that considers performance, scalability, and privacy preservation. The findings highlight the need for ongoing research and innovation to develop privacy-preserving techniques that keep pace with the scaling of generative AI, especially in large language models, while adhering to regulatory and ethical standards.
引用
收藏
页数:25
相关论文
共 50 条
  • [41] Privacy-Preserving Learning Analytics: Challenges and Techniques
    Gursoy, Mehmet Emre
    Inan, Ali
    Nergiz, Mehmet Ercan
    Saygin, Yucel
    IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES, 2017, 10 (01): : 68 - 81
  • [42] Privacy-Preserving Techniques and System for Streaming Databases
    de Oliveira, Anderson Santana
    Kerschbaum, Florian
    Lim, Hoon Wei
    Yu, Su-Yang
    Proceedings of 2012 ASE/IEEE International Conference on Privacy, Security, Risk and Trust and 2012 ASE/IEEE International Conference on Social Computing (SocialCom/PASSAT 2012), 2012, : 728 - 733
  • [43] Privacy-preserving healthcare informatics: a review
    Chong, Kah Meng
    16TH IMT-GT INTERNATIONAL CONFERENCE ON MATHEMATICS, STATISTICS AND THEIR APPLICATIONS (ICMSA 2020), 2021, 36
  • [44] Survey on Privacy-Preserving Techniques for Microdata Publication
    Carvalho, Tania
    Moniz, Nuno
    Faria, Pedro
    Antunes, Luis
    ACM COMPUTING SURVEYS, 2023, 55 (14S)
  • [45] Privacy-Preserving Proof of Storage in Large Group
    Ren, Yongjun
    Han, Jin
    Wang, Jin
    Fang, Liming
    49TH ANNUAL IEEE INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY (ICCST), 2015, : 269 - 272
  • [46] Privacy-Preserving Triangle Counting in Large Graphs
    Ding, Xiaofeng
    Zhang, Xiaodong
    Bao, Zhifeng
    Jin, Hai
    CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 1283 - 1292
  • [47] Advancing Power System Services With Privacy-Preserving Federated Learning Techniques: A Review
    Zheng, Ran
    Sumper, Andreas
    Aragues-Penalba, Monica
    Galceran-Arellano, Samuel
    IEEE ACCESS, 2024, 12 : 76753 - 76780
  • [48] Investigation of Privacy-Preserving Data Models and Contributions
    Ahuja, Kamlesh
    Sharma, Navneet
    Mishra, Durgesh Kumar
    Vyas, Ram Krishan
    PROCEEDINGS OF THE 2019 6TH INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2019, : 779 - 783
  • [49] Truthful and privacy-preserving generalized linear models
    Qiu, Yuan
    Liu, Jinyan
    Wang, Di
    INFORMATION AND COMPUTATION, 2024, 301
  • [50] Transforming Assessment: The Impacts and Implications of Large Language Models and Generative AI
    Hao, Jiangang
    von Davier, Alina A.
    Yaneva, Victoria
    Lottridge, Susan
    von Davier, Matthias
    Harris, Deborah J.
    EDUCATIONAL MEASUREMENT-ISSUES AND PRACTICE, 2024, 43 (02) : 16 - 29