Privacy-Preserving Techniques in Generative AI and Large Language Models: A Narrative Review

被引：3

作者：

Feretzakis, Georgios ^{[1
]}

Papaspyridis, Konstantinos ^{[2
]}

Gkoulalas-Divanis, Aris ^{[3
]}

Verykios, Vassilios S. ^{[1
]}

机构：

[1] Hellen Open Univ, Sch Sci & Technol, Patras 26335, Greece

[2] Univ Toronto, Comp Sci, Toronto, ON M5S 2E4, Canada

[3] Merat Healthcare, Dublin D02 NY19, Ireland

来源：

INFORMATION | 2024年 / 15卷 / 11期

关键词：

privacy-preserving techniques; generative AI; large language models (LLMs); differential privacy; federated learning; homomorphic encryption; secure multi-party computation; model inversion; membership inference; privacy-enhancing technologies; post-quantum cryptography; CHALLENGES;

D O I：

10.3390/info15110697

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Generative AI, including large language models (LLMs), has transformed the paradigm of data generation and creative content, but this progress raises critical privacy concerns, especially when models are trained on sensitive data. This review provides a comprehensive overview of privacy-preserving techniques aimed at safeguarding data privacy in generative AI, such as differential privacy (DP), federated learning (FL), homomorphic encryption (HE), and secure multi-party computation (SMPC). These techniques mitigate risks like model inversion, data leakage, and membership inference attacks, which are particularly relevant to LLMs. Additionally, the review explores emerging solutions, including privacy-enhancing technologies and post-quantum cryptography, as future directions for enhancing privacy in generative AI systems. Recognizing that achieving absolute privacy is mathematically impossible, the review emphasizes the necessity of aligning technical safeguards with legal and regulatory frameworks to ensure compliance with data protection laws. By discussing the ethical and legal implications of privacy risks in generative AI, the review underscores the need for a balanced approach that considers performance, scalability, and privacy preservation. The findings highlight the need for ongoing research and innovation to develop privacy-preserving techniques that keep pace with the scaling of generative AI, especially in large language models, while adhering to regulatory and ethical standards.

引用

页数：25

共 50 条

[41] Privacy-Preserving Learning Analytics: Challenges and Techniques
Gursoy, Mehmet Emre
Inan, Ali
Nergiz, Mehmet Ercan
Saygin, Yucel
IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES, 2017, 10 (01): : 68 - 81
[42] Privacy-Preserving Techniques and System for Streaming Databases
de Oliveira, Anderson Santana
Kerschbaum, Florian
Lim, Hoon Wei
Yu, Su-Yang
Proceedings of 2012 ASE/IEEE International Conference on Privacy, Security, Risk and Trust and 2012 ASE/IEEE International Conference on Social Computing (SocialCom/PASSAT 2012), 2012, : 728 - 733
[43] Privacy-preserving healthcare informatics: a review
Chong, Kah Meng
16TH IMT-GT INTERNATIONAL CONFERENCE ON MATHEMATICS, STATISTICS AND THEIR APPLICATIONS (ICMSA 2020), 2021, 36
[44] Survey on Privacy-Preserving Techniques for Microdata Publication
Carvalho, Tania
Moniz, Nuno
Faria, Pedro
Antunes, Luis
ACM COMPUTING SURVEYS, 2023, 55 (14S)
[45] Privacy-Preserving Proof of Storage in Large Group
Ren, Yongjun
Han, Jin
Wang, Jin
Fang, Liming
49TH ANNUAL IEEE INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY (ICCST), 2015, : 269 - 272
[46] Privacy-Preserving Triangle Counting in Large Graphs
Ding, Xiaofeng
Zhang, Xiaodong
Bao, Zhifeng
Jin, Hai
CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 1283 - 1292
[47] Advancing Power System Services With Privacy-Preserving Federated Learning Techniques: A Review
Zheng, Ran
Sumper, Andreas
Aragues-Penalba, Monica
Galceran-Arellano, Samuel
IEEE ACCESS, 2024, 12 : 76753 - 76780
[48] Investigation of Privacy-Preserving Data Models and Contributions
Ahuja, Kamlesh
Sharma, Navneet
Mishra, Durgesh Kumar
Vyas, Ram Krishan
PROCEEDINGS OF THE 2019 6TH INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2019, : 779 - 783
[49] Truthful and privacy-preserving generalized linear models
Qiu, Yuan
Liu, Jinyan
Wang, Di
INFORMATION AND COMPUTATION, 2024, 301
[50] Transforming Assessment: The Impacts and Implications of Large Language Models and Generative AI
Hao, Jiangang
von Davier, Alina A.
Yaneva, Victoria
Lottridge, Susan
von Davier, Matthias
Harris, Deborah J.
EDUCATIONAL MEASUREMENT-ISSUES AND PRACTICE, 2024, 43 (02) : 16 - 29

← 1 2 3 4 5 →