Privacy-Preserving Techniques in Generative AI and Large Language Models: A Narrative Review

Cited by: 3

Authors:

Feretzakis, Georgios [1]

Papaspyridis, Konstantinos [2]

Gkoulalas-Divanis, Aris [3]

Verykios, Vassilios S. [1]

Affiliations:

[1] Hellenic Open Univ, Sch Sci & Technol, Patras 26335, Greece

[2] Univ Toronto, Comp Sci, Toronto, ON M5S 2E4, Canada

[3] Merat Healthcare, Dublin D02 NY19, Ireland

Source:

INFORMATION | 2024 / Vol. 15 / Issue 11

Keywords:

privacy-preserving techniques; generative AI; large language models (LLMs); differential privacy; federated learning; homomorphic encryption; secure multi-party computation; model inversion; membership inference; privacy-enhancing technologies; post-quantum cryptography; CHALLENGES

DOI:

10.3390/info15110697

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Generative AI, including large language models (LLMs), has transformed the paradigm of data generation and creative content, but this progress raises critical privacy concerns, especially when models are trained on sensitive data. This review provides a comprehensive overview of privacy-preserving techniques aimed at safeguarding data privacy in generative AI, such as differential privacy (DP), federated learning (FL), homomorphic encryption (HE), and secure multi-party computation (SMPC). These techniques mitigate risks like model inversion, data leakage, and membership inference attacks, which are particularly relevant to LLMs. Additionally, the review explores emerging solutions, including privacy-enhancing technologies and post-quantum cryptography, as future directions for enhancing privacy in generative AI systems. Recognizing that achieving absolute privacy is mathematically impossible, the review emphasizes the necessity of aligning technical safeguards with legal and regulatory frameworks to ensure compliance with data protection laws. By discussing the ethical and legal implications of privacy risks in generative AI, the review underscores the need for a balanced approach that considers performance, scalability, and privacy preservation. The findings highlight the need for ongoing research and innovation to develop privacy-preserving techniques that keep pace with the scaling of generative AI, especially in large language models, while adhering to regulatory and ethical standards.

Pages: 25
