Privacy-Preserving Techniques in Generative AI and Large Language Models: A Narrative Review

被引:3
|
作者
Feretzakis, Georgios [1 ]
Papaspyridis, Konstantinos [2 ]
Gkoulalas-Divanis, Aris [3 ]
Verykios, Vassilios S. [1 ]
机构
[1] Hellen Open Univ, Sch Sci & Technol, Patras 26335, Greece
[2] Univ Toronto, Comp Sci, Toronto, ON M5S 2E4, Canada
[3] Merat Healthcare, Dublin D02 NY19, Ireland
关键词
privacy-preserving techniques; generative AI; large language models (LLMs); differential privacy; federated learning; homomorphic encryption; secure multi-party computation; model inversion; membership inference; privacy-enhancing technologies; post-quantum cryptography; CHALLENGES;
D O I
10.3390/info15110697
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Generative AI, including large language models (LLMs), has transformed the paradigm of data generation and creative content, but this progress raises critical privacy concerns, especially when models are trained on sensitive data. This review provides a comprehensive overview of privacy-preserving techniques aimed at safeguarding data privacy in generative AI, such as differential privacy (DP), federated learning (FL), homomorphic encryption (HE), and secure multi-party computation (SMPC). These techniques mitigate risks like model inversion, data leakage, and membership inference attacks, which are particularly relevant to LLMs. Additionally, the review explores emerging solutions, including privacy-enhancing technologies and post-quantum cryptography, as future directions for enhancing privacy in generative AI systems. Recognizing that achieving absolute privacy is mathematically impossible, the review emphasizes the necessity of aligning technical safeguards with legal and regulatory frameworks to ensure compliance with data protection laws. By discussing the ethical and legal implications of privacy risks in generative AI, the review underscores the need for a balanced approach that considers performance, scalability, and privacy preservation. The findings highlight the need for ongoing research and innovation to develop privacy-preserving techniques that keep pace with the scaling of generative AI, especially in large language models, while adhering to regulatory and ethical standards.
引用
收藏
页数:25
相关论文
共 50 条
  • [31] Natural Language Understanding with Privacy-Preserving BERT
    Qu, Chen
    Kong, Weize
    Yang, Liu
    Zhang, Mingyang
    Bendersky, Michael
    Najork, Marc
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 1488 - 1497
  • [32] Privacy-preserving AI Services Through Data Decentralization
    Meurisch, Christian
    Bayrak, Bekir
    Muhlhauser, Max
    WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 190 - 200
  • [33] Privacy-preserving Techniques for Proximity Based LBS
    Freni, Dario
    MDM: 2009 10TH INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT, 2009, : 387 - 388
  • [34] Smart Metering privacy-preserving techniques in a nutshell
    Souri, Hajer
    Dhraief, Amine
    Tlili, Syrine
    Drira, Khalil
    Belghith, Abdelfettah
    5TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT-2014), THE 4TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT-2014), 2014, 32 : 1087 - 1094
  • [35] Research progress on location privacy-preserving techniques
    Wan, Sheng
    Li, Feng-Hua
    Niu, Ben
    Sun, Zhe
    Li, Hui
    Tongxin Xuebao/Journal on Communications, 2016, 37 (12): : 124 - 141
  • [36] Understanding Privacy-Preserving Techniques in Digital Cryptocurrencies
    Zhang, Yue
    Gai, Keke
    Qiu, Meikang
    Ding, Kai
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT III, 2020, 12454 : 3 - 18
  • [37] A Review on Privacy-Preserving Data Mining
    Li, Xueyun
    Yan, Zheng
    Zhang, Peng
    2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (CIT), 2014, : 769 - 774
  • [38] A taxonomy of privacy-preserving record linkage techniques
    Vatsalan, Dinusha
    Christen, Peter
    Verykios, Vassilios S.
    INFORMATION SYSTEMS, 2013, 38 (06) : 946 - 969
  • [39] Privacy-Preserving Artificial Intelligence Techniques in Biomedicine
    Torkzadehmahani, Reihaneh
    Nasirigerdeh, Reza
    Blumenthal, David B.
    Kacprowski, Tim
    List, Markus
    Matschinske, Julian
    Spaeth, Julian
    Wenke, Nina Kerstin
    Baumbach, Jan
    METHODS OF INFORMATION IN MEDICINE, 2022, 61 : E12 - E27
  • [40] Investigation on Privacy-Preserving Techniques for Personal Data
    Hamza, Rafik
    Zettsu, Koji
    ICDAR '21: PROCEEDINGS OF THE 2021 WORKSHOP ON INTELLIGENT CROSS-DATA ANALYSIS AND RETRIEVAL, 2021, : 62 - 66