Individual privacy levels in query-based anonymization

被引:1
|
作者
Schiegg, Sascha [1 ]
Strohmeier, Florian [1 ]
Gerl, Armin [2 ]
Kosch, Harald [1 ]
机构
[1] Univ Passau, Passau, Germany
[2] HM Univ Appl Sci Munich, Munich, Germany
关键词
Data privacy; Query processing; Data warehouses;
D O I
10.1145/3664476.3670920
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Artificial intelligence systems such as Large Language Models (LLM) derive their knowledge from large datasets. Systems like ChatGPT therefore rely on shared data to train on. For companies, releasing data to the public domain requires anonymization as soon as an individual is identifiable. While there are several privacy models that guarantee a certain level of distortion applied to a dataset, to mitigate re-identification, e.g. with k-anonymity, the required level is generally defined by the data processor. We propose the idea of combining individual privacy levels defined by the data subjects themselves with a privacy language, such as the Layered Privacy Language (LPL) [10], to get a more fine-grained understanding of the effectively required privacy level. Queries that target subsets of the dataset to be released can only benefit from lower privacy requirements set by data subjects, as these response subsets may do not contain users with high privacy requirements, which can then lead to more utility. By analyzing the results of different queries to a privacy-aware data-transforming database system, we demonstrate the characteristics required for this assumption to be truly effective. For a more realistic evaluation, we also consider changes in the underlying data sources.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Query-Based Linked Data Anonymization
    Delanaux, Remy
    Bonifati, Angela
    Rousset, Marie-Christine
    Thion, Romuald
    SEMANTIC WEB - ISWC 2018, PT I, 2018, 11136 : 530 - 546
  • [2] Side-Channel Attacks on Query-Based Data Anonymization
    Boenisch, Franziska
    Munz, Reinhard
    Tiepelt, Marcel
    Hanisch, Simon
    Kuhn, Christiane
    Francis, Paul
    CCS '21: PROCEEDINGS OF THE 2021 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2021, : 1254 - 1265
  • [3] Query-Based Data Pricing
    Koutris, Paraschos
    Upadhyaya, Prasang
    Balazinska, Magdalena
    Howe, Bill
    Suciu, Dan
    JOURNAL OF THE ACM, 2015, 62 (05)
  • [4] Dynamic query-based debugging
    Lencevicius, R
    Hölzle, U
    Singh, AK
    ECOOP'99 - OBJECT-ORIENTED PROGRAMMING, 1999, 1628 : 135 - 160
  • [5] Snapshot query-based debugging
    Potanin, A
    Noble, J
    Biddle, R
    2004 AUSTRALIAN SOFTWARE ENGINEERING CONFERENCE, PROCEEDINGS, 2004, : 251 - 259
  • [6] A query-based quantum eigensolver
    Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Sichuan, China
    不详
    不详
    不详
    Quantum Eng., 2020, 3
  • [7] Secure Anonymous Query-Based Encryption for Data Privacy Preserving in Cloud: Moye(Omega)
    Konan, Martin
    Wang, Wenyong
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY AND PRIVACY, 2018, 12 (04) : 1 - 23
  • [8] A Profit-maximizing Mechanism for Query-based Data Trading with Personalized Differential Privacy
    Cai, Hui
    Zhu, Yanmin
    Li, Jie
    Yu, Jiadi
    COMPUTER JOURNAL, 2021, 64 (02): : 264 - 280
  • [9] Query-Based Access Control for Ontologies
    Knechtel, Martin
    Stuckenschmidt, Heiner
    WEB REASONING AND RULE SYSTEMS, 2010, 6333 : 73 - +
  • [10] Query-based learning of XPath expressions
    Carme, Julien
    Ceresna, Michal
    Goebel, Max
    GRAMMATICAL INFERENCE: ALGORITHMS AND APPLICATIONS, PROCEEDINGS, 2006, 4201 : 342 - 343