Privacy-Preserving Data Publishing in Process Mining

Cited by: 17
Authors
Rafiei, Majid [1]
van der Aalst, Wil M. P. [1]
Affiliations
[1] RWTH Aachen University, Chair of Process and Data Science, Aachen, Germany
Keywords
Responsible process mining; Privacy preservation; Privacy metadata; Process mining; Event logs; Differential privacy
DOI
10.1007/978-3-030-58638-6_8
Chinese Library Classification (CLC) number
F [Economics]
Discipline classification code
02
Abstract
Process mining aims to provide insights into actual processes based on event data. These data are often recorded by information systems and are widely available. However, they often contain sensitive private information that should be analyzed responsibly, so privacy issues in process mining have recently received more attention. Privacy preservation techniques necessarily modify the original data, yet they are also expected to preserve data utility. Privacy-preserving transformations of the data may therefore lead to incorrect or misleading analysis results. Hence, new infrastructures need to be designed for publishing privacy-aware event data. Their aim is to provide metadata about the privacy-related transformations applied to the event data without revealing details of the privacy preservation techniques or the protected information. In this paper, we provide formal definitions for the main anonymization operations used by privacy models in process mining. These definitions are used to create an infrastructure for recording the privacy metadata. We demonstrate the practical use of the proposed privacy metadata by designing a privacy extension for the XES standard and a general data structure for event data that are not in the form of standard event logs.
Pages: 122-138
Page count: 17