Protecting Machine Learning Models from Training Data Set Extraction

被引:0
|
作者
Kalinin, M. O. [1 ]
Muryleva, A. A. [1 ]
Platonov, V. V. [1 ]
机构
[1] Peter Great St Petersburg Polytech Univ, St Petersburg 195251, Russia
关键词
noising; machine learning; training set; membership inference; Gaussian noise; PRIVACY;
D O I
10.3103/S0146411624700871
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The problem of protecting machine learning models from the threat of data privacy violation implementing membership inference in training data sets is considered. A method of protective noising of the training set is proposed. It is experimentally shown that Gaussian noising of training data with a scale of 0.2 is the simplest and most effective way to protect machine learning models from membership inference in the training set. In comparison with alternatives, this method is easy to implement, universal in relation to types of models, and allows reducing the effectiveness of membership inference to 26 percentage points.
引用
收藏
页码:1234 / 1241
页数:8
相关论文
共 50 条
  • [42] Learning earth system models from observations: machine learning or data assimilation?
    Geer, A. J.
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2021, 379 (2194):
  • [43] On the Development of a Digital Twin for Underwater UXO Detection Using Magnetometer-Based Data in Application for the Training Set Generation for Machine Learning Models
    Blachnik, Marcin
    Przylucki, Roman
    Golak, Slawomir
    Sciegienka, Piotr
    Wieczorek, Tadeusz
    SENSORS, 2023, 23 (15)
  • [44] Evaluating machine learning technologies for food computing from a data set perspective
    Nauman Ullah Gilal
    Khaled Al-Thelaya
    Jumana Khalid Al-Saeed
    Mohamed Abdallah
    Jens Schneider
    James She
    Jawad Hussain Awan
    Marco Agus
    Multimedia Tools and Applications, 2024, 83 : 32041 - 32068
  • [45] Evaluating machine learning technologies for food computing from a data set perspective
    Gilal, Nauman Ullah
    Al-Thelaya, Khaled
    Al-Saeed, Jumana Khalid
    Abdallah, Mohamed
    Schneider, Jens
    She, James
    Awan, Jawad Hussain
    Agus, Marco
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (11) : 32041 - 32068
  • [46] SODA: Protecting Proprietary Information in On-Device Machine Learning Models
    Atrey, Akanksha
    Sinha, Ritwik
    Mitra, Saayan
    Shenoy, Prashant
    2023 IEEE/ACM SYMPOSIUM ON EDGE COMPUTING, SEC 2023, 2023, : 121 - 132
  • [47] Machine Learning Training on Encrypted Data with TFHE
    Montero, Luis
    Frery, Jordan
    Kherfallah, Celia
    Bredehoft, Roman
    Stoian, Andrei
    PROCEEDINGS OF THE 10TH ACM INTERNATIONAL WORKSHOP ON SECURITY AND PRIVACY ANALYTICS, IWSPA 2024, 2024, : 71 - 76
  • [48] Effect of Training Data Order for Machine Learning
    Mange, Jeremy
    2019 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2019), 2019, : 406 - 407
  • [49] Improving the accuracy of machine-learning models with data from machine test repetitions
    Bustillo, Andres
    Reis, Roberto
    Machado, Alisson R.
    Pimenov, Danil Yu.
    JOURNAL OF INTELLIGENT MANUFACTURING, 2022, 33 (01) : 203 - 221
  • [50] Improving the accuracy of machine-learning models with data from machine test repetitions
    Andres Bustillo
    Roberto Reis
    Alisson R. Machado
    Danil Yu. Pimenov
    Journal of Intelligent Manufacturing, 2022, 33 : 203 - 221