Label-Only Membership Inference Attack Based on Model Explanation

被引:0
|
作者
Ma, Yao [1 ]
Zhai, Xurong [1 ]
Yu, Dan [1 ]
Yang, Yuli [1 ]
Wei, Xingyu [2 ]
Chen, Yongle [1 ]
机构
[1] Taiyuan Univ Technol, Coll Comp Sci & Technol, Jinzhong 030600, Peoples R China
[2] Tsinghua Univ, Res Ctr Identificat & Resolut Syst, Jiashan Novat Ctr, Yangtze Delta Reg Inst, Beijing 314100, Zhejiang, Peoples R China
关键词
Machine Learning; Membership Inference Attack; Forgettable Examples; Feature Attribution; Confidence Estimate;
D O I
10.1007/s11063-024-11682-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is well known that machine learning models (e.g., image recognition) can unintentionally leak information about the training set. Conventional membership inference relies on posterior vectors, and this task becomes extremely difficult when the posterior is masked. However, current label-only membership inference attacks require a large number of queries during the generation of adversarial samples, and thus incorrect inference generates a large number of invalid queries. Therefore, we introduce a label-only membership inference attack based on model explanations. It can transform a label-only attack into a traditional membership inference attack by observing neighborhood consistency and perform fine-grained membership inference for vulnerable samples. We use feature attribution to simplify the high-dimensional neighborhood sampling process, quickly identify decision boundaries and recover a posteriori vectors. It also compares different privacy risks faced by different samples through finding vulnerable samples. The method is validated on CIFAR-10, CIFAR-100 and MNIST datasets. The results show that membership attributes can be identified even using a simple sampling method. Furthermore, vulnerable samples expose the model to greater privacy risks.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Label-Only Membership Inference Attack Against Federated Distillation
    Wang, Xi
    Zhao, Yanchao
    Zhang, Jiale
    Chen, Bing
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT II, 2024, 14488 : 394 - 410
  • [2] Label-Only Membership Inference Attacks
    Choquette-Choo, Christopher A.
    Tramer, Florian
    Carlini, Nicholas
    Papernot, Nicolas
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [3] Label-Only Membership Inference Attacks and Defenses in Semantic Segmentation Models
    Zhang, Guangsheng
    Liu, Bo
    Zhu, Tianqing
    Ding, Ming
    Zhou, Wanlei
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2023, 20 (02) : 1435 - 1449
  • [4] Membership Leakage in Label-Only Exposures
    Li, Zheng
    Zhang, Yang
    CCS '21: PROCEEDINGS OF THE 2021 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2021, : 880 - 895
  • [5] Label-only membership inference attacks on machine unlearning without dependence of posteriors
    Lu, Zhaobo
    Liang, Hai
    Zhao, Minghao
    Lv, Qingzhe
    Liang, Tiancai
    Wang, Yilei
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11) : 9424 - 9441
  • [6] Optimizing Label-Only Membership Inference Attacks by Global Relative Decision Boundary Distances
    Xu, Jiacheng
    Hu, Jianpeng
    Yu, Chunqing
    Tan, Chengxiang
    INFORMATION SECURITY, PT I, ISC 2024, 2025, 15257 : 107 - 126
  • [7] Label-Only Model Inversion Attacks: Attack With the Least Information
    Zhu, Tianqing
    Ye, Dayong
    Zhou, Shuai
    Liu, Bo
    Zhou, Wanlei
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 991 - 1005
  • [8] Unstoppable Attack: Label-Only Model Inversion Via Conditional Diffusion Model
    Liu, Rongke
    Wang, Dong
    Ren, Yizhi
    Wang, Zhen
    Guo, Kaitian
    Qin, Qianqian
    Liu, Xiaolei
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 3958 - 3973
  • [9] POSTER: Double-Dip: Thwarting Label-Only Membership Inference Attacks with Transfer Learning and Randomization
    Rajabi, Arezoo
    Pimple, Reeya
    Janardhanan, Aiswarya
    Asokraj, Surudhi
    Ramasubramanian, Bhaskar
    Poovendran, Radha
    PROCEEDINGS OF THE 19TH ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, ACM ASIACCS 2024, 2024, : 1937 - 1939
  • [10] Label-Only Model Inversion Attacks via Boundary Repulsion
    Kahla, Mostafa
    Chen, Si
    Just, Hoang Anh
    Jia, Ruoxi
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 15025 - 15033