Generative Perturbation Analysis for Probabilistic Black-Box Anomaly Attribution

被引:0
|
作者
Ide, Tsuyoshi [1 ]
Abe, Naoki [1 ]
机构
[1] IBM Res, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
关键词
explainable AI (XAI); anomaly attribution; generative model; variational inference; Shapley value; integrated gradient;
D O I
10.1145/3580305.3599365
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We address the task of probabilistic anomaly attribution in the black-box regression setting, where the goal is to compute the probability distribution of the attribution score of each input variable, given an observed anomaly. The training dataset is assumed to be unavailable. This task differs from the standard XAI (explainable AI) scenario, since we wish to explain the anomalous deviation from a black-box prediction rather than the black-box model itself. We begin by showing that mainstream model-agnostic explanation methods, such as the Shapley values, are not suitable for this task because of their "deviation-agnostic property." We then propose a novel framework for probabilistic anomaly attribution that allows us to not only compute attribution scores as the predictive mean but also quantify the uncertainty of those scores. This is done by considering a generative process for perturbations that counter-factually bring the observed anomalous observation back to normalcy. We introduce a variational Bayes algorithm for deriving the distributions of per variable attribution scores. To the best of our knowledge, this is the first probabilistic anomaly attribution framework that is free from being deviation-agnostic.
引用
收藏
页码:845 / 856
页数:12
相关论文
共 50 条
  • [21] Probabilistic Stabilizability Certificates for a Class of Black-Box Linear Systems
    Fabiani, Filippo
    Margellos, Kostas
    Goulart, Paul J.
    IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 584 - 589
  • [22] Probabilistic Black-Box Checking via Active MDP Learning
    Shijubo, Junya
    Waga, Masaki
    Suenaga, Kohei
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2023, 22 (05)
  • [23] Mitigating Black-Box Adversarial Attacks via Output Noise Perturbation
    Aithal, Manjushree B.
    Li, Xiaohua
    IEEE ACCESS, 2022, 10 : 12395 - 12411
  • [24] Data-free Universal Adversarial Perturbation and Black-box Attack
    Zhang, Chaoning
    Benz, Philipp
    Karjauv, Adil
    Kweon, In So
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7848 - 7857
  • [25] Universal Perturbation Generation for Black-box Attack Using Evolutionary Algorithms
    Wang, Siyu
    Shi, Yucheng
    Han, Yahong
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1277 - 1282
  • [26] MFPP: Morphological Fragmental Perturbation Pyramid for Black-Box Model Explanations
    Yang, Qing
    Zhu, Xia
    Fwu, Jong-Kae
    Ye, Yun
    You, Ganmei
    Zhu, Yuan
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1376 - 1383
  • [27] Black-box attacks against log anomaly detection with adversarial examples
    Lu, Siyang
    Wang, Mingquan
    Wang, Dongdong
    Wei, Xiang
    Xiao, Sizhe
    Wang, Zhiwei
    Han, Ningning
    Wang, Liqiang
    INFORMATION SCIENCES, 2023, 619 : 249 - 262
  • [28] THE MATHEMATICAL WORLD IN THE BLACK-BOX - SIGNIFICANCE OF THE BLACK-BOX AS A MEDIUM OF MATHEMATIZING
    MAASS, J
    SCHLOGLMANN, W
    CYBERNETICS AND SYSTEMS, 1988, 19 (04) : 295 - 309
  • [29] INSIDE THE BLACK-BOX
    HORGAN, J
    IEEE SPECTRUM, 1986, 23 (11) : 65 - 65
  • [30] Probabilistic Permutation Graph Search: Black-Box Optimization for Fairness in Ranking
    Vardasbi, Ali
    Sarvi, Fatemeh
    de Rijke, Maarten
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 715 - 725