Model-contrastive explanations through symbolic reasoning

被引:4
|
作者
Malandri, Lorenzo
Mercorio, Fabio [1 ]
Mezzanzanica, Mario
Seveso, Andrea
机构
[1] Univ Milano Bicocca, Dept Stat & Quantitat Methods, Milan, Italy
关键词
eXplainable AI; Contrastive explanation methods for XAI; Post -hoc explainability; XAI Interpretability;
D O I
10.1016/j.dss.2023.114040
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Explaining how two machine learning classification models differ in their behaviour is gaining significance in eXplainable AI, given the increasing diffusion of learning-based decision support systems. Human decisionmakers deal with more than one machine learning model in several practical situations. Consequently, the importance of understanding how two machine learning models work beyond their prediction performances is key to understanding their behaviour, differences, and likeness. Some attempts have been made to address these problems, for instance, by explaining text classifiers in a timecontrastive fashion. In this paper, we present MERLIN, a novel eXplainable AI approach that provides contrastive explanations of two machine learning models, introducing the concept of model-contrastive explanations. We propose an encoding that allows MERLIN to work with both text and tabular data and with mixed continuous and discrete features. To show the effectiveness of our approach, we evaluate it on an extensive set of benchmark datasets. MERLIN is also implemented as a python-pip package.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Explanations based on the Missing: Towards Contrastive Explanations with Pertinent Negatives
    Dhurandhar, Amit
    Chen, Pin-Yu
    Luss, Ronny
    Tu, Chun-Chen
    Ting, Paishun
    Shanmugam, Karthikeyan
    Das, Payel
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [32] Personalized Federated Learning With Model-Contrastive Learning for Multi-Modal User Modeling in Human-Centric Metaverse
    Zhou, Xiaokang
    Yang, Qiuyue
    Zheng, Xuzhe
    Liang, Wei
    Wang, Kevin I-Kai
    Ma, Jianhua
    Pan, Yi
    Jin, Qun
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2024, 42 (04) : 817 - 831
  • [33] Symbolic Assume-Guarantee Reasoning through BDD Learning
    He, Fei
    Wang, Bow-Yaw
    Yin, Liangze
    Zhu, Lei
    36TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2014), 2014, : 1071 - 1082
  • [34] Symbolic Explanations for Hyperparameter Optimization
    Segel, Sarah
    Graf, Helena
    Tornede, Alexander
    Bischl, Bernd
    Lindauer, Marius
    INTERNATIONAL CONFERENCE ON AUTOMATED MACHINE LEARNING, VOL 224, 2023, 224
  • [35] Contrastive counterfactual visual explanations with overdetermination
    Adam White
    Kwun Ho Ngan
    James Phelan
    Kevin Ryan
    Saman Sadeghi Afgeh
    Constantino Carlos Reyes-Aldasoro
    Artur d’Avila Garcez
    Machine Learning, 2023, 112 : 3497 - 3525
  • [36] Contrastive counterfactual visual explanations with overdetermination
    White, Adam
    Ngan, Kwun Ho
    Phelan, James
    Ryan, Kevin
    Afgeh, Saman Sadeghi
    Reyes-Aldasoro, Constantino Carlos
    Garcez, Artur d'Avila
    MACHINE LEARNING, 2023, 112 (09) : 3497 - 3525
  • [37] Towards a Formulation of Fuzzy Contrastive Explanations
    Bloch, Isabelle
    Lesot, Marie Jeanne
    2022 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2022,
  • [38] Persuasive Contrastive Explanations for Bayesian Networks
    Koopman, Tara
    Renooij, Silja
    SYMBOLIC AND QUANTITATIVE APPROACHES TO REASONING WITH UNCERTAINTY, ECSQARU 2021, 2021, 12897 : 229 - 242
  • [39] Contrastive Explanations of Text Classifiers as a Service
    Malandri, Lorenzo
    Mercorio, Fabio
    Mezzanzanica, Mario
    Nobani, Navid
    Seveso, Andrea
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE DEMONSTRATIONS SESSION, 2022, : 46 - 53
  • [40] Conversational Explanations of Machine Learning Predictions Through Class-contrastive Counterfactual Statements
    Sokol, Kacper
    Flach, Peter
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 5785 - 5786