Learning outside the Black-Box: The pursuit of interpretable models

被引:0
|
作者
Crabbe, Jonathan [1 ]
Zhang, Yao [1 ]
Zame, William R. [2 ]
van der Schaar, Mihaela [1 ]
机构
[1] Univ Cambridge, Cambridge CB2 1TN, England
[2] Univ Calif Los Angeles, Los Angeles, CA 90024 USA
关键词
REPRESENTATION; REGRESSION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine Learning has proved its ability to produce accurate models - but the deployment of these models outside the machine learning community has been hindered by the difficulties of interpreting these models. This paper proposes an algorithm that produces a continuous global interpretation of any given continuous black-box function. Our algorithm employs a variation of projection pursuit in which the ridge functions are chosen to be Meijer G-functions, rather than the usual polynomial splines. Because Meijer G-functions are differentiable in their parameters, we can "tune" the parameters of the representation by gradient descent; as a consequence, our algorithm is efficient. Using five familiar data sets from the UCI repository and two familiar machine learning algorithms, we demonstrate that our algorithm produces global interpretations that are both highly accurate and parsimonious (involve a small number of terms). Our interpretations permit easy understanding of the relative importance of features and feature interactions. Our interpretation algorithm represents a leap forward from the previous state of the art.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Interpretable Companions for Black-Box Models
    Pan, Danqing
    Wang, Tong
    Hara, Satoshi
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 2444 - 2453
  • [2] Understanding the black-box: towards interpretable and reliable deep learning models
    Qamar, Tehreem
    Bawany, Narmeen Zakaria
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [3] Explaining Black-Box Models Using Interpretable Surrogates
    Kuttichira, Deepthi Praveenlal
    Gupta, Sunil
    Li, Cheng
    Rana, Santu
    Venkatesh, Svetha
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2019, 11670 : 3 - 15
  • [4] Interpretable Approaches to Detect Bias in Black-Box Models
    Tan, Sarah
    PROCEEDINGS OF THE 2018 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY (AIES'18), 2018, : 382 - 383
  • [5] IHCP: interpretable hepatitis C prediction system based on black-box machine learning models
    Yongxian Fan
    Xiqian Lu
    Guicong Sun
    BMC Bioinformatics, 24
  • [6] IHCP: interpretable hepatitis C prediction system based on black-box machine learning models
    Fan, Yongxian
    Lu, Xiqian
    Sun, Guicong
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [7] Learning Groupwise Explanations for Black-Box Models
    Gao, Jingyue
    Wang, Xiting
    Wang, Yasha
    Yan, Yulan
    Xie, Xing
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2396 - 2402
  • [8] Hybrid Predictive Models: When an Interpretable Model Collaborates with a Black-box Model
    Wang, Tong
    Lin, Qihang
    JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [9] Hybrid predictive models: when an interpretable model collaborates with a black-box model
    Wang, Tong
    Lin, Qihang
    Journal of Machine Learning Research, 2021, 22
  • [10] Decision-making framework with double-loop learning through interpretable black-box machine learning models
    Bohanec, Marko
    Robnik-Sikonja, Marko
    Borstnar, Mirjana Kljajic
    INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2017, 117 (07) : 1389 - 1406