Advancing interpretability of machine-learning prediction models

被引:2
|
作者
Trenary, Laurie [1 ,2 ]
DelSole, Timothy [1 ,2 ]
机构
[1] George Mason Univ, Dept Atmospher Ocean & Earth Sci, Fairfax, VA 22030 USA
[2] George Mason Univ, Ctr Ocean Land Atmosphere Studies, Fairfax, VA 22030 USA
来源
基金
美国国家科学基金会;
关键词
machine learning; model interpretation; subseasonal prediction;
D O I
10.1017/eds.2022.13
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This paper proposes an approach to diagnosing the skill of a machine-learning prediction model based on finding combinations of variables that minimize the normalized mean square error of the predictions. This technique is attractive because it compresses the positive skill of a forecast model into the smallest number of components. The resulting components can then be analyzed much like principal components, including the construction of regression maps for investigating sources of skill. The technique is illustrated with a machine-learning model of week 3-4 predictions of western US wintertime surface temperatures. The technique reveals at least two patterns of large-scale temperature variations that are skillfully predicted. The predictability of these patterns is generally consistent between climate model simulations and observations. The predictability is determined largely by sea surface temperature variations in the Pacific, particularly the region associated with the El Nino-Southern Oscillation. This result is not surprising, but the fact that it emerges naturally from the technique demonstrates that the technique can be helpful in "explaining" the source of predictability in machine-learning models. Impact Statement Machine learning has emerged as a powerful tool for climate prediction, but the resulting models often are too complex to interpret. Methods for extracting meaningful knowledge from machine-learning models have been developed (e.g., explainable AI), but most of these methods apply only to low-dimensional outputs. In contrast, many climate applications require predicting spatial fields. This paper proposes an approach to reducing the dimension of the output by finding components with the most skill. This technique is illustrated by training separate machine-learning models at hundreds of spatial locations, and then using this technique to show that only a few patterns are predicted with significant skill. Individual patterns can then be analyzed using regression techniques to diagnose the source of the skill.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Optimizing machine-learning models for mutagenicity prediction through better feature selection
    Shinada, Nicolas K.
    Koyama, Naoki
    Ikemori, Megumi
    Nishioka, Tomoki
    Hitaoka, Seiji
    Hakura, Atsushi
    Asakura, Shoji
    Matsuoka, Yukiko
    Palaniappan, Sucheendra K.
    MUTAGENESIS, 2022, 37 (3-4) : 191 - 202
  • [22] Comparative efficacy of machine-learning models in prediction of reducing uncertainties in biosurfactant production
    Srdjan Jovic
    Dejan Guresic
    Ljiljana Babincev
    Nenad Draskovic
    Vidosav Dekic
    Bioprocess and Biosystems Engineering, 2019, 42 : 1695 - 1699
  • [23] Prediction of chronic damage in systemic lupus erythematosus by using machine-learning models
    Ceccarelli, Fulvia
    Sciandrone, Marco
    Perricone, Carlo
    Galvan, Giulio
    Morelli, Francesco
    Vicente, Luis Nunes
    Leccese, Ilaria
    Massaro, Laura
    Cipriano, Enrica
    Spinelli, Francesca Romana
    Alessandri, Cristiano
    Valesini, Guido
    Conti, Fabrizio
    PLOS ONE, 2017, 12 (03):
  • [24] PREDICTION OF CHRONIC DAMAGE IN SYSTEMIC LUPUS ERYTHEMATOSUS BY USING MACHINE-LEARNING MODELS
    Ceccarelli, F.
    Sciandrone, M.
    Perricone, C.
    Galvan, G.
    Morelli, F.
    Vicente, L. N.
    Leccese, I.
    Massaro, L.
    Cipriano, E.
    Spinelli, F. R.
    Alessandri, C.
    Valesini, G.
    Conti, F.
    ANNALS OF THE RHEUMATIC DISEASES, 2017, 76 : 1449 - 1449
  • [25] Comparative efficacy of machine-learning models in prediction of reducing uncertainties in biosurfactant production
    Jovic, Srdjan
    Guresic, Dejan
    Babincev, Ljiljana
    Draskovic, Nenad
    Dekic, Vidosav
    BIOPROCESS AND BIOSYSTEMS ENGINEERING, 2019, 42 (10) : 1695 - 1699
  • [26] Physics-based and machine-learning models for accurate scour depth prediction
    Jatoliya, Ajay
    Bhattacharya, Debayan
    Manna, Bappaditya
    Bento, Ana Margarida
    Ferradosa, Tiago Fazeres
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2024, 382 (2264):
  • [27] Interpretability and Explainability of Machine Learning Models: Achievements and Challenges
    Henriques, J.
    Rocha, T.
    de Carvalho, P.
    Silva, C.
    Paredes, S.
    INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS 2022, ICBHI 2022, 2024, 108 : 81 - 94
  • [28] Measuring Interpretability for Different Types of Machine Learning Models
    Zhou, Qing
    Liao, Fenglu
    Mou, Chao
    Wang, Ping
    TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING: PAKDD 2018 WORKSHOPS, 2018, 11154 : 295 - 308
  • [29] Groundwater Prediction Using Machine-Learning Tools
    Hussein, Eslam A.
    Thron, Christopher
    Ghaziasgar, Mehrdad
    Bagula, Antoine
    Vaccari, Mattia
    ALGORITHMS, 2020, 13 (11)
  • [30] Anxiety onset in adolescents: a machine-learning prediction
    Alice V. Chavanne
    Marie Laure Paillère Martinot
    Jani Penttilä
    Yvonne Grimmer
    Patricia Conrod
    Argyris Stringaris
    Betteke van Noort
    Corinna Isensee
    Andreas Becker
    Tobias Banaschewski
    Arun L. W. Bokde
    Sylvane Desrivières
    Herta Flor
    Antoine Grigis
    Hugh Garavan
    Penny Gowland
    Andreas Heinz
    Rüdiger Brühl
    Frauke Nees
    Dimitri Papadopoulos Orfanos
    Tomáš Paus
    Luise Poustka
    Sarah Hohmann
    Sabina Millenet
    Juliane H. Fröhner
    Michael N. Smolka
    Henrik Walter
    Robert Whelan
    Gunter Schumann
    Jean-Luc Martinot
    Eric Artiges
    Molecular Psychiatry, 2023, 28 : 639 - 646