Explainable data stream mining: Why the new models are better

Cited by: 0
Authors
Hu, Hanqing [1]
Kantardzic, Mehmed [1]
Kar, Shreyas [1]
Affiliations
[1] Univ Louisville, CECS, Louisville, KY 40292 USA
Keywords
Explainable machine learning; data stream mining; concept drift
DOI
10.3233/IDT-230065
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Explainable Machine Learning brings explainability, interpretability, and accountability to data mining algorithms. Existing explanation frameworks focus on explaining the decision process of a single model trained on a static dataset. In data stream mining, however, changes in the data distribution over time, called concept drift, may require updating the learning models to reflect the current data environment. It is therefore important to go beyond static models and understand what has changed between the learning models before and after a concept drift. We propose a Data Stream Explainability framework (DSE) that works together with a typical data stream mining framework in which support vector machine (SVM) models are used. DSE aims to help non-expert users understand model dynamics in a concept-drifting data stream. It visualizes the differences between the SVM models before and after a concept drift to explain why the new model fits the data better. A survey of expert and non-expert users was carried out to assess the effectiveness of the framework. Although non-expert users on average reported less understanding of the issue than expert users, the difference was not statistically significant. This indicates that DSE successfully brings the explainability of model change to non-expert users.
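The abstract gives no implementation detail, so the following is only an illustrative sketch of the general idea of comparing SVM models before and after a concept drift. It is not the authors' DSE framework. It assumes a toy two-feature stream in which the label depends on feature 0 before the drift and on feature 1 after it, and it trains a linear SVM with a plain sub-gradient method. All names (`train_linear_svm`, `make_window`, `weight_shift`) and all parameter values are hypothetical.

```python
import random


def train_linear_svm(X, y, lam=0.01, epochs=200, lr=0.1):
    """Full-batch sub-gradient descent on the L2-regularised hinge loss."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w = [lam * wj for wj in w]  # gradient of the regulariser
        grad_b = 0.0
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            if margin < 1:  # only margin violators contribute to the hinge term
                for j in range(d):
                    grad_w[j] -= yi * xi[j] / n
                grad_b -= yi / n
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def predict(model, x):
    w, b = model
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1


def accuracy(model, X, y):
    return sum(predict(model, xi) == yi for xi, yi in zip(X, y)) / len(X)


def make_window(rng, n, relevant):
    """Toy data window: the label is the sign of the feature at index `relevant`."""
    X, y = [], []
    while len(X) < n:
        x = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
        if abs(x[relevant]) < 0.1:  # leave a small gap around the boundary
            continue
        X.append(x)
        y.append(1 if x[relevant] > 0 else -1)
    return X, y


rng = random.Random(0)
X_old, y_old = make_window(rng, 200, relevant=0)  # assumed pre-drift concept
X_new, y_new = make_window(rng, 200, relevant=1)  # assumed post-drift concept

old_model = train_linear_svm(X_old, y_old)
new_model = train_linear_svm(X_new, y_new)

# Per-feature change in the linear SVM weights: a crude "what changed" summary.
weight_shift = [wn - wo for wo, wn in zip(old_model[0], new_model[0])]
acc_old_on_new = accuracy(old_model, X_new, y_new)
acc_new_on_new = accuracy(new_model, X_new, y_new)

print("weight shift per feature:", [round(s, 2) for s in weight_shift])
print("old model accuracy on post-drift data:", round(acc_old_on_new, 2))
print("new model accuracy on post-drift data:", round(acc_new_on_new, 2))
```

In this toy setting the weight shift indicates which feature gained or lost influence, and the accuracy gap on post-drift data shows in what sense the new model fits better. The paper's visual explanations may work quite differently.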
Pages: 371-385
Number of pages: 15