Explainable data stream mining: Why the new models are better

被引:0
|
作者
Hu, Hanqing [1 ]
Kantardzic, Mehmed [1 ]
Kar, Shreyas [1 ]
机构
[1] Univ Louisville, CECS, Louisville, KY 40292 USA
来源
关键词
Explanable machine learning; data stream mining; concept drift; CONCEPT DRIFT;
D O I
10.3233/IDT-230065
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Explainable Machine Learning brings expandability, interpretability, and accountability to Data Mining Algorithms. Existing explanation frameworks focus on explaining the decision process of a single model in a static dataset. However, in data stream mining changes in data distribution over time, called concept drift, may require updating the learning models to reflect the current data environment. It is therefore important to go beyond static models and understand what has changed among the learning models before and after a concept drift. We propose a Data Stream Explanability framework (DSE) that works together with a typical data stream mining framework where support vector machine models are used. DSE aims to help non-expert users understand model dynamics in a concept drifting data stream. DSE visualizes differences between SVM models before and after concept drift, to produce explanations on why the new model fits the data better. A survey was carried out between expert and non-expert users on the effectiveness of the framework. Although results showed non-expert users on average responded with less understanding of the issue compared to expert users, the difference is not statistically significant. This indicates that DSE successfully brings the explanability of model change to non-expert users.
引用
收藏
页码:371 / 385
页数:15
相关论文
共 50 条
  • [21] A New Gradual Forgetting Approach for Mining Data Stream with Concept Drift
    Li, Yingrong
    Wei, Yang
    Kolesnikova, Anastasiya
    Lee, Won Don
    ISISE 2008: INTERNATIONAL SYMPOSIUM ON INFORMATION SCIENCE AND ENGINEERING, VOL 1, 2008, : 556 - 559
  • [22] A New Data-Stream-Mining-Based Battery Equalization Method
    Lin, Cheng
    Mu, Hao
    Zhao, Li
    Cao, Wanke
    ENERGIES, 2015, 8 (07) : 6543 - 6565
  • [23] Why fuzzy in data mining?
    Vamos, T
    ISUMA 2003: FOURTH INTERNATIONAL SYMPOSIUM ON UNCERTAINTY MODELING AND ANALYSIS, 2003, : 46 - 49
  • [24] Data Stream Mining: Challenges and Techniques
    Khan, Latifur
    22ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2010), PROCEEDINGS, VOL 2, 2010, : 295 - 295
  • [25] Stream mining on univariate uncertain data
    Liu, Ying-Ho
    APPLIED INTELLIGENCE, 2013, 39 (02) : 315 - 344
  • [26] IoT Big Data Stream Mining
    Morales, Gianmarco De Francisci
    Bifet, Albert
    Khan, Latifur
    Gama, Joao
    Fan, Wei
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 2119 - 2120
  • [27] Data Stream Mining: the Bounded Rationality
    Gama, Joao
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2013, 37 (01): : 21 - 26
  • [28] Research and Evolvement of Data Stream Mining
    Sun Yafeng
    Yang Xiaopin
    Huang Zhiping
    ISTM/2009: 8TH INTERNATIONAL SYMPOSIUM ON TEST AND MEASUREMENT, VOLS 1-6, 2009, : 1438 - 1441
  • [29] Mining Sequential Patterns in Data Stream
    Huang, Qinhua
    Ouyang, Weimin
    ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 2, PROCEEDINGS, 2009, 5552 : 865 - 874
  • [30] Data stream mining: The bounded rationality
    LIAAD-INESC TEC, and FEP, University of Porto, R. Ceuta 118-6, 4050-190 Porto, Portugal
    Informatica, 2013, 1 (21-25):