Explainable data stream mining: Why the new models are better

被引:0
|
作者
Hu, Hanqing [1 ]
Kantardzic, Mehmed [1 ]
Kar, Shreyas [1 ]
机构
[1] Univ Louisville, CECS, Louisville, KY 40292 USA
来源
关键词
Explanable machine learning; data stream mining; concept drift; CONCEPT DRIFT;
D O I
10.3233/IDT-230065
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Explainable Machine Learning brings expandability, interpretability, and accountability to Data Mining Algorithms. Existing explanation frameworks focus on explaining the decision process of a single model in a static dataset. However, in data stream mining changes in data distribution over time, called concept drift, may require updating the learning models to reflect the current data environment. It is therefore important to go beyond static models and understand what has changed among the learning models before and after a concept drift. We propose a Data Stream Explanability framework (DSE) that works together with a typical data stream mining framework where support vector machine models are used. DSE aims to help non-expert users understand model dynamics in a concept drifting data stream. DSE visualizes differences between SVM models before and after concept drift, to produce explanations on why the new model fits the data better. A survey was carried out between expert and non-expert users on the effectiveness of the framework. Although results showed non-expert users on average responded with less understanding of the issue compared to expert users, the difference is not statistically significant. This indicates that DSE successfully brings the explanability of model change to non-expert users.
引用
收藏
页码:371 / 385
页数:15
相关论文
共 50 条
  • [1] Clustering Models for Data Stream Mining
    Mythily, R.
    Banu, Aisha
    Raghunathan, Shriram
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES, ICICT 2014, 2015, 46 : 619 - 626
  • [2] Agents and stream data mining: A new perspective
    Ong, KL
    Zhang, ZL
    Ng, WK
    Lim, EP
    IEEE INTELLIGENT SYSTEMS, 2005, 20 (03) : 60 - 67
  • [3] A New Approach for Mining Frequent Items in Data Stream
    Tu, Li
    Chen, Ling
    2010 INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT (CCCM2010), VOL II, 2010, : 225 - 228
  • [4] New optimization models for data mining
    Glover, Fred W.
    Kochenberger, Gary
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2006, 5 (04) : 605 - 609
  • [5] Data Mining Methods and Cost Estimation Models Why is it so hard to infuse new ideas?
    Hihn, Jairus
    Menzies, Tim
    2015 30TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING WORKSHOP (ASEW), 2015, : 5 - 9
  • [6] New Policy of Maximal Frequent Itemsets in Data Stream Mining
    Xu, ChongHuan
    Ju, ChunHua
    ADVANCED MECHANICAL ENGINEERING, PTS 1 AND 2, 2010, 26-28 : 118 - +
  • [7] Towards a new approach for mining frequent itemsets on data stream
    Chedy Raïssi
    Pascal Poncelet
    Maguelonne Teisseire
    Journal of Intelligent Information Systems, 2007, 28 : 23 - 36
  • [8] Towards a new approach for mining frequent itemsets on data stream
    Raissi, Chedy
    Poncelet, Pascal
    Teisseire, Maguelonne
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2007, 28 (01) : 23 - 36
  • [9] A New Method for Data Stream Mining Based on the Misclassification Error
    Rutkowski, Leszek
    Jaworski, Maciej
    Pietruczuk, Lena
    Duda, Piotr
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (05) : 1048 - 1059
  • [10] Data and Event Stream Mining
    Naveen, Kavya
    Sathyanarayana, M. V.
    Naveen, N. C.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2008, 8 (04): : 140 - 143