Explainable data stream mining: Why the new models are better

被引：0

作者：

Hu, Hanqing ^{[1
]}

Kantardzic, Mehmed ^{[1
]}

Kar, Shreyas ^{[1
]}

机构：

[1] Univ Louisville, CECS, Louisville, KY 40292 USA

来源：

INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS | 2024年 / 18卷 / 01期

关键词：

Explanable machine learning; data stream mining; concept drift; CONCEPT DRIFT;

D O I：

10.3233/IDT-230065

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Explainable Machine Learning brings expandability, interpretability, and accountability to Data Mining Algorithms. Existing explanation frameworks focus on explaining the decision process of a single model in a static dataset. However, in data stream mining changes in data distribution over time, called concept drift, may require updating the learning models to reflect the current data environment. It is therefore important to go beyond static models and understand what has changed among the learning models before and after a concept drift. We propose a Data Stream Explanability framework (DSE) that works together with a typical data stream mining framework where support vector machine models are used. DSE aims to help non-expert users understand model dynamics in a concept drifting data stream. DSE visualizes differences between SVM models before and after concept drift, to produce explanations on why the new model fits the data better. A survey was carried out between expert and non-expert users on the effectiveness of the framework. Although results showed non-expert users on average responded with less understanding of the issue compared to expert users, the difference is not statistically significant. This indicates that DSE successfully brings the explanability of model change to non-expert users.

引用

页码：371 / 385

页数：15

共 50 条

[1] Clustering Models for Data Stream Mining
Mythily, R.
Banu, Aisha
Raghunathan, Shriram
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES, ICICT 2014, 2015, 46 : 619 - 626
[2] Agents and stream data mining: A new perspective
Ong, KL
Zhang, ZL
Ng, WK
Lim, EP
IEEE INTELLIGENT SYSTEMS, 2005, 20 (03) : 60 - 67
[3] A New Approach for Mining Frequent Items in Data Stream
Tu, Li
Chen, Ling
2010 INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT (CCCM2010), VOL II, 2010, : 225 - 228
[4] New optimization models for data mining
Glover, Fred W.
Kochenberger, Gary
INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2006, 5 (04) : 605 - 609
[5] Data Mining Methods and Cost Estimation Models Why is it so hard to infuse new ideas?
Hihn, Jairus
Menzies, Tim
2015 30TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING WORKSHOP (ASEW), 2015, : 5 - 9
[6] New Policy of Maximal Frequent Itemsets in Data Stream Mining
Xu, ChongHuan
Ju, ChunHua
ADVANCED MECHANICAL ENGINEERING, PTS 1 AND 2, 2010, 26-28 : 118 - +
[7] Towards a new approach for mining frequent itemsets on data stream
Chedy Raïssi
Pascal Poncelet
Maguelonne Teisseire
Journal of Intelligent Information Systems, 2007, 28 : 23 - 36
[8] Towards a new approach for mining frequent itemsets on data stream
Raissi, Chedy
Poncelet, Pascal
Teisseire, Maguelonne
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2007, 28 (01) : 23 - 36
[9] A New Method for Data Stream Mining Based on the Misclassification Error
Rutkowski, Leszek
Jaworski, Maciej
Pietruczuk, Lena
Duda, Piotr
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (05) : 1048 - 1059
[10] Data and Event Stream Mining
Naveen, Kavya
Sathyanarayana, M. V.
Naveen, N. C.
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2008, 8 (04): : 140 - 143

← 1 2 3 4 5 →