Scanflow: A multi-graph framework for Machine Learning workflow management, supervision, and debugging

被引：5

作者：

Bravo-Rocca, Gusseppe ^{[1
]}

Liu, Peini ^{[1
]}

Guitart, Jordi ^{[1
,2
]}

Dholakia, Ajay ^{[3
]}

Ellison, David ^{[3
]}

Falkanger, Jeffrey ^{[3
]}

Hodak, Miroslav ^{[3
]}

机构：

[1] Barcelona Supercomp Ctr BSC, Emerging Technol Artificial Intelligence, Barcelona, Spain

[2] Univ Politecn Catalunya UPC, Comp Architecture Dept, Barcelona, Spain

[3] Lenovo, Lenovo Infrastruct Solut Grp, Morrisville, NC USA

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2022年 / 202卷

关键词：

Machine Learning; Symbolic knowledge; Graph; Robustness; Containerization; Concept drift;

D O I：

10.1016/j.eswa.2022.117232

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Machine Learning (ML) is more than just training models, the whole workflow must be considered. Once deployed, a ML model needs to be watched and constantly supervised and debugged to guarantee its validity and robustness in unexpected situations. Debugging in ML aims to identify (and address) the model weaknesses in not trivial contexts. Several techniques have been proposed to identify different types of model weaknesses, such as bias in classification, model decay, adversarial attacks, etc., yet there is not a generic framework that allows them to work in a collaborative, modular, portable, iterative way and, more importantly, flexible enough to allow both human- and machine-driven techniques. In this paper, we propose a novel containerized directed graph framework to support and accelerate end-to-end ML workflow management, supervision, and debugging. The framework allows defining and deploying ML workflows in containers, tracking their metadata, checking their behavior in production, and improving the models by using both learned and human-provided knowledge. We demonstrate these capabilities by integrating in the framework two hybrid systems to detect data drift distribution which identify the samples that are far from the latent space of the original distribution, ask for human intervention, and whether retrain the model or wrap it with a filter to remove the noise of corrupted data at inference time. We test these systems on MNIST-C, CIFAR-10-C, and FashionMNIST-C datasets, obtaining promising accuracy results with the help of human involvement.

引用

页数：19

共 50 条

[1] A Multi-Graph Fusion Framework for Patient Representation Learning
Liu, Yuxi
Zhang, Zhenhao
Qin, Shaowen
Salim, Flora D.
2024 IEEE 12TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS, ICHI 2024, 2024, : 222 - 227
[2] A Multi-graph Fusion Based Spatiotemporal Dynamic Learning Framework
Wang, Xu
Chen, Lianliang
Zhang, Hongbo
Wang, Pengkun
Zhou, Zhengyang
Wang, Yang
WSDM 2023 - Proceedings of the 16th ACM International Conference on Web Search and Data Mining, 2023, : 294 - 302
[3] MULTI-GRAPH LEARNING OF SPECTRAL GRAPH DICTIONARIES
Thanou, Dorina
Frossard, Pascal
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 3397 - 3401
[4] Parallel multi-graph classification using extreme learning machine and MapReduce
Pang, Jun
Gu, Yu
Xu, Jia
Kong, Xiaowang
Yu, Ge
NEUROCOMPUTING, 2017, 261 : 171 - 183
[5] Parallel Multi-graph Classification Using Extreme Learning Machine and MapReduce
Pang, Jun
Gu, Yu
Xu, Jia
Kong, Xiaowang
Yu, Ge
PROCEEDINGS OF ELM-2015, VOL 1: THEORY, ALGORITHMS AND APPLICATIONS (I), 2016, 6 : 77 - 92
[6] Positive and Unlabeled Multi-Graph Learning
Wu, Jia
Pan, Shirui
Zhu, Xingquan
Zhang, Chengqi
Wu, Xindong
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (04) : 818 - 829
[7] Multi-graph Fusion Graph Convolutional Networks with pseudo-label supervision
Yang, Yachao
Sun, Yanfeng
Ju, Fujiao
Wang, Shaofan
Gao, Junbin
Yin, Baocai
NEURAL NETWORKS, 2023, 158 : 305 - 317
[8] Multi-graph embedding for partial label learning
Li, Hongyan
Vong, Chi Man
Wan, Zhonglin
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (27): : 20253 - 20271
[9] LEARNING MULTI-GRAPH REGULARIZATION FOR SVM CLASSIFICATION
Mygdalis, Vasileios
Tefas, Anastasios
Pitas, Ioannis
2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 1608 - 1612
[10] A novel multi-graph framework for salient object detection
Ye Lu
Kedong Zhou
Xiyin Wu
Penghan Gong
The Visual Computer, 2019, 35 : 1683 - 1699

← 1 2 3 4 5 →