Evaluating Machine Learning Algorithms for Applications with Humans in the Loop

被引：0

作者：

Gopalakrishna, Aravind Kota ^{[1
]}

Ozcelebi, Tanir ^{[1
]}

Lukkien, Johan J. ^{[1
]}

Liotta, Antonio ^{[2
]}

机构：

[1] Eindhoven Univ Technol, Dept Math & Comp Sci, Syst Architecture & Networking Grp, Eindhoven, Netherlands

[2] Eindhoven Univ Technol, Dept Elect Engn, Electroopt Commun, Eindhoven, Netherlands

来源：

PROCEEDINGS OF THE 2017 IEEE 14TH INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL (ICNSC 2017) | 2017年

关键词：

SCALES;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Applications employing data classification such as smart lighting that involve human factors such as perception lead to non-deterministic input-output relationships where more than one output may be acceptable for a given input. For these so called non-deterministic multiple output classification (nDMOC) problems, the relationship between the input and output may change over time making it difficult for the machine learning (ML) algorithms in a batch setting to make predictions for a given context. In this paper, we describe the nature of nDMOC problems and discuss the Relevance Score (RS) that is suitable in this context as a performance metric. RS determines the extent by which a predicted output is relevant to the user's context and behaviors, taking into account the inconsistencies that come with human (perception) factors. We tailor the RS metric so that it can be used to evaluate ML algorithms in an online setting at run-time. We assess the performance of a number of ML algorithms, using a smart lighting dataset with non-deterministic one-to-many input-output relationships. The results indicate that using RS instead of classification accuracy (CA) is suitable to analyze the performance of conventional ML algorithms applied to the category of nDMOC problems. Instance-based online ML gives the best RS performance. An interesting finding is that the RS keeps increasing with increasing number of samples, even after the CA performance converges.

引用

页码：459 / 464

页数：6

共 50 条

[1] Evaluating Model Drift in Machine Learning Algorithms
Nelson, Kevin
Corbin, George
Amnia, Mark
Kovacs, Matthew
Tobias, Jeremy
Blowers, Misty
2015 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE FOR SECURITY AND DEFENSE APPLICATIONS (CISDA), 2015, : 162 - 169
[2] A Bayesian metric for evaluating machine learning algorithms
Hope, LR
Korb, KB
AI 2004: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3339 : 991 - 997
[3] Co-reasoning by Humans in the Loop as a Goal for Designers of Machine Learning-Driven Algorithms in Medicine
Guth, Stephen
AMERICAN JOURNAL OF BIOETHICS, 2024, 24 (09): : 120 - 122
[4] Evaluating Machine Learning Algorithms for Fake News Detection
Gilda, Shlok
PROCEEDINGS OF THE 2017 IEEE 15TH STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2017, : 110 - 115
[5] Evaluating Machine Learning Algorithms for Anomaly Detection in Clouds
Gulenko, Anton
Wallschlaeger, Marcel
Schmidt, Florian
Kao, Odej
Liu, Feng
2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 2716 - 2721
[6] Evaluating Machine Learning Algorithms for Detecting DDoS Attacks
Suresh, Manjula
Anitha, R.
ADVANCES IN NETWORK SECURITY AND APPLICATIONS, 2011, 196 : 441 - 452
[7] Evaluating Machine Learning Algorithms to Detect Employees' Attrition
Khalifa, Noor
Alnasheet, Maryam
Kadhem, Hasan
2022 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, ROBOTICS AND CONTROL, AIRC, 2022, : 93 - 97
[8] A Review of Machine Learning Algorithms for Biomedical Applications
Binson, V. A.
Thomas, Sania
Subramoniam, M.
Arun, J.
Naveen, S.
Madhu, S.
ANNALS OF BIOMEDICAL ENGINEERING, 2024, 52 (04) : 1051 - 1066
[9] Machine Learning and Cognitive Algorithms for Engineering Applications
Perlovsky, Leonid
Kuvich, Gary
INTERNATIONAL JOURNAL OF COGNITIVE INFORMATICS AND NATURAL INTELLIGENCE, 2013, 7 (04) : 64 - 82
[10] Machine Learning: A Review of the Algorithms and Its Applications
Dhall, Devanshi
Kaur, Ravinder
Juneja, Mamta
PROCEEDINGS OF RECENT INNOVATIONS IN COMPUTING, ICRIC 2019, 2020, 597 : 47 - 63

← 1 2 3 4 5 →