Estimation of Machine Learning model uncertainty in particle physics event classifiers

Cited by: 3

Authors:

Vazquez-Escobar, Julia [1]

Hernandez, J. M. [1]

Cardenas-Montes, Miguel [1]

Affiliation:

[1] CIEMAT, Dept Fundamental Res, Avda Complutense 40, Madrid 28040, Spain

Source:

COMPUTER PHYSICS COMMUNICATIONS | 2021 / Vol. 268

Keywords:

Uncertainty estimation; Machine Learning; Particle physics; Supervised classification;

DOI:

10.1016/j.cpc.2021.108100

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Discipline classification code:

081203 ; 0835 ;

Abstract:

Particle physics experiments entail the collection of large data samples of complex information. To produce and detect low-probability processes of interest (signal), a huge number of particle collisions must be carried out. Experiments of this type produce huge sets of observations, most of which are of no interest (background). For this reason, a mechanism able to separate rare signals buried in immense backgrounds is required. Machine Learning algorithms make it possible to process huge amounts of complex data efficiently, automate the classification of event categories, and produce signal-enriched filtered datasets better suited to subsequent physics study. Although the classification of large imbalanced datasets has been undertaken in the past, predictions are rarely reported with their corresponding uncertainties. In particle physics, as in other scientific domains, a point estimate is considered an incomplete answer if its uncertainty is not presented. As a benchmark, we present a real case study comparing three methods that estimate the uncertainty of Machine Learning predictions in the identification of the production and decay of top-antitop quark pairs in proton collisions at the Large Hadron Collider at CERN. Datasets of detailed simulations of the signal and background processes produced by the CMS experiment are used. Three techniques that quantify prediction uncertainties for classification algorithms are proposed and evaluated: dropout training in deep neural networks as approximate Bayesian inference, variance estimation across an ensemble of trained deep neural networks, and Probabilistic Random Forest. All of them exhibit excellent discrimination power with a model uncertainty measure that turns out to be small, showing that the predictions are precise and robust. © 2021 Elsevier B.V. All rights reserved.
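
The first technique named in the abstract, dropout kept active at prediction time as approximate Bayesian inference, can be illustrated with a toy example. The sketch below is not the authors' implementation: the one-layer classifier, the weights, the event features, and all function names are hypothetical and chosen only for illustration. It runs many stochastic forward passes for one event and reports the mean score as the prediction and the standard deviation across passes as a model uncertainty estimate.

```python
import math
import random
import statistics


def sigmoid(z):
    """Logistic function mapping a real score to the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def dropout_forward(features, weights, bias, drop_prob, rng):
    """One stochastic forward pass of a toy one-layer classifier.

    Each input is dropped with probability drop_prob; surviving inputs are
    rescaled by 1 / (1 - drop_prob) (inverted dropout).
    """
    keep = 1.0 - drop_prob
    z = bias
    for x, w in zip(features, weights):
        if rng.random() < keep:
            z += w * x / keep
    return sigmoid(z)


def mc_dropout_predict(features, weights, bias, drop_prob=0.2,
                       n_passes=200, seed=0):
    """Return (mean score, standard deviation) over n_passes dropout passes."""
    rng = random.Random(seed)
    scores = [dropout_forward(features, weights, bias, drop_prob, rng)
              for _ in range(n_passes)]
    return statistics.fmean(scores), statistics.pstdev(scores)


# Hypothetical weights and event features, for illustration only.
weights = [1.5, -0.8, 0.6]
bias = -0.2
event = [1.2, 0.4, 2.0]

mean_score, uncertainty = mc_dropout_predict(event, weights, bias)
print(f"signal score = {mean_score:.3f} +/- {uncertainty:.3f}")
```

The same mean-and-spread summary applies to the ensemble approach mentioned in the abstract if the list of dropout passes is replaced by the scores of independently trained networks.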

Pages: 8
