Estimation of Machine Learning model uncertainty in particle physics event classifiers

被引:3
|
作者
Vazquez-Escobar, Julia [1 ]
Hernandez, J. M. [1 ]
Cardenas-Montes, Miguel [1 ]
机构
[1] CIEMAT, Dept Fundamental Res, Avda Complutense 40, Madrid 28040, Spain
关键词
Uncertainty estimation; Machine Learning; Particle physics; Supervised classification;
D O I
10.1016/j.cpc.2021.108100
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Particle physics experiments entail the collection of large data samples of complex information. In order to produce and detect low probability processes of interest (signal), a huge number of particle collisions must be carried out. This type of experiments produces huge sets of observations where most of them are of no interest (background). For this reason, a mechanism able to differentiate rare signals buried in immense backgrounds is required. The use of Machine Learning algorithms for this task allows to efficiently process huge amounts of complex data, automate the classification of event categories and produce signal-enriched filtered datasets more suitable for subsequent physics study. Although the classification of large imbalanced datasets has been undertaken in the past, the generation of predictions with their corresponding uncertainties is quite infrequent. In particle physics, as well as in other scientific domains, point estimations are considered as an incomplete answer if uncertainties are not presented. As a benchmark, we present a real case study where we compare three methods that estimate the uncertainty of Machine Learning algorithms predictions in the identification of the production and decay of top-antitop quark pairs in collisions of protons at the Large Hadron Collider at CERN. Datasets of detailed simulations of the signal and background processes elaborated by the CMS experiment are used. Three different techniques that provide a way to quantify prediction uncertainties for classification algorithms are proposed and evaluated: dropout training in deep neural networks as approximate Bayesian inference, variance estimation across an ensemble of trained deep neural networks, and Probabilistic Random Forest. All of them exhibit an excellent discrimination power with a model uncertainty measure that turns out to be small, showing that the predictions are precise and robust. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Uncertainty-aware machine learning for high energy physics
    Ghosh, Aishik
    Nachman, Benjamin
    Whiteson, Daniel
    PHYSICAL REVIEW D, 2021, 104 (05)
  • [22] A Survey of Machine Learning-Based Physics Event Generation
    Alanazi, Yasir
    Sato, Nobuo
    Ambrozewicz, Pawel
    Hiller-Blin, Astrid
    Melnitchouk, Wally
    Battaglieri, Marco
    Liu, Tianbo
    Li, Yaohang
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4286 - 4293
  • [23] Fast and Accurate Uncertainty Estimation in Chemical Machine Learning
    Musil, Felix
    Willatt, Michael J.
    Langovoy, Mikhail A.
    Ceriotti, Michele
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2019, 15 (02) : 906 - 915
  • [24] Machine Learning and Event-Based Software Testing: Classifiers for Identifying Infeasible GUI Event Sequences
    Gove, Robert
    Faytong, Jorge
    ADVANCES IN COMPUTERS, VOL 86, 2012, 86 : 109 - 135
  • [25] Uncertainty hits particle physics
    Goddard, A
    PHYSICS WORLD, 1996, 9 (09) : 5 - 5
  • [26] SUPA: A Lightweight Diagnostic Simulator for Machine Learning in Particle Physics
    Sinha, Atul Kumar
    Paliotta, Daniele
    Mate, Balint
    Raine, John A.
    Golling, Tobias
    Fleuret, Francois
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [27] Modern machine learning and particle physics: an in-depth review
    Bhattacherjee, Biplob
    Mukherjee, Swagata
    EUROPEAN PHYSICAL JOURNAL-SPECIAL TOPICS, 2024, 233 (15-16): : 2421 - 2424
  • [28] MadMiner: Machine Learning-Based Inference for Particle Physics
    Brehmer J.
    Kling F.
    Espejo I.
    Cranmer K.
    Computing and Software for Big Science, 2020, 4 (1)
  • [29] Copying Machine Learning Classifiers
    Unceta, Irene
    Nin, Jordi
    Pujol, Oriol
    IEEE ACCESS, 2020, 8 (08) : 160268 - 160284
  • [30] Printed Machine Learning Classifiers
    Mubarik, Muhammad Husnain
    Weller, Dennis D.
    Bleier, Nathaniel
    Tomei, Matthew
    Aghassi-Hagmann, Jasmin
    Tahoori, Mehdi B.
    Kumar, Rakesh
    2020 53RD ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO 2020), 2020, : 73 - 87