Estimation of Machine Learning model uncertainty in particle physics event classifiers

被引:3
|
作者
Vazquez-Escobar, Julia [1 ]
Hernandez, J. M. [1 ]
Cardenas-Montes, Miguel [1 ]
机构
[1] CIEMAT, Dept Fundamental Res, Avda Complutense 40, Madrid 28040, Spain
关键词
Uncertainty estimation; Machine Learning; Particle physics; Supervised classification;
D O I
10.1016/j.cpc.2021.108100
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Particle physics experiments entail the collection of large data samples of complex information. In order to produce and detect low probability processes of interest (signal), a huge number of particle collisions must be carried out. This type of experiments produces huge sets of observations where most of them are of no interest (background). For this reason, a mechanism able to differentiate rare signals buried in immense backgrounds is required. The use of Machine Learning algorithms for this task allows to efficiently process huge amounts of complex data, automate the classification of event categories and produce signal-enriched filtered datasets more suitable for subsequent physics study. Although the classification of large imbalanced datasets has been undertaken in the past, the generation of predictions with their corresponding uncertainties is quite infrequent. In particle physics, as well as in other scientific domains, point estimations are considered as an incomplete answer if uncertainties are not presented. As a benchmark, we present a real case study where we compare three methods that estimate the uncertainty of Machine Learning algorithms predictions in the identification of the production and decay of top-antitop quark pairs in collisions of protons at the Large Hadron Collider at CERN. Datasets of detailed simulations of the signal and background processes elaborated by the CMS experiment are used. Three different techniques that provide a way to quantify prediction uncertainties for classification algorithms are proposed and evaluated: dropout training in deep neural networks as approximate Bayesian inference, variance estimation across an ensemble of trained deep neural networks, and Probabilistic Random Forest. All of them exhibit an excellent discrimination power with a model uncertainty measure that turns out to be small, showing that the predictions are precise and robust. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Implementing a Model to Detect Parkinson Disease using Machine Learning Classifiers
    Kumar, Uday G. S.
    Baskaran, S.
    Sumathi, D.
    JOURNAL OF ALGEBRAIC STATISTICS, 2022, 13 (01) : 99 - 110
  • [42] Machine Learning of Phases and Structures for Model Systems in Physics
    Bayo, Djenabou
    Civitcioglu, Burak
    Webb, Joseph J.
    Honecker, Andreas
    Roemer, Rudolf A.
    JOURNAL OF THE PHYSICAL SOCIETY OF JAPAN, 2025, 94 (03)
  • [43] Anomaly detection in Skin Model Shapes using machine learning classifiers
    Yacob, Filmon
    Semere, Daniel
    Nordgren, Erik
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2019, 105 (09): : 3677 - 3689
  • [44] Anomaly detection in Skin Model Shapes using machine learning classifiers
    Filmon Yacob
    Daniel Semere
    Erik Nordgren
    The International Journal of Advanced Manufacturing Technology, 2019, 105 : 3677 - 3689
  • [45] Implementing a Model to Detect Diabetes Prediction using Machine Learning Classifiers
    Sireesha, P. J.
    Prakash, K.
    Sumathi, D.
    JOURNAL OF ALGEBRAIC STATISTICS, 2022, 13 (01) : 558 - 566
  • [46] An efficient plant disease prediction model based on machine learning and deep learning classifiers
    Shinde, Nirmala
    Ambhaikar, Asha
    EVOLUTIONARY INTELLIGENCE, 2025, 18 (01)
  • [47] Physics-Informed Machine Learning and Uncertainty Quantification for Mechanics of Heterogeneous Materials
    B. V. S. S. Bharadwaja
    Mohammad Amin Nabian
    Bharatkumar Sharma
    Sanjay Choudhry
    Alankar Alankar
    Integrating Materials and Manufacturing Innovation, 2022, 11 : 607 - 627
  • [48] Physics-Informed Machine Learning and Uncertainty Quantification for Mechanics of Heterogeneous Materials
    Bharadwaja, B. V. S. S.
    Nabian, Mohammad Amin
    Sharma, Bharatkumar
    Choudhry, Sanjay
    Alankar, Alankar
    INTEGRATING MATERIALS AND MANUFACTURING INNOVATION, 2022, 11 (04) : 607 - 627
  • [49] Estimating RANS model uncertainty using machine learning
    Heyse, Jan F.
    Mishra, Aashwin A.
    Iaccarino, Gianluca
    JOURNAL OF THE GLOBAL POWER AND PROPULSION SOCIETY, 2021,
  • [50] Fracture Permeability Estimation Under Complex Physics: A Data-Driven Model Using Machine Learning
    He, Xupeng
    AlSinan, Marwah M.
    Kwak, Hyung T.
    Hoteit, Hussein
    Saudi Aramco Journal of Technology, 2022, 2022 : 2 - 11