Scientific machine learning benchmarks

被引:61
|
作者
Thiyagalingam, Jeyan [1 ]
Shankar, Mallikarjun [2 ]
Fox, Geoffrey [3 ]
Hey, Tony [1 ]
机构
[1] Sci & Technol Facil Council, Rutherford Appleton Lab, Harwell Campus, Didcot, Oxon, England
[2] Oak Ridge Natl Lab, Oak Ridge, TN USA
[3] Univ Virginia, Comp Sci & Biocomplex Inst, Charlottesville, VA USA
基金
英国工程与自然科学研究理事会;
关键词
40;
D O I
10.1038/s42254-022-00441-7
中图分类号
O59 [应用物理学];
学科分类号
摘要
Finding the most appropriate machine learning algorithm for the analysis of any given scientific dataset is currently challenging, but new machine learning benchmarks for science are being developed to help. Deep learning has transformed the use of machine learning technologies for the analysis of large experimental datasets. In science, such datasets are typically generated by large-scale experimental facilities, and machine learning focuses on the identification of patterns, trends and anomalies to extract meaningful scientific insights from the data. In upcoming experimental facilities, such as the Extreme Photonics Application Centre (EPAC) in the UK or the international Square Kilometre Array (SKA), the rate of data generation and the scale of data volumes will increasingly require the use of more automated data analysis. However, at present, identifying the most appropriate machine learning algorithm for the analysis of any given scientific dataset is a challenge due to the potential applicability of many different machine learning frameworks, computer architectures and machine learning models. Historically, for modelling and simulation on high-performance computing systems, these issues have been addressed through benchmarking computer applications, algorithms and architectures. Extending such a benchmarking approach and identifying metrics for the application of machine learning methods to open, curated scientific datasets is a new challenge for both scientists and computer scientists. Here, we introduce the concept of machine learning benchmarks for science and review existing approaches. As an example, we describe the SciMLBench suite of scientific machine learning benchmarks.
引用
收藏
页码:413 / 420
页数:8
相关论文
共 50 条
  • [1] Scientific machine learning benchmarks
    Jeyan Thiyagalingam
    Mallikarjun Shankar
    Geoffrey Fox
    Tony Hey
    Nature Reviews Physics, 2022, 4 : 413 - 420
  • [2] Recommendations for machine learning benchmarks in neuroimaging
    Leenings, Ramona
    Winter, Nils R.
    Dannlowski, Udo
    Hahn, Tim
    NEUROIMAGE, 2022, 257
  • [3] A Case Study on Machine Learning for Synthesizing Benchmarks
    Goens, Andres
    Brauckmann, Alexander
    Ertel, Sebastian
    Cummins, Chris
    Leather, Hugh
    Castrillon, Jeronimo
    PROCEEDINGS OF THE 3RD ACM SIGPLAN INTERNATIONAL WORKSHOP ON MACHINE LEARNING AND PROGRAMMING LANGUAGES (MAPL '19), 2019, : 38 - 46
  • [4] GRAPH TOPOLOGY INFERENCE BENCHMARKS FOR MACHINE LEARNING
    Lassance, Carlos
    Gripon, Vincent
    Mateos, Gonzalo
    PROCEEDINGS OF THE 2020 IEEE 30TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2020,
  • [5] Envisioning better benchmarks for machine learning PDE solvers
    Brandstetter, Johannes
    NATURE MACHINE INTELLIGENCE, 2025, 7 (01) : 2 - 3
  • [6] NeuroGraph: Benchmarks for Graph Machine Learning in Brain Connectomics
    Said, Anwar
    Bayrak, Roza G.
    Derr, Tyler
    Shabbir, Mudassir
    Moyer, Daniel
    Chang, Catie
    Koutsoukos, Xenofon
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [7] BubbleML: A Multiphase Multiphysics Dataset and Benchmarks for Machine Learning
    Hassan, Sheikh Md Shakeel
    Feeney, Arthur
    Dhruv, Akash
    Kim, Jihoon
    Suh, Youngjoon
    Ryu, Jaiyoung
    Won, Yoonjin
    Chandramowlishwaran, Aparna
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [8] SCIENTIFIC MACHINE LEARNING: A SYMBIOSIS
    Keith, Brendan
    O'leary-roseberry, Thomas
    Sanderse, Benjamin
    Scheichl, Robert
    Waanders, Bart van bloemen
    FOUNDATIONS OF DATA SCIENCE, 2025, 7 (01):
  • [9] BIKED: A Dataset for Computational Bicycle Design With Machine Learning Benchmarks
    Regenwetter, Lyle
    Curry, Brent
    Ahmed, Faez
    JOURNAL OF MECHANICAL DESIGN, 2022, 144 (03)
  • [10] Benchmarks for machine learning in depression discrimination using electroencephalography signals
    Ayan Seal
    Rishabh Bajpai
    Mohan Karnati
    Jagriti Agnihotri
    Anis Yazidi
    Enrique Herrera-Viedma
    Ondrej Krejcar
    Applied Intelligence, 2023, 53 : 12666 - 12683