Scientific machine learning benchmarks

被引:61
|
作者
Thiyagalingam, Jeyan [1 ]
Shankar, Mallikarjun [2 ]
Fox, Geoffrey [3 ]
Hey, Tony [1 ]
机构
[1] Sci & Technol Facil Council, Rutherford Appleton Lab, Harwell Campus, Didcot, Oxon, England
[2] Oak Ridge Natl Lab, Oak Ridge, TN USA
[3] Univ Virginia, Comp Sci & Biocomplex Inst, Charlottesville, VA USA
基金
英国工程与自然科学研究理事会;
关键词
40;
D O I
10.1038/s42254-022-00441-7
中图分类号
O59 [应用物理学];
学科分类号
摘要
Finding the most appropriate machine learning algorithm for the analysis of any given scientific dataset is currently challenging, but new machine learning benchmarks for science are being developed to help. Deep learning has transformed the use of machine learning technologies for the analysis of large experimental datasets. In science, such datasets are typically generated by large-scale experimental facilities, and machine learning focuses on the identification of patterns, trends and anomalies to extract meaningful scientific insights from the data. In upcoming experimental facilities, such as the Extreme Photonics Application Centre (EPAC) in the UK or the international Square Kilometre Array (SKA), the rate of data generation and the scale of data volumes will increasingly require the use of more automated data analysis. However, at present, identifying the most appropriate machine learning algorithm for the analysis of any given scientific dataset is a challenge due to the potential applicability of many different machine learning frameworks, computer architectures and machine learning models. Historically, for modelling and simulation on high-performance computing systems, these issues have been addressed through benchmarking computer applications, algorithms and architectures. Extending such a benchmarking approach and identifying metrics for the application of machine learning methods to open, curated scientific datasets is a new challenge for both scientists and computer scientists. Here, we introduce the concept of machine learning benchmarks for science and review existing approaches. As an example, we describe the SciMLBench suite of scientific machine learning benchmarks.
引用
收藏
页码:413 / 420
页数:8
相关论文
共 50 条
  • [21] Machine learning and big scientific data
    Hey, Tony
    Butler, Keith
    Jackson, Sam
    Thiyagalingam, Jeyarajan
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2020, 378 (2166):
  • [22] FTIR coupled with machine learning to unveil spectroscopic benchmarks in the Italian EVOO
    Scatigno, Claudia
    Festa, Giulia
    INTERNATIONAL JOURNAL OF FOOD SCIENCE AND TECHNOLOGY, 2022, 57 (07): : 4156 - 4162
  • [23] Comparative Analysis of Machine Learning Models for Performance Prediction of the SPEC Benchmarks
    Tousi, Ashkan
    Lujan, Mikel
    IEEE ACCESS, 2022, 10 : 11994 - 12011
  • [24] A Survey of Big Data, High Performance Computing, and Machine Learning Benchmarks
    Ihde, Nina
    Marten, Paula
    Eleliemy, Ahmed
    Poerwawinata, Gabrielle
    Silva, Pedro
    Tolovski, Ilin
    Ciorba, Florina M.
    Rabl, Tilmann
    PERFORMANCE EVALUATION AND BENCHMARKING, TPCTC 2021, 2022, 13169 : 98 - 118
  • [25] Machine Learning for Bankruptcy Prediction in the American Stock Market: Dataset and Benchmarks
    Lombardo, Gianfranco
    Pellegrino, Mattia
    Adosoglou, George
    Cagnoni, Stefano
    Pardalos, Panos M.
    Poggi, Agostino
    FUTURE INTERNET, 2022, 14 (08):
  • [26] Construction of Realistic Place-and-Route Benchmarks for Machine Learning Applications
    Kim, Daeyeon
    Lee, Sung-Yun
    Min, Kyungjun
    Kang, Seokhyeong
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2023, 42 (06) : 2030 - 2042
  • [27] NEW SISC SECTION ON SCIENTIFIC MACHINE LEARNING
    De Sterck, Hans
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2024, 46 (01): : VII - VIII
  • [28] Evolution and scientific visualization of Machine learning field
    Rio-Belver, Rosa
    Garechana, Gaizka
    Bildosola, Inakki
    Zarrabeitia, Enara
    2ND INTERNATIONAL CONFERENCE ON ADVANCED RESEARCH METHODS AND ANALYTICS (CARMA 2018), 2018, : 115 - 123
  • [29] Machine Learning for Holistic Evaluation of Scientific Essays
    Hughes, Simon
    Hastings, Peter
    Britt, Mary Anne
    Wallace, Patricia
    Blaum, Dylan
    ARTIFICIAL INTELLIGENCE IN EDUCATION, AIED 2015, 2015, 9112 : 165 - 175
  • [30] Explainable Machine Learning for Scientific Insights and Discoveries
    Roscher, Ribana
    Bohn, Bastian
    Duarte, Marco F.
    Garcke, Jochen
    IEEE ACCESS, 2020, 8 : 42200 - 42216