A Survey of Big Data, High Performance Computing, and Machine Learning Benchmarks

被引:3
|
作者
Ihde, Nina [1 ]
Marten, Paula [1 ]
Eleliemy, Ahmed [2 ]
Poerwawinata, Gabrielle [2 ]
Silva, Pedro [1 ]
Tolovski, Ilin [1 ]
Ciorba, Florina M. [2 ]
Rabl, Tilmann [1 ]
机构
[1] Hasso Platner Inst, Potsdam, Germany
[2] Univ Basel, Basel, Switzerland
关键词
Benchmarking; Big Data; HPC; Machine Learning; PARALLEL;
D O I
10.1007/978-3-030-94437-7_7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, there has been a convergence of Big Data (BD), High Performance Computing (HPC), and Machine Learning (ML) systems. This convergence is due to the increasing complexity of long data analysis pipelines on separated software stacks. With the increasing complexity of data analytics pipelines comes a need to evaluate their systems, in order to make informed decisions about technology selection, sizing and scoping of hardware. While there are many benchmarks for each of these domains, there is no convergence of these efforts. As a first step, it is also necessary to understand how the individual benchmark domains relate. In this work, we analyze some of the most expressive and recent benchmarks of BD, HPC, and ML systems. We propose a taxonomy of those systems based on individual dimensions such as accuracy metrics and common dimensions such as workload type. Moreover, we aim at enabling the usage of our taxonomy in identifying adapted benchmarks for their BD, HPC, and ML systems. Finally, we identify challenges and research directions related to the future of converged BD, HPC, and ML system benchmarking.
引用
收藏
页码:98 / 118
页数:21
相关论文
共 50 条
  • [11] A SURVEY OF MACHINE LEARNING ALGORITHMS FOR BIG DATA ANALYTICS
    Athmaja, S.
    Hanumanthappa, M.
    Kavitha, Vasantha
    2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2017,
  • [12] Large-scale machine learning based on functional networks for biomedical big data with high performance computing platforms
    Elsebakhi, Emad
    Lee, Frank
    Schendel, Eric
    Haque, Anwar
    Kathireason, Nagarajan
    Pathare, Tushar
    Syed, Najeeb
    Al-Ali, Rashid
    JOURNAL OF COMPUTATIONAL SCIENCE, 2015, 11 : 69 - 81
  • [13] Granular computing based machine learning in the era of big data
    Hu, Qinghua
    Mi, Jusheng
    Chen, Degang
    Information Sciences, 2022, 591 : 422 - 423
  • [14] A Survey on Benchmarks for Big Data and Some More Considerations
    Qin, Xiongpai
    Zhou, Xiaoyun
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2013, 2013, 8206 : 619 - 627
  • [15] Contributions to High-Performance Big Data Computing
    Fox, Geoffrey
    Qiu, Judy
    Crandall, David
    Von Laszewski, Gregor
    Beckstein, Oliver
    Paden, John
    Paraskevakos, Ioannis
    Jha, Shantenu
    Wang, Fusheng
    Marathe, Madhav
    Vullikanti, Anil
    Cheatham, Thomas
    FUTURE TRENDS OF HPC IN A DISRUPTIVE SCENARIO, 2019, 34 : 34 - 81
  • [16] High-Performance Computing for Big Data Processing
    Wu, Yulei
    Xiang, Yang
    Ge, Jingguo
    Muller, Peter
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 88 : 693 - 695
  • [17] A survey of big data architectures and machine learning algorithms in healthcare
    Manogaran G.
    Lopez D.
    International Journal of Biomedical Engineering and Technology, 2017, 25 (2-4) : 182 - 211
  • [18] A Survey of Distributed and Parallel Extreme Learning Machine for Big Data
    Wang, Zhiqiong
    Sui, Ling
    Xin, Junchang
    Qu, Luxuan
    Yao, Yudong
    IEEE ACCESS, 2020, 8 : 201247 - 201258
  • [19] Role of cloud computing, big data and machine learning in iot revolution
    Solanki, Arun
    Recent Advances in Computer Science and Communications, 2021, 14 (03): : 666 - 668
  • [20] In-Memory Computing Architectures for Big Data and Machine Learning Applications
    Snasel, Vaclav
    Tran Khanh Dang
    Pham, Phuong N. H.
    Kueng, Josef
    Kong, Lingping
    FUTURE DATA AND SECURITY ENGINEERING. BIG DATA, SECURITY AND PRIVACY, SMART CITY AND INDUSTRY 4.0 APPLICATIONS, FDSE 2022, 2022, 1688 : 19 - 33