An OpenCL micro-benchmark suite for GPUs and CPUs

被引:0
|
作者
Xin Yan
Xiaohua Shi
Lina Wang
Haiyan Yang
机构
[1] Beihang University,State Key Laboratory of Software Development Environment, School of Computer Science and Engineering
来源
关键词
Micro benchmark; OpenCL; GPU; Multi-core CPU;
D O I
暂无
中图分类号
学科分类号
摘要
Open computing language (OpenCL) is a new industry standard for task-parallel and data-parallel heterogeneous computing on a variety of modern CPUs, GPUs, DSPs, and other microprocessor designs. OpenCL is vendor independent and hence not specialized for any particular compute device. To develop efficient OpenCL applications for the particular platform, we still need a more profound understanding of architecture features on the OpenCL model and computing devices. For this purpose, we design and implement an OpenCL micro-benchmark suite for GPUs and CPUs. In this paper, we introduce the implementations of our OpenCL micro benchmarks, and present the measuring results of hardware and software features like performance of mathematical operations, bus bandwidths, memory architectures, branch synchronizations and scalability, etc., on two multi-core CPUs, i.e. AMD Athlon II X2 250 and Intel Pentium Dual-Core E5400, and two different GPUs, i.e. NVIDIA GeForce GTX 460se and AMD Radeon HD 6850. We also compared the measuring results with existing benchmarks to demonstrate the reasonableness and correctness of our benchmark suite.
引用
收藏
页码:693 / 713
页数:20
相关论文
共 50 条
  • [31] Implementing and Evaluating an Heterogeneous, Scalable, Tridiagonal Linear System Solver with OpenCL to Target FPGAs, GPUs, and CPUs
    Macintosh, Hamish J.
    Banks, Jasmine E.
    Kelson, Neil A.
    INTERNATIONAL JOURNAL OF RECONFIGURABLE COMPUTING, 2019, 2019
  • [32] Improving Performance of OpenCL on CPUs
    Karrenberg, Ralf
    Hack, Sebastian
    COMPILER CONSTRUCTION, CC 2012, 2012, 7210 : 1 - 20
  • [33] O2render: An OpenCL-to-Renderscript Translator for Porting Across Various GPUs or CPUs
    Yang, Cheng-yan
    Wu, Yi-jui
    Liao, Steven
    2012 IEEE 10TH SYMPOSIUM ON EMBEDDED SYSTEMS FOR REAL-TIME MULTIMEDIA (ESTIMEDIA), 2012, : 67 - 74
  • [34] vBench: A Micro-benchmark for File - I/O Performance of Virtual Machines
    Yuan, Pingpeng
    Jin, Hai
    Ye, Ding
    Cao, Wenzhi
    Yan, Yaowei
    Xie, Xia
    2009 IEEE ASIA-PACIFIC SERVICES COMPUTING CONFERENCE (APSCC 2009), 2009, : 150 - 155
  • [35] Evaluating LSTM Time Series Prediction Performance on Benchmark CPUs and GPUs in Cloud Environments
    Saha, Aditi
    Rahman, Mohammad
    Wu, Fan
    PROCEEDINGS OF THE 2024 ACM SOUTHEAST CONFERENCE, ACMSE 2024, 2024, : 321 - 322
  • [36] GNNMark: A Benchmark Suite to Characterize Graph Neural Network Training on GPUs
    Baruah, Trinayan
    Shivdikar, Kaustubh
    Dong, Shi
    Sun, Yifan
    Mojumder, Saiful A.
    Jung, Kihoon
    Abellan, Jose L.
    Ukidave, Yash
    Joshi, Ajay
    Kim, John
    Kaeli, David
    2021 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS 2021), 2021, : 13 - 23
  • [37] On Energy Nonproportionality of CPUs and GPUs
    Manumachu, Ravi Reddy
    Lastovetsky, Alexey
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022), 2022, : 34 - 44
  • [38] Micro-benchmark level performance comparison of high-speed cluster interconnects
    Liu, JX
    Chandrasekaran, B
    Yu, WK
    Wu, JS
    Buntinas, D
    Kini, S
    Wyckoff, P
    Panda, DK
    HOT INTERCONNECTS 11, 2003, : 60 - 65
  • [39] HeteroSync: A Benchmark Suite for Fine-Grained Synchronization on Tightly Coupled GPUs
    Sinclair, Matthew D.
    Alsop, Johnathan
    Adve, Sarita V.
    PROCEEDINGS OF THE 2017 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION (IISWC), 2017, : 239 - 249
  • [40] Distributed Task-based Runtime Systems - Current State and Micro-Benchmark Performance
    Hoque, Reazul
    Shamis, Pavel
    IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 934 - 941