HaoCL: Harnessing Large-scale Heterogeneous Processors Made Easy

被引:2
|
作者
Chen, Yao [1 ]
Long, Xin [2 ]
He, Jiong [3 ]
Chen, Yuhang [1 ]
Tan, Hongshi [4 ]
Zhang, Zhenxiang [2 ]
Winslett, Marianne [1 ,5 ]
Chen, Deming [5 ]
机构
[1] Adv Digital Sci Ctr, Singapore, Singapore
[2] Alibaba Grp, Shenzhen, Guangdong, Peoples R China
[3] ASTAR, Inst High Performance Comp, Singapore, Singapore
[4] Natl Univ Singapore, Singapore, Singapore
[5] Univ Illinois, Champaign, IL USA
基金
新加坡国家研究基金会;
关键词
heterogeneous cluster; distributed computing; OpenCL; machine learning; deep learning;
D O I
10.1109/ICDCS47774.2020.00120
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The pervasive adoption of Deep Learning (DL) and Graph Processing (GP) makes it a de facto requirement to build large-scale clusters of heterogeneous accelerators including GPUs and FPGAs. The OpenCL programming framework can be used on the individual nodes of such clusters but is not intended for deployment in a distributed manner. Fortunately, the original OpenCL semantics naturally fit into the programming environment of heterogeneous clusters. In this paper, we propose a heterogeneity-aware OpenCL-like (HaoCL) programming framework to facilitate the programming of a wide range of scientific applications including DL and GP workloads on large-scale heterogeneous clusters. With HaoCL, existing applications can be directly deployed on heterogeneous clusters without any modifications to the original OpenCL source code and without awareness of the underlying hardware topologies and configurations. Our experiments show that HaoCL imposes a negligible overhead in a distributed environment, and provides near-liner speedups on standard benchmarks when computation or data size exceeds the capacity of a single node. The system design and the evaluations are presented in this demo paper.
引用
收藏
页码:1231 / 1234
页数:4
相关论文
共 50 条
  • [21] Optimized localization in large-scale heterogeneous WSN
    Sumit Kumar
    Neera Batra
    Shrawan Kumar
    The Journal of Supercomputing, 2023, 79 : 6705 - 6729
  • [22] Harnessing Web Page Directories for Large-Scale Classification of Tweets
    Zubiaga, Arkaitz
    Ji, Heng
    PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), 2013, : 225 - 226
  • [23] Load balancing in large-scale heterogeneous systems
    Borst, Sem
    QUEUEING SYSTEMS, 2022, 100 (3-4) : 397 - 399
  • [24] Parallel Graph Processing on Graphics Processors Made Easy
    Zhong, Jianlong
    He, Bingsheng
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (12): : 1270 - 1273
  • [25] EasyScale: Easy Mapping for Large-Scale Network Security Experiments
    Yao, Wei-Min
    Fahmy, Sonia
    Zhu, Jiahong
    2013 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2013, : 269 - 277
  • [26] CESAREAN SECTIONS MADE ON COWS ON LARGE-SCALE FARMS
    KOCH, H
    MAGYAR ALLATORVOSOK LAPJA, 1978, 33 (07) : 458 - 460
  • [27] TSH: Easy-to-be distributed partitioning for large-scale graphs
    Wang, Ning
    Wang, Zhigang
    Gu, Yu
    Bao, Yubin
    Yu, Ge
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 101 : 804 - 818
  • [28] Large-Scale Neuromorphic Spiking Array Processors: A Quest to Mimic the Brain
    Thakur, Chetan Singh
    Molin, Jamal Lottier
    Cauwenberghs, Gert
    Indiveri, Giacomo
    Kumar, Kundan
    Qiao, Ning
    Schemmel, Johannes
    Wang, Runchun
    Chicca, Elisabetta
    Hasler, Jennifer Olson
    Seo, Jae-sun
    Yu, Shimeng
    Cao, Yu
    van Schaik, Andre
    Etienne-Cummings, Ralph
    FRONTIERS IN NEUROSCIENCE, 2018, 12
  • [30] From Designing Quantum Processors to Large-Scale Quantum Computing Systems
    Almudever, Carmen G.
    Wille, Robert
    Sebastian, Fabio
    Haider, Nadia
    Alarcon, Eduard
    2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,