HaoCL: Harnessing Large-scale Heterogeneous Processors Made Easy

被引:2
|
作者
Chen, Yao [1 ]
Long, Xin [2 ]
He, Jiong [3 ]
Chen, Yuhang [1 ]
Tan, Hongshi [4 ]
Zhang, Zhenxiang [2 ]
Winslett, Marianne [1 ,5 ]
Chen, Deming [5 ]
机构
[1] Adv Digital Sci Ctr, Singapore, Singapore
[2] Alibaba Grp, Shenzhen, Guangdong, Peoples R China
[3] ASTAR, Inst High Performance Comp, Singapore, Singapore
[4] Natl Univ Singapore, Singapore, Singapore
[5] Univ Illinois, Champaign, IL USA
基金
新加坡国家研究基金会;
关键词
heterogeneous cluster; distributed computing; OpenCL; machine learning; deep learning;
D O I
10.1109/ICDCS47774.2020.00120
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The pervasive adoption of Deep Learning (DL) and Graph Processing (GP) makes it a de facto requirement to build large-scale clusters of heterogeneous accelerators including GPUs and FPGAs. The OpenCL programming framework can be used on the individual nodes of such clusters but is not intended for deployment in a distributed manner. Fortunately, the original OpenCL semantics naturally fit into the programming environment of heterogeneous clusters. In this paper, we propose a heterogeneity-aware OpenCL-like (HaoCL) programming framework to facilitate the programming of a wide range of scientific applications including DL and GP workloads on large-scale heterogeneous clusters. With HaoCL, existing applications can be directly deployed on heterogeneous clusters without any modifications to the original OpenCL source code and without awareness of the underlying hardware topologies and configurations. Our experiments show that HaoCL imposes a negligible overhead in a distributed environment, and provides near-liner speedups on standard benchmarks when computation or data size exceeds the capacity of a single node. The system design and the evaluations are presented in this demo paper.
引用
收藏
页码:1231 / 1234
页数:4
相关论文
共 50 条
  • [31] SPARE AND COMPARE, AN ONLINE TESTING METHOD FOR LARGE-SCALE DIGITAL PROCESSORS
    SHIFFER, C
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 1977, 13 (04) : 456 - 456
  • [32] Large-scale finite element fluid analysis by massively parallel processors
    Yagawa, G
    Nakabayashi, Y
    Okuda, H
    PARALLEL COMPUTING, 1997, 23 (09) : 1365 - 1377
  • [33] Deep Learning Systems: Algorithms, Compilers, and Processors for Large-Scale Production
    Rodriguez A.
    Synthesis Lectures on Computer Architecture, 2021, 15 (04): : 1 - 265
  • [34] Harnessing the Cloud for Securely Solving Large-scale Systems of Linear Equations
    Wang, Cong
    Ren, Kui
    Wang, Jia
    Urs, Karthik Mahendra Raje
    31ST INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2011), 2011, : 549 - 558
  • [35] Harnessing the Cloud for Securely Outsourcing Large-Scale Systems of Linear Equations
    Wang, Cong
    Ren, Kui
    Wang, Jia
    Wang, Qian
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2013, 24 (06) : 1172 - 1181
  • [36] Large-scale climate simulations harnessing clusters, grid and cloud infrastructures
    Fernandez-Quiruelas, V.
    Blanco, C.
    Cofino, A. S.
    Fernandez, J.
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2015, 51 : 36 - 44
  • [37] Harnessing Large-Scale Herbarium Image Datasets Through Representation Learning
    Walker, Barnaby E.
    Tucker, Allan
    Nicolson, Nicky
    FRONTIERS IN PLANT SCIENCE, 2022, 12
  • [38] Harnessing Connectivity in a Large-Scale Small-Molecule Sensitivity Dataset
    Seashore-Ludlow, Brinton
    Rees, Matthew G.
    Cheah, Jaime H.
    Cokol, Murat
    Price, Edmund V.
    Coletti, Matthew E.
    Jones, Victor
    Bodycombe, Nicole E.
    Soule, Christian K.
    Gould, Joshua
    Alexander, Benjamin
    Li, Ava
    Montgomery, Philip
    Wawer, Mathias J.
    Kuru, Nurdan
    Kotz, Joanne D.
    Hon, C. Suk-Yee
    Munoz, Benito
    Liefeld, Ted
    Dancik, Vlado
    Bittker, Joshua A.
    Palmer, Michelle
    Bradner, James E.
    Shamji, Alykhan F.
    Clemons, Paul A.
    Schreiber, Stuart L.
    CANCER DISCOVERY, 2015, 5 (11) : 1210 - 1223
  • [39] Large-Scale Feature Matching with Distributed and Heterogeneous Computing
    Mills, Steven
    Eyers, David
    Leung, Kai-Cheung
    Tang, Xiaoxin
    Huang, Zhiyi
    PROCEEDINGS OF 2013 28TH INTERNATIONAL CONFERENCE ON IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ 2013), 2013, : 208 - 213
  • [40] COGNISERVE: HETEROGENEOUS SERVER ARCHITECTURE FOR LARGE-SCALE RECOGNITION
    Iyer, Ravi
    Srinivasan, Sadagopan
    Tickoo, Omesh
    Fang, Zhen
    Illikkal, Ramesh
    Zhang, Steven
    Chadha, Vineet
    Stillwell, Paul M., Jr.
    Lee, Seung Eun
    IEEE MICRO, 2011, 31 (03) : 20 - 31