CHEF: A Framework for Deploying Heterogeneous Models on Clusters With Heterogeneous FPGAs

被引:0
|
作者
Tang, Yue [1 ]
Song, Yukai [1 ]
Elango, Naveena [2 ]
Priya, Sheena Ratnam [2 ]
Jones, Alex K. [3 ,4 ]
Xiong, Jinjun [2 ]
Zhou, Peipei [5 ]
Hu, Jingtong [1 ]
机构
[1] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15261 USA
[2] Univ Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14260 USA
[3] Syracuse Univ, Dept Elect Engn, Syracuse, NY 13244 USA
[4] Syracuse Univ, Comp Sci Dept, Syracuse, NY 13244 USA
[5] Brown Univ, Sch Engn, Providence, RI 02912 USA
关键词
Design automation; Computational modeling; Clustering algorithms; Machine learning; Bandwidth; Benchmark testing; Market research; Hardware; Integrated circuit modeling; Field programmable gate arrays; Heterogeneous FPGA clusters; multimodality multitask (MMMT);
D O I
10.1109/TCAD.2024.3438994
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep neural networks (DNNs) are rapidly evolving from streamlined single-modality single-task (SMST) to multimodality multitask (MMMT) with large variations for different layers and complex data dependencies among layers. To support such models, hardware systems also evolved to be heterogeneous. The heterogeneous system comes from the prevailing trend to integrate diverse accelerators into the system for lower latency. FPGAs have high-computation density and communication bandwidth and are configurable to be deployed with different designs of accelerators, which are widely used for various machine-learning applications. However, scaling from SMST to MMMT on heterogeneous FPGAs is challenging since MMMT has much larger layer variations, a massive number of layers, and complex data dependency among different backbones. Previous mapping algorithms are either inefficient or over-simplified which makes them impractical in general scenarios. In this work, we propose CHEF to enable efficient implementation of MMMT models in realistic heterogeneous FPGA clusters, i.e., deploying heterogeneous accelerators on heterogeneous FPGAs (A2F) and mapping the heterogeneous DNNs on the deployed heterogeneous accelerators (M2A). We propose CHEF-A2F, a two-stage accelerators-to-FPGAs deployment approach to co-optimize hardware deployment and accelerator mapping. In addition, we propose CHEF-M2A, which can support general and practical cases compared to previous mapping algorithms. To the best of our knowledge, this is the first attempt to implement MMMT models in real heterogeneous FPGA clusters. Experimental results show that the latency obtained with CHEF is near-optimal while the search time is 10 $000\times $ less than exhaustively searching the optimal solution.
引用
收藏
页码:3937 / 3948
页数:12
相关论文
共 50 条
  • [1] AIgean: An Open Framework for Deploying Machine Learning on Heterogeneous Clusters
    Tarafdar, Naif
    Di Guglielmo, Giuseppe
    Harris, Philip C.
    Krupa, Jeffrey D.
    Loncar, Vladimir
    Rankin, Dylan S.
    Tran, Nhan
    Wu, Zhenbin
    Shen, Qianfeng
    Chow, Paul
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2022, 15 (03)
  • [2] A Modular Heterogeneous Stack for Deploying FPGAs and CPUs in the Data Center
    Eskandari, Nariman
    Tarafdar, Naif
    Ly-Ma, Daniel
    Chow, Paul
    PROCEEDINGS OF THE 2019 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA'19), 2019, : 262 - 271
  • [3] Heterogeneous floorplanning for FPGAs
    Feng, Y
    Mehta, DP
    19TH INTERNATIONAL CONFERENCE ON VLSI DESIGN, PROCEEDINGS, 2005, : 257 - 262
  • [4] A simulation framework for priority scheduling on heterogeneous clusters
    Cavdar, Derya
    Birke, Robert
    Chen, Lydia Y.
    Alagoz, Fatih
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2015, 52 : 37 - 48
  • [5] A framework for statistical analysis of datasets on heterogeneous clusters
    Carino, Ricolindo L.
    Banicescu, Ioana
    2005 IEEE International Conference on Cluster Computing (CLUSTER), 2006, : 493 - 501
  • [6] Workload Distribution Framework for the Parallel Solution of Large Structural Models on Heterogeneous PC Clusters
    Kurc, Ozgur
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2010, 24 (02) : 151 - 160
  • [7] Configuring A MapReduce Framework For Performance-Heterogeneous Clusters
    Hartog, Jessica
    DelValle, Renan
    Govindaraju, Madhusudhan
    Lewis, Michael J.
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 120 - 127
  • [8] AIgean: An Open Framework for Machine Learning on Heterogeneous Clusters
    Tarafdar, Naif
    Di Guglielmo, Giuseppe
    Harris, Philip C.
    Krupa, Jeffrey D.
    Loncar, Vladimir
    Rankin, Dylan S.
    Nhan Tran
    Wu, Zhenbin
    Shen, Qianfeng
    Chow, Paul
    28TH IEEE INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2020, : 239 - 239
  • [9] ExaStamp: A Parallel Framework for Molecular Dynamics on Heterogeneous Clusters
    Cieren, Emmanuel
    Colombet, Laurent
    Pitoiset, Samuel
    Namyst, Raymond
    EURO-PAR 2014: PARALLEL PROCESSING WORKSHOPS, PT II, 2014, 8806 : 121 - 132
  • [10] A framework for adaptive communication modeling on heterogeneous hierarchical clusters
    Nasri, Wahid
    Hamad, Hajer
    Fejjari, Hadhemi
    2006 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, VOLS 1 AND 2, 2006, : 699 - +