LiteFlow: Towards High-performance Adaptive Neural Networks for Kernel Datapath

被引:3
|
作者
Zhang, Junxue [1 ,2 ]
Zeng, Chaoliang [1 ]
Zhang, Hong [3 ]
Hu, Shuihai [4 ]
Chen, Kai [1 ]
机构
[1] Hong Kong Univ Sci & Technol, iSING Lab, Hong Kong, Peoples R China
[2] Clustar, Beijing, Peoples R China
[3] Univ Calif Berkeley, Berkeley, CA USA
[4] Huawei, Shenzhen, Peoples R China
关键词
Kernel Datapath; Adaptive Neural Network; Deployment;
D O I
10.1145/3544216.3544229
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Adaptive neural networks (NN) have been used to optimize OS kernel datapath functions because they can achieve superior performance under changing environments. However, how to deploy these NNs remains a challenge. One approach is to deploy these adaptive NNs in the userspace. However, such userspace deployments suffer from either high cross-space communication overhead or low responsiveness, significantly compromising the function performance. On the other hand, pure kernel-space deployments also incur a large performance degradation because the computation logic of model tuning algorithm is typically complex, interfering with the performance of normal datapath execution. This paper presents LiteFlow, a hybrid solution to build high-performance adaptive NNs for kernel datapath. At its core, LiteFlow decouples the control path of adaptive NNs into: (1) a kernel-space fast path for efficient model inference, and (2) a userspace slow path for effective model tuning. We have implemented LiteFlow with Linux kernel datapath and evaluated it with three popular datapath functions including congestion control, flow scheduling, and load balancing. Compared to prior works, LiteFlow achieves 44.4% better goodput for congestion control, and improves the completion time for long flows by 33.7% and 56.7% for flow scheduling and load balancing, respectively.
引用
收藏
页码:414 / 427
页数:14
相关论文
共 50 条
  • [21] Towards high performance low bitwidth training for deep neural networks
    Chunyou Su
    Sheng Zhou
    Liang Feng
    Wei Zhang
    Journal of Semiconductors, 2020, 41 (02) : 65 - 74
  • [22] Towards high performance low bitwidth training for deep neural networks
    Su, Chunyou
    Zhou, Sheng
    Feng, Liang
    Zhang, Wei
    JOURNAL OF SEMICONDUCTORS, 2020, 41 (02)
  • [23] Modeling of strength of high-performance concrete using artificial neural networks
    Yeh, IC
    CEMENT AND CONCRETE RESEARCH, 1998, 28 (12) : 1797 - 1808
  • [24] Deciphering the Feature Representation of Deep Neural Networks for High-Performance AI
    Islam, Md Tauhidul
    Xing, Lei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (08) : 5273 - 5287
  • [25] High-Performance FPGA-based Accelerator for Bayesian Neural Networks
    Fan, Hongxiang
    Ferianc, Martin
    Rodrigues, Miguel
    Zhou, Hongyu
    Niu, Xinyu
    Luk, Wayne
    2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 1063 - 1068
  • [26] High-performance Architecture Aware Sparse Convolutional Neural Networks for GPUs
    Xiang, Lizhi
    Sadayappan, P.
    Sukumaran-Rajam, Aravind
    PROCEEDINGS OF THE 2022 31ST INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PACT 2022, 2022, : 265 - 278
  • [27] A HIGH-PERFORMANCE DIGITAL PROCESSOR FOR IMPLEMENTING LARGE ARTIFICIAL NEURAL NETWORKS
    MYERS, DJ
    VINCENT, JM
    COX, AL
    HARBRIDGE, JR
    ORREY, DA
    WILLIAMSON, CA
    NAYLOR, DJ
    BT TECHNOLOGY JOURNAL, 1992, 10 (03): : 134 - 143
  • [28] Binaryware: A High-Performance Digital Hardware Accelerator for Binary Neural Networks
    Ryu, Sungju
    Oh, Youngtaek
    Kim, Jae-Joon
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2023, 31 (12) : 2137 - 2141
  • [29] Scalable High-Performance Architecture for Convolutional Ternary Neural Networks on FPGA
    Prost-Boucle, Adrien
    Bourge, Alban
    Petrot, Frederic
    Alemdar, Hande
    Caldwell, Nicholas
    Leroy, Vincent
    2017 27TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2017,
  • [30] High-performance stock index trading via neural networks and trees
    Chalvatzis, Chariton
    Hristu-Varsakelis, Dimitrios
    APPLIED SOFT COMPUTING, 2020, 96