LiteFlow: Towards High-performance Adaptive Neural Networks for Kernel Datapath

被引：3

作者：

Zhang, Junxue ^{[1
,2
]}

Zeng, Chaoliang ^{[1
]}

Zhang, Hong ^{[3
]}

Hu, Shuihai ^{[4
]}

Chen, Kai ^{[1
]}

机构：

[1] Hong Kong Univ Sci & Technol, iSING Lab, Hong Kong, Peoples R China

[2] Clustar, Beijing, Peoples R China

[3] Univ Calif Berkeley, Berkeley, CA USA

[4] Huawei, Shenzhen, Peoples R China

来源：

SIGCOMM '22: PROCEEDINGS OF THE 2022 ACM SIGCOMM 2022 CONFERENCE | 2022年

关键词：

Kernel Datapath; Adaptive Neural Network; Deployment;

D O I：

10.1145/3544216.3544229

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Adaptive neural networks (NN) have been used to optimize OS kernel datapath functions because they can achieve superior performance under changing environments. However, how to deploy these NNs remains a challenge. One approach is to deploy these adaptive NNs in the userspace. However, such userspace deployments suffer from either high cross-space communication overhead or low responsiveness, significantly compromising the function performance. On the other hand, pure kernel-space deployments also incur a large performance degradation because the computation logic of model tuning algorithm is typically complex, interfering with the performance of normal datapath execution. This paper presents LiteFlow, a hybrid solution to build high-performance adaptive NNs for kernel datapath. At its core, LiteFlow decouples the control path of adaptive NNs into: (1) a kernel-space fast path for efficient model inference, and (2) a userspace slow path for effective model tuning. We have implemented LiteFlow with Linux kernel datapath and evaluated it with three popular datapath functions including congestion control, flow scheduling, and load balancing. Compared to prior works, LiteFlow achieves 44.4% better goodput for congestion control, and improves the completion time for long flows by 33.7% and 56.7% for flow scheduling and load balancing, respectively.

引用

页码：414 / 427

页数：14

共 50 条

[21] Towards high performance low bitwidth training for deep neural networks
Chunyou Su
Sheng Zhou
Liang Feng
Wei Zhang
Journal of Semiconductors, 2020, 41 (02) : 65 - 74
[22] Towards high performance low bitwidth training for deep neural networks
Su, Chunyou
Zhou, Sheng
Feng, Liang
Zhang, Wei
JOURNAL OF SEMICONDUCTORS, 2020, 41 (02)
[23] Modeling of strength of high-performance concrete using artificial neural networks
Yeh, IC
CEMENT AND CONCRETE RESEARCH, 1998, 28 (12) : 1797 - 1808
[24] Deciphering the Feature Representation of Deep Neural Networks for High-Performance AI
Islam, Md Tauhidul
Xing, Lei
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (08) : 5273 - 5287
[25] High-Performance FPGA-based Accelerator for Bayesian Neural Networks
Fan, Hongxiang
Ferianc, Martin
Rodrigues, Miguel
Zhou, Hongyu
Niu, Xinyu
Luk, Wayne
2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 1063 - 1068
[26] High-performance Architecture Aware Sparse Convolutional Neural Networks for GPUs
Xiang, Lizhi
Sadayappan, P.
Sukumaran-Rajam, Aravind
PROCEEDINGS OF THE 2022 31ST INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PACT 2022, 2022, : 265 - 278
[27] A HIGH-PERFORMANCE DIGITAL PROCESSOR FOR IMPLEMENTING LARGE ARTIFICIAL NEURAL NETWORKS
MYERS, DJ
VINCENT, JM
COX, AL
HARBRIDGE, JR
ORREY, DA
WILLIAMSON, CA
NAYLOR, DJ
BT TECHNOLOGY JOURNAL, 1992, 10 (03): : 134 - 143
[28] Binaryware: A High-Performance Digital Hardware Accelerator for Binary Neural Networks
Ryu, Sungju
Oh, Youngtaek
Kim, Jae-Joon
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2023, 31 (12) : 2137 - 2141
[29] Scalable High-Performance Architecture for Convolutional Ternary Neural Networks on FPGA
Prost-Boucle, Adrien
Bourge, Alban
Petrot, Frederic
Alemdar, Hande
Caldwell, Nicholas
Leroy, Vincent
2017 27TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2017,
[30] High-performance stock index trading via neural networks and trees
Chalvatzis, Chariton
Hristu-Varsakelis, Dimitrios
APPLIED SOFT COMPUTING, 2020, 96

← 1 2 3 4 5 →