P4GPU: Acceleration of Programmable Data Plane Using a CPU-GPU Heterogeneous Architecture

被引：0

作者：

Li, Peilong ^{[1
]}

Luo, Yan ^{[1
]}

机构：

[1] Univ Massachusetts Lowell, Dept Elect & Comp Engn, Lowell, MA 01852 USA

来源：

2016 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE SWITCHING AND ROUTING (HPSR) | 2016年

关键词：

Programmable Data Plane; Heterogeneous Architecture; Packet Processing; P4; IP LOOKUP;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The programmability of the network data plane has become one of the most desirable features within the context of software defined networks, with P4 serving as a domain-specific language for defining data plane processing. In this work, we are motivated to address the challenges of mapping a P4 defined data plane to a heterogeneous programmable hardware architecture consisting of both a CPU and a GPU, which includes a salient parallel SIMD architecture for processing network flows. We first design a toolset that can be used to map a P4 program onto the proposed architecture. We then optimize the GPU kernel designs for "match-action" primitives and present latency-hiding techniques to reduce the overheads of CPU/GPU communication. In addition, load balancing is investigated to maximize the utilization of CPU and GPU resources. Our toolset and optimizations allow a P4 program to render promising performance on the given heterogeneous architecture. Specifically, the experimental results collected on our prototype systems show that the automatically configured GPU kernels achieve scalable lookup and classification speeds with 420 million IP lookups per second, and more than 60 million classifications per second (for 4K firewall rules).

引用

页码：168 / 175

页数：8

共 50 条

[41] TAP: A TLP-Aware Cache Management Policy for a CPU-GPU Heterogeneous Architecture
Lee, Jaekyu
Kim, Hyesoon
2012 IEEE 18TH INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA), 2012, : 91 - 102
[42] SCALABLE HETEROGENEOUS CPU-GPU COMPUTATIONS FOR UNSTRUCTURED TETRAHEDRAL MESHES
Langguth, Johannes
Sourouri, Mohammed
Lines, Glenn Terje
Baden, Scott B.
Cai, Xing
IEEE MICRO, 2015, 35 (04) : 6 - 15
[43] Accelerating Batched Power Flow on Heterogeneous CPU-GPU Platform
Hao, Jiao
Zhang, Zongbao
He, Zonglin
Liu, Zhengyuan
Tan, Zhengdong
Song, Yankan
Energies, 2024, 17 (24)
[44] FlinkCL: An OpenCL-Based In-Memory Computing Architecture on Heterogeneous CPU-GPU Clusters for Big Data
Chen, Cen
Li, Kenli
Ouyang, Aijia
Li, Keqin
IEEE TRANSACTIONS ON COMPUTERS, 2018, 67 (12) : 1765 - 1779
[45] Task Offloading and Resource Allocation in CPU-GPU Heterogeneous Networks
Gong, Chenyu
Ma, Mulei
Wu, Liantao
Liu, Wenxiang
Zhou, Yong
Yang, Yang
2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 4492 - 4497
[46] Multireference coupled cluster methods on heterogeneous CPU-GPU systems
Bhaskaran-Nair, Kiran
Ma, Wenjing
Krishnamoorthy, Sriram
Villa, Oreste
van Dam, Hubertus J. J.
Apra, Edoardo
Kowalski, Karol
ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2013, 246
[47] Adaptive Partitioning for Irregular Applications on Heterogeneous CPU-GPU Chips
Vilches, Antonio
Asenjo, Rafael
Navarro, Angeles
Corbera, Francisco
Gran, Ruben
Garzaran, Maria
INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2015 COMPUTATIONAL SCIENCE AT THE GATES OF NATURE, 2015, 51 : 140 - 149
[48] MPtostream:an OpenMP compiler for CPU-GPU heterogeneous parallel systems
YANG XueJun
Science China(Information Sciences), 2012, 55 (09) : 1961 - 1971
[49] Heterogeneous Computing (CPU-GPU) for Pollution Dispersion in an Urban Environment
Fernandez, Gonzalo
Mendina, Mariana
Usera, Gabriel
COMPUTATION, 2020, 8 (01)
[50] Hybrid-Smash: A Heterogeneous CPU-GPU Compression Library
Penaranda, Cristian
Reano, Carlos
Silla, Federico
IEEE ACCESS, 2024, 12 : 32706 - 32723

← 1 2 3 4 5 →