P4GPU: Acceleration of Programmable Data Plane Using a CPU-GPU Heterogeneous Architecture

Citations: 0
Authors
Li, Peilong [1]
Luo, Yan [1]
Affiliation
[1] Univ Massachusetts Lowell, Dept Elect & Comp Engn, Lowell, MA 01852 USA
Keywords
Programmable Data Plane; Heterogeneous Architecture; Packet Processing; P4; IP Lookup
DOI
Not available
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic and Communication Technology]
Discipline Classification Codes
0808; 0809
Abstract
The programmability of the network data plane has become one of the most desirable features within the context of software-defined networks, with P4 serving as a domain-specific language for defining data plane processing. In this work, we are motivated to address the challenges of mapping a P4-defined data plane to a heterogeneous programmable hardware architecture consisting of both a CPU and a GPU, which includes a salient parallel SIMD architecture for processing network flows. We first design a toolset that can be used to map a P4 program onto the proposed architecture. We then optimize the GPU kernel designs for "match-action" primitives and present latency-hiding techniques to reduce the overheads of CPU/GPU communication. In addition, load balancing is investigated to maximize the utilization of CPU and GPU resources. Our toolset and optimizations allow a P4 program to render promising performance on the given heterogeneous architecture. Specifically, the experimental results collected on our prototype systems show that the automatically configured GPU kernels achieve scalable lookup and classification speeds with 420 million IP lookups per second, and more than 60 million classifications per second (for 4K firewall rules).
Pages: 168-175
Page count: 8