M3X: Autonomous Accelerators via Context-Enabled Fast-Path Communication

被引:0
|
作者
Asmussen, Nils [1 ,2 ]
Roitzsch, Michael [1 ,2 ]
Haertig, Hermann [1 ,2 ]
机构
[1] Tech Univ Dresden, Dresden, Germany
[2] Barkhausen Inst, Dresden, Germany
关键词
SYSTEM; DESIGN;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Performance and efficiency requirements are driving a trend towards specialized accelerators in both datacenters and embedded devices. In order to cut down communication overheads, system components are pinned to cores and fast-path communication between them is established. These fast paths reduce latency by avoiding indirections through the operating system. However, we see three roadblocks that can impede further gains: First, accelerators today need to be assisted by a general-purpose core, because they cannot autonomously access operating system services like file systems or network stacks. Second, fast-path communication is at odds with preemptive context switching, which is still necessary today to improve efficiency when applications underutilize devices. Third, these concepts should be kept orthogonal, such that direct and unassisted communication is possible between any combination of accelerators and general-purpose cores. At the same time, all of them should support switching between multiple application contexts, which is most difficult with accelerators that lack the hardware features to run an operating system. We present (MX)-X-3, a system architecture that removes these roadblocks. (MX)-X-3 retains the low overhead of fast-path communication while enabling context switching for general-purpose cores and specialized accelerators. (MX)-X-3 runs accelerators autonomously and achieves a speedup of 4.7 for PCIe-attached image-processing accelerators compared to traditional assisted operation. At the same time, utilization of the host CPU is reduced by a factor of 30.
引用
收藏
页码:617 / 631
页数:15
相关论文
empty
未找到相关数据