Network-Accelerated Non-Contiguous Memory Transfers

被引:5
|
作者
Di Girolamo, Salvatore [1 ,4 ]
Taranov, Konstantin [1 ]
Kurth, Andreas [2 ]
Schaffner, Michael [2 ]
Schneider, Timo [1 ]
Beranek, Jakub [3 ]
Besta, Maciej [1 ]
Benini, Luca [2 ]
Roweth, Duncan [4 ]
Hoefler, Torsten [1 ]
机构
[1] Swiss Fed Inst Technol, Dept Comp Sci, CH-8092 Zurich, Switzerland
[2] Swiss Fed Inst Technol, Integrated Syst Lab, CH-8092 Zurich, Switzerland
[3] VSB Tech Univ Ostrava, IT4Innovat, Ostrava, Czech Republic
[4] Cray UK Ltd, Bristol, Avon, England
基金
欧洲研究理事会;
关键词
DATA-TYPE COMMUNICATION; MPI;
D O I
10.1145/3295500.3356189
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Applications often communicate data that is non-contiguous in the send- or the receive-buffer, e.g., when exchanging a column of a matrix stored in row-major order. While non-contiguous transfers are well supported in HPC (e.g., MPI derived datatypes), they can still be up to 5x slower than contiguous transfers of the same size. As we enter the era of network acceleration, we need to investigate which tasks to offload to the NIC: In this work we argue that non-contiguous memory transfers can be transparently network-accelerated, truly achieving zero-copy communications. We implement and extend sPIN, a packet streaming processor, within a Portals 4 NIC SST model, and evaluate strategies for NIC-offloaded processing of MPI datatypes, ranging from datatype-specific handlers to general solutions for any MPI datatype. We demonstrate up to 8x speedup in the unpack throughput of real applications, demonstrating that non-contiguous memory transfers are a first-class candidate for network acceleration.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Novel non-contiguous exon duplication in choroideremia
    Edwards, T. L.
    Williams, J.
    Patricio, M. I.
    Simunovic, M. P.
    Shanks, M.
    Clouston, P.
    MacLaren, R. E.
    CLINICAL GENETICS, 2018, 93 (01) : 144 - 148
  • [22] Performance evaluation of adaptive non-contiguous MC-CDMA and non-contiguous CI/MC-CDMA for dynamic spectrum access
    Wu, Zhiqiang
    Ratazzi, Paul
    Chakravarthy, Vasu D.
    Hong, Lang
    2008 3RD INTERNATIONAL CONFERENCE ON COGNITIVE RADIO ORIENTED WIRELESS NETWORKS AND COMMUNICATIONS, 2008, : 526 - 531
  • [23] Building Blocks for Network-Accelerated Distributed File Systems
    Di Girolamo, Salvatore
    De Sensi, Daniele
    Taranov, Konstantin
    Malesevic, Milos
    Besta, Maciej
    Schneider, Timo
    Kistler, Severin
    Hoefler, Torsten
    SC22: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2022,
  • [24] Implementation of Experimental Equipment with Wideband Non-Contiguous OFDM
    Takakusaki, Keiji
    Kanno, Issei
    Kosaka, Kazuhiro
    Hasegawa, Akio
    Shinbo, Hiroyuki
    Takeuchi, Yoshio
    2014 IEEE ASIA PACIFIC CONFERENCE ON WIRELESS AND MOBILE, 2014, : 162 - 167
  • [25] RANDOM NON-CONTIGUOUS PLOTS IN INTERLOCKING FIELD LAYOUTS
    LIBBY, WJ
    COCKERHAM, CC
    SILVAE GENETICA, 1980, 29 (5-6) : 183 - 190
  • [26] Compiler Optimizations for Non-contiguous Remote Data Movement
    Schneider, Timo
    Gerstenberger, Robert
    Hoefler, Torsten
    LANGUAGES AND COMPILERS FOR PARALLEL COMPUTING, LCPC 2013, 2014, 8664 : 307 - 321
  • [27] Approximating the Non-contiguous Multiple Organization Packing Problem
    Bougeret, Marin
    Dutot, Pierre Francois
    Jansen, Klaus
    Otte, Christina
    Trystram, Denis
    THEORETICAL COMPUTER SCIENCE, 2010, 323 : 316 - +
  • [28] PAPR analysis of non-contiguous duplex multicarrier signals
    Savaux, V
    Louet, Y.
    ELECTRONICS LETTERS, 2021, 57 (02) : 92 - 95
  • [29] Brucellosis of the spine affecting multiple non-contiguous levels
    Charalambides, Constantinos
    Papademetriou, Kyriakos
    Sgouros, Spyros
    Sakas, Damianos
    BRITISH JOURNAL OF NEUROSURGERY, 2010, 24 (05) : 589 - 591
  • [30] Approximating the non-contiguous multiple organization packing problem
    Bougeret M.
    Dutot P.F.
    Jansen K.
    Otte C.
    Trystram D.
    IFIP Advances in Information and Communication Technology, 2010, 323 AICT : 316 - 327