Characterizing Performance of Graph Neighborhood Communication Patterns

Cited by: 7
Authors
Ghosh, Sayan [1]
Tallent, Nathan R. [1]
Halappanavar, Mahantesh [1]
Affiliations
[1] Pacific Northwest Natl Lab, Adv Comp Math & Data Div, Richland, WA 99354 USA
Keywords
Benchmark testing; Pattern matching; Heating systems; Clustering algorithms; Software; Topology; Surges; Graphs and networks; neighborhood communication; MPI; network communication; network contention; distributed memories; benchmarking; algorithms
DOI
10.1109/TPDS.2021.3101425
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Distributed-memory graph algorithms are fundamental enablers in scientific computing and analytics workflows. A majority of graph algorithms rely on the graph neighborhood communication pattern, i.e., repeated asynchronous communication between a vertex and its neighbors in the graph. The pattern is adversarial for communication software and hardware due to high message injection rates and input-dependent, many-to-one traffic with variable destinations and volumes. We present benchmarks and performance analysis of graph neighborhood communication on modern large-scale network interconnects from four supercomputers: ALCF Theta, NERSC Cori, OLCF Summit and R-CCS Fugaku. Our benchmarks characterize communication from the perspectives of latency and throughput. Benchmark parameters make it possible to mimic the behaviors of complex applications on real-world or synthetic graphs by varying work distribution, remote edges, message volume, and per-vertex work. We find that minor changes in the input graph can substantially increase latencies, and that contention can develop in memory caches and network stacks before it develops in the network itself. Further, latencies and contention vary significantly for different graph neighborhoods, motivating the need for exploring asynchronous algorithms in greater detail. When adding work, load imbalance on real-world graphs can be pronounced: 99th-percentile latencies were 8-128x the corresponding average latencies. Our results help analysts and developers understand the performance implications of this important pattern, especially for the impending exascale platforms.
Pages: 915-928
Page count: 14
Related papers
50 in total
  • [1] Characterizing Communication Patterns of Parallel Programs Through Graph Visualization and Analysis
    Stringhini, Denise
    Fazenda, Alvaro
    EURO-PAR 2015: PARALLEL PROCESSING WORKSHOPS, 2015, 9523 : 565 - 576
  • [2] Characterizing Individual Communication Patterns
    Malmgren, R. Dean
    Hofman, Jake M.
    Amaral, Luis A. N.
    Watts, Duncan J.
    KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 607 - 615
  • [3] Mining Frequent Neighborhood Patterns in a Large Labeled Graph
    Han, Jialong
    Wen, Ji-Rong
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 259 - 268
  • [4] Interferer nulling based on neighborhood communication patterns
    Vilzmann, Robert
    Widmer, Joerg
    Aad, Imad
    Hartmann, Christian
    WIRELESS COMMUNICATIONS 2007 CNIT THYRRENIAN SYMPOSIUM, 2008, : 81 - +
  • [5] Characterizing Communication Patterns between Audiences and Newsbots
    Gomez-Zara, Diego
    Diakopoulos, Nicholas
    DIGITAL JOURNALISM, 2020, 8 (09) : 1093 - 1113
  • [6] On Characterizing the Performance of Distributed Graph Computation Platforms
    Barnawi, Ahmed
    Batarfi, Omar
    Behteshi, Seyed-Mehdi-Reza
    Elshawi, Radwa
    Fayoumi, Ayman
    Nouri, Reza
    Sakr, Sherif
    PERFORMANCE CHARACTERIZATION AND BENCHMARKING: TRADITIONAL TO BIG DATA, 2015, 8904 : 29 - 43
  • [7] CHARACTERIZING A ZOOPLANKTON NEIGHBORHOOD - SMALL-SCALE PATTERNS OF ASSOCIATION AND ABUNDANCE
    FOLT, C
    SCHULZE, PC
    BAUMGARTNER, K
    FRESHWATER BIOLOGY, 1993, 30 (02) : 289 - 300
  • [8] Characterizing exercise relationships: Communication, closeness, and performance
    Max, Emery J.
    Wittenbaum, Gwen M.
    Feltz, Deborah L.
    JOURNAL OF SPORT & EXERCISE PSYCHOLOGY, 2016, 38 : S228 - S228
  • [9] Characterizing the Communication Demands of the Graph500 Benchmark on a Commodity Cluster
    Fuentes, Pablo
    Luis Bosque, Jose
    Beivide, Ramon
    Valero, Mateo
    Minkenberg, Cyriel
    2014 IEEE/ACM INTERNATIONAL SYMPOSIUM ON BIG DATA COMPUTING (BDC), 2014, : 83 - 89
  • [10] Characterizing Communication Patterns of NAS-MPI benchmark programs
    Lee, I.
    PROCEEDINGS OF THE IEEE SOUTHEASTCON 2009, TECHNICAL PROCEEDINGS, 2009, : 158 - 163