Model and simulation of exascale communication networks

被引:16
作者
Liu, N. [1 ]
Carothers, C. [1 ]
Cope, J. [2 ]
Carns, P. [2 ]
Ross, R. [1 ,2 ]
机构
[1] Rensselaer Polytech Inst, Dept Comp Sci, Troy, NY USA
[2] Argonne Natl Lab, Div Math & Comp Sci, Argonne, IL 60439 USA
基金
美国国家科学基金会;
关键词
parallel discrete-event simulation; torus network; exascale; discrete-event model;
D O I
10.1057/jos.2012.4
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Exascale supercomputers will have millions or even hundreds of millions of processing cores and the potential for nearly billion-way parallelism. Exascale compute and data storage architectures will be critically dependent on the interconnection network. The most popular interconnection network for current and future supercomputer systems is the torus (eg, k-ary, n-cube). This paper focuses on the modelling and simulation of ultra-large-scale torus networks using Rensselaer's Optimistic Simulator System. We compare real communication delays between our model and the actual torus network from Blue Gene/L using 2048 processors. Our performance experiments demonstrate the ability to simulate million-node to billion-node torus networks. The torus network model for a 16-million-node configuration shows a high degree of strong scaling when going from 1024 cores to 32 768 cores on Blue Gene/L, with a peak event-rate of nearly 5 billion events per second. We also demonstrate the performance of our torus network model configured with 1 billion nodes on both Blue Gene/L and Blue Gene/P systems. The observed best event rate at 128K cores is 12.36 billion per second on Blue Gene/P. Journal of Simulation (2012) 6, 227-236. doi:10.1057/jos.2012.4; published online 23 March 2012
引用
收藏
页码:227 / 236
页数:10
相关论文
共 29 条
[1]   Symbiotic Routing in Future Data Centers [J].
Abu-Libdeh, Hussam ;
Costa, Paolo ;
Rowstron, Antony ;
O'Shea, Greg ;
Donnelly, Austin .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2010, 40 (04) :51-62
[2]   Blue Gene/L torus interconnection network [J].
Adiga, NR ;
Blumrich, MA ;
Chen, D ;
Coteus, P ;
Gara, A ;
Giampapa, ME ;
Heidelberger, P ;
Singh, S ;
Steinmacher-Burow, BD ;
Takken, T ;
Tsao, M ;
Vranas, P .
IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2005, 49 (2-3) :265-276
[4]  
Balaji Pavan, 2009, Proceedings of the 2009 IEEE 15th International Conference on Parallel and Distributed Systems (ICPADS 2009), P586, DOI 10.1109/ICPADS.2009.117
[5]   Scalable Time Warp on Blue Gene Supercomputers [J].
Bauer, David W., Jr. ;
Carothers, Christopher D. ;
Holder, Akintayo .
PADS 2009: 23RD WORKSHOP ON PRINCIPLES OF ADVANCED AND DISTRIBUTED SIMULATION, PROCEEDINGS, 2009, :35-+
[6]  
Bestavros A, 2004, P 1 INT COMP ENG C I
[7]  
Bland AS, 2009, COMP FUT CUG 2009 P
[8]  
Blumrich M., 2003, RC23025W0312022 IBM
[10]  
Budnik T, 2010, 3 IEEE WORKSH MAN TA