Modeling and Optimizing Large-Scale Wide-Area Data Transfers

被引:11
|
作者
Kettimuthu, Rajkumar [1 ,2 ]
Vardoyan, Gayane [1 ]
Agrawal, Gagan [2 ]
Sadayappan, P. [2 ]
机构
[1] Argonne Natl Lab, Math & Comp Sci Div, Argonne, IL 60439 USA
[2] Ohio State Univ, Comp Sci & Engn, Columbus, OH 43210 USA
关键词
wide-area data transfer; GridFTP; modeling data transfer; BANDWIDTH ALLOCATION;
D O I
10.1109/CCGrid.2014.114
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data generated by experimental, simulation, and observational science is growing exponentially. The resulting datasets are often transported over wide-area networks for storage, analysis, or visualization. Network bandwidth, which is not increasing at the same rate as dataset sizes, is becoming a key obstacle to data-driven sciences. In this paper, we focus on how bandwidth allocation can be controlled at the level of a protocol such as GridFTP, in view of goals such as maintaining certain priorities or performing scheduling with specified objectives. In particular, we explore how GridFTP transfer performance can be controlled by using parallelism and concurrency. We find that concurrency turns out to be a more powerful control knob than is parallelism. For a source where most bandwidth is consumed by transfers to a small number of other destinations, we build a model for each destination's achieved throughput in terms of its concurrency and total concurrency (over GridFTP transfers) to other major destinations. We then enhance this model by including an indicator of the time-varying external load, using multiple ways to measure this external load. We study the effectiveness of the proposed models in controlling the bandwidth allocation. After evaluating the numerous combinations of models and methods of measuring external load, we narrow in on the four best-performing ones, based on both their validation results and their applicability. After extensive testing of these four approaches, we find that they can obtain desired bandwidth allocations with a mean(median) error rate of 19.8%(13.8%), with 38% of the errors in our benchmark tests being less than 10% and 54% of them being less than 15%.
引用
收藏
页码:196 / 205
页数:10
相关论文
共 50 条
  • [21] Building a Large-Scale and Wide-Area Quantum Internet Based on an OSI-Alike Model
    Li, Zhonghui
    Xue, Kaiping
    Li, Jian
    Yu, Nenghai
    Liu, Jianqing
    Wei, David S. L.
    Sun, Qibin
    Lu, Jun
    CHINA COMMUNICATIONS, 2021, 18 (10) : 1 - 14
  • [22] An Autonomous Decentralized Adaptive Function for Retaining Control Strength in Large-Scale and Wide-Area System
    Sakumoto, Yusuke
    Aida, Masaki
    Shimonishi, Hideyuki
    2014 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2014), 2014, : 1923 - 1929
  • [23] A cluster-based networking approach for large-scale and wide-area quantum key agreement
    Zhonghui Li
    Kaiping Xue
    Qidong Jia
    Jian Li
    David S. L. Wei
    Jianqing Liu
    Nenghai Yu
    Quantum Information Processing, 21
  • [24] Decentralized Wide-Area Neural Network Predictive Damping Controller for a Large-scale Power System
    Yohanandhan, Rajaa Vikhram
    Srinivasan, Latha
    2018 IEEE INTERNATIONAL CONFERENCE ON POWER ELECTRONICS, DRIVES AND ENERGY SYSTEMS (PEDES), 2018,
  • [25] Online Tuning of Dynamic Equivalents for Large-Scale Power Systems Using Wide-area Measurements
    Jiang, Zhihao
    Tong, Ning
    Zhu, Lin
    Liu, Yilu
    2021 IEEE POWER & ENERGY SOCIETY GENERAL MEETING (PESGM), 2021,
  • [26] A dynamic scheduling approach for coordinated wide-area data transfers using GridFTP
    Khanna, Gaurav
    Catalyurek, Umit
    Kurc, Tahsin
    Kettimuthu, Rajkumar
    Sadayappan, P.
    Saltz, Joel
    2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8, 2008, : 2011 - +
  • [27] MODELING AND OPTIMIZING LARGE-SCALE CHROMATOGRAPHIC SEPARATIONS
    MCCOY, BJ
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1986, 191 : 109 - INDE
  • [28] An Autonomous Decentralized Control for Indirectly Controlling System Performance Variable in Large-Scale and Wide-Area Network
    Sakumoto, Yusuke
    Aida, Masaki
    Shimonishi, Hideyuki
    2014 16TH INTERNATIONAL TELECOMMUNICATIONS NETWORK STRATEGY AND PLANNING SYMPOSIUM (NETWORKS), 2014,
  • [29] Towards Optimizing Wide-Area Streaming Analytics
    Heintz, Benjamin
    Chandra, Abhishek
    Sitaraman, Ramesh K.
    2015 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E 2015), 2015, : 452 - 457
  • [30] Optimizing parallel applications for wide-area clusters
    Bal, HE
    Plaat, A
    Bakker, MG
    Dozy, P
    Hofman, RFH
    FIRST MERGED INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM & SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING, 1998, : 784 - 790