Modeling and Optimizing Large-Scale Wide-Area Data Transfers

被引:11
|
作者
Kettimuthu, Rajkumar [1 ,2 ]
Vardoyan, Gayane [1 ]
Agrawal, Gagan [2 ]
Sadayappan, P. [2 ]
机构
[1] Argonne Natl Lab, Math & Comp Sci Div, Argonne, IL 60439 USA
[2] Ohio State Univ, Comp Sci & Engn, Columbus, OH 43210 USA
关键词
wide-area data transfer; GridFTP; modeling data transfer; BANDWIDTH ALLOCATION;
D O I
10.1109/CCGrid.2014.114
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data generated by experimental, simulation, and observational science is growing exponentially. The resulting datasets are often transported over wide-area networks for storage, analysis, or visualization. Network bandwidth, which is not increasing at the same rate as dataset sizes, is becoming a key obstacle to data-driven sciences. In this paper, we focus on how bandwidth allocation can be controlled at the level of a protocol such as GridFTP, in view of goals such as maintaining certain priorities or performing scheduling with specified objectives. In particular, we explore how GridFTP transfer performance can be controlled by using parallelism and concurrency. We find that concurrency turns out to be a more powerful control knob than is parallelism. For a source where most bandwidth is consumed by transfers to a small number of other destinations, we build a model for each destination's achieved throughput in terms of its concurrency and total concurrency (over GridFTP transfers) to other major destinations. We then enhance this model by including an indicator of the time-varying external load, using multiple ways to measure this external load. We study the effectiveness of the proposed models in controlling the bandwidth allocation. After evaluating the numerous combinations of models and methods of measuring external load, we narrow in on the four best-performing ones, based on both their validation results and their applicability. After extensive testing of these four approaches, we find that they can obtain desired bandwidth allocations with a mean(median) error rate of 19.8%(13.8%), with 38% of the errors in our benchmark tests being less than 10% and 54% of them being less than 15%.
引用
收藏
页码:196 / 205
页数:10
相关论文
共 50 条
  • [31] Wide-area damping controller design of large-scale PV power plants in interconnected power systems
    Ge, Jing
    Zhao, Da-wei
    Ma, Jin
    Wang, Yu-sheng
    Du, Hong-ji
    Qian, Min-hui
    2016 CHINA INTERNATIONAL CONFERENCE ON ELECTRICITY DISTRIBUTION (CICED), 2016,
  • [32] Autonomous Decentralized Control for Indirectly Controlling System Performance Variable of Large-Scale and Wide-Area Networks
    Sakumoto, Yusuke
    Aida, Masaki
    Shimonishi, Hideyuki
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2015, E98B (11) : 2248 - 2258
  • [33] Wide-area Coordinated Control of Large Scale Energy Storage System
    Lu, Qiuyu
    Ru, Wei
    Min, Yong
    Ge, Weichun
    Wang, Zhiming
    2012 IEEE INTERNATIONAL CONFERENCE ON POWER SYSTEM TECHNOLOGY (POWERCON), 2012,
  • [34] Modeling and Control of Wide-Area Networks
    Wang, Qiuzhen
    Hu, Jiangping
    MATHEMATICS, 2023, 11 (18)
  • [35] Efficient Differentiated Storage Architecture for Large-Scale Flow Tables in Software-Defined Wide-Area Networks
    Xiong, Bing
    Wu, Rengeng
    Zhao, Jinyuan
    Liao, Zhuofan
    Wang, Jin
    IEEE ACCESS, 2019, 7 : 141193 - 141208
  • [36] Assessment and Choice of Input for Wide-area Damping Controller in Large-scale PV Taking Time Delay into Consideration
    Shah, Rakibuzzaman
    Mithulananthan, N.
    Lee, K. Y.
    2012 22ND AUSTRALASIAN UNIVERSITIES POWER ENGINEERING CONFERENCE (AUPEC): GREEN SMART GRID SYSTEMS, 2012,
  • [37] Wide-area measurement signal-based stabiliser for large-scale photovoltaic plants with high variability and uncertainty
    Shah, Rakibuzzaman
    Mithulananthan, Nadarajah
    Lee, Kwang Y.
    Bansal, Ramesh C.
    IET RENEWABLE POWER GENERATION, 2013, 7 (06) : 614 - 622
  • [38] Emergency fault affected wide-area automatic generation control via large-scale deep reinforcement learning
    Li, Jiawen
    Yu, Tao
    Zhang, Xiaoshun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106
  • [39] Wide-area Nile: A case study of a wide-area data-parallel application
    Amoroso, A
    Marzullo, K
    Ricciardi, A
    18TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 1998, : 506 - 515
  • [40] Optimizing data robustness in large-scale storage systems
    Gougeaud, Sebastien
    Zertal, Soraya
    Lafoucriere, Jacques-Charles
    Deniel, Philippe
    2017 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS), 2017, : 236 - 243