A Holistic Heterogeneity-Aware Data Placement Scheme for Hybrid Parallel I/O Systems

被引:8
|
作者
He, Shuibing [1 ]
Li, Zheng [2 ]
Zhou, Jiang [3 ]
Yin, Yanlong [4 ]
Xu, Xiaohua [5 ]
Chen, Yong [6 ]
Sun, Xian-He [7 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
[2] Stockton Univ, Sch Business, Comp Sci Program, Galloway, NJ 08205 USA
[3] Chinese Acad Sci, Inst Informat Engn, Beijing 100864, Peoples R China
[4] Inst Artificial Intelligence, Intelligent Comp Syst Res Ctr, Zhejiang Lab, Hangzhou 311100, Peoples R China
[5] Kennesaw State Univ, Dept Comp Sci, Kennesaw, GA 30144 USA
[6] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
[7] Illinois Inst Technol, Dept Comp Sci, Chicago, IL 60616 USA
基金
美国国家科学基金会;
关键词
Servers; System performance; Bandwidth; Computer science; Distributed databases; Sun; File systems; Parallel I; O system; parallel file system; hybrid parallel file system; data placement; solid state drive; SSD;
D O I
10.1109/TPDS.2019.2948901
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present H2DP, a holistic heterogeneity-aware data placement scheme for hybrid parallel I/O systems, which consist of HDD servers and SSD servers. Most of the existing approaches focus on server performance or application I/O pattern heterogeneity in data placement. H2DP considers three axes of heterogeneity: server performance, server space, and application I/O pattern. More specifically, H2DP determines the optimized stripe sizes on servers based on server performance, keeps only critical data on all hybrid servers and the rest data on HDD servers, and dynamically migrates data among different types of servers at run-time. This holistic heterogeneity-awareness enables H2DP to achieve high performance by alleviating server load imbalance, efficiently utilizing SSD space, and accommodating application pattern variation. We have implemented a prototype of H2DP under MPICH2 atop OrangeFS. Extensive experimental results demonstrate that H2DP significantly improve I/O system performance compared to existing data placement schemes.
引用
收藏
页码:830 / 842
页数:13
相关论文
共 50 条
  • [31] Heterogeneity-aware Cross-school Electives Recommendation: a Hybrid Federated Approach
    Ju, Chengyi
    Cao, Jiannong
    Yang, Yu
    Yang, Zhen-Qun
    Lee, Ho Man
    2023 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW 2023, 2023, : 1500 - 1508
  • [32] SDPIPE: A Semi-Decentralized Framework for Heterogeneity-aware Pipeline-parallel Training
    Miao, Xupeng
    Shi, Yining
    Yang, Zhi
    Cui, Bin
    Jia, Zhihao
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (09): : 2354 - 2363
  • [33] CHRT: a Criticality- and Heterogeneity-Aware Runtime System for Task-Parallel Applications
    Han, Myeonggyun
    Park, Jinsu
    Baek, Woongki
    PROCEEDINGS OF THE 2017 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2017, : 942 - 945
  • [34] Heterogeneity-aware Clustered Distributed Learning for Multi-source Data Analysis
    Chen, Yuanxing
    Zhang, Qingzhao
    Ma, Shuangge
    Fang, Kuangnan
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25
  • [35] Spatially-aware Parallel I/O for Particle Data
    Kumar, Sidharth
    Petruzza, Steve
    Usher, Will
    Pascucci, Valerio
    PROCEEDINGS OF THE 48TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP 2019), 2019,
  • [36] Heterogeneity-aware elastic provisioning in cloud-assisted edge computing systems
    Li, Chunlin
    Bai, Jingpan
    Ge, Yuan
    Luo, Youlong
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 112 (112): : 1106 - 1121
  • [37] Design and Implementation of a Criticality- and Heterogeneity-Aware Runtime System for Task-Parallel Applications
    Han, Myeonggyun
    Park, Jinsu
    Baek, Woongki
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (05) : 1117 - 1132
  • [38] FedDM: Data and Model Heterogeneity-Aware Federated Learning via Dynamic Weight Sharing
    Shen, Leming
    Zheng, Yuanqing
    2023 IEEE 43RD INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, ICDCS, 2023, : 975 - 976
  • [39] Buffer-Aware Data Migration Scheme for Hybrid Storage Systems
    Lin, Mingwei
    Chen, Riqing
    Lin, Li
    Li, Xuan
    Huang, Jingchang
    IEEE ACCESS, 2018, 6 : 47646 - 47656
  • [40] HiNUMA: NUMA-aware Data Placement and Migration in Hybrid Memory Systems
    Duan, Zhuohui
    Liu, Haikun
    Liao, Xiaofei
    Jin, Hai
    Jiang, Wenbin
    Zhang, Yu
    2019 IEEE 37TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2019), 2019, : 367 - 375