A Holistic Heterogeneity-Aware Data Placement Scheme for Hybrid Parallel I/O Systems

被引:8
|
作者
He, Shuibing [1 ]
Li, Zheng [2 ]
Zhou, Jiang [3 ]
Yin, Yanlong [4 ]
Xu, Xiaohua [5 ]
Chen, Yong [6 ]
Sun, Xian-He [7 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
[2] Stockton Univ, Sch Business, Comp Sci Program, Galloway, NJ 08205 USA
[3] Chinese Acad Sci, Inst Informat Engn, Beijing 100864, Peoples R China
[4] Inst Artificial Intelligence, Intelligent Comp Syst Res Ctr, Zhejiang Lab, Hangzhou 311100, Peoples R China
[5] Kennesaw State Univ, Dept Comp Sci, Kennesaw, GA 30144 USA
[6] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
[7] Illinois Inst Technol, Dept Comp Sci, Chicago, IL 60616 USA
基金
美国国家科学基金会;
关键词
Servers; System performance; Bandwidth; Computer science; Distributed databases; Sun; File systems; Parallel I; O system; parallel file system; hybrid parallel file system; data placement; solid state drive; SSD;
D O I
10.1109/TPDS.2019.2948901
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present H2DP, a holistic heterogeneity-aware data placement scheme for hybrid parallel I/O systems, which consist of HDD servers and SSD servers. Most of the existing approaches focus on server performance or application I/O pattern heterogeneity in data placement. H2DP considers three axes of heterogeneity: server performance, server space, and application I/O pattern. More specifically, H2DP determines the optimized stripe sizes on servers based on server performance, keeps only critical data on all hybrid servers and the rest data on HDD servers, and dynamically migrates data among different types of servers at run-time. This holistic heterogeneity-awareness enables H2DP to achieve high performance by alleviating server load imbalance, efficiently utilizing SSD space, and accommodating application pattern variation. We have implemented a prototype of H2DP under MPICH2 atop OrangeFS. Extensive experimental results demonstrate that H2DP significantly improve I/O system performance compared to existing data placement schemes.
引用
收藏
页码:830 / 842
页数:13
相关论文
共 50 条
  • [41] On the impact of heterogeneity-aware mesh partitioning and non-contributing computation removal on parallel reservoir simulations
    Andreas Thune
    Xing Cai
    Alf Birger Rustad
    Journal of Mathematics in Industry, 11
  • [42] On the impact of heterogeneity-aware mesh partitioning and non-contributing computation removal on parallel reservoir simulations
    Thune, Andreas
    Cai, Xing
    Rustad, Alf Birger
    JOURNAL OF MATHEMATICS IN INDUSTRY, 2021, 11 (01)
  • [43] Bandwidth-Aware Data Placement Scheme for Hadoop
    Shabeera, T. P.
    Kumar, Madhu S. D.
    2013 IEEE RECENT ADVANCES IN INTELLIGENT COMPUTATIONAL SYSTEMS (RAICS), 2013, : 64 - 67
  • [44] HaaS: Cloud-based Real-time Data Analytics with Heterogeneity-aware Scheduling
    He, Jiong
    Chen, Yao
    Fu, Tom Z. J.
    Long, Xin
    Winslett, Marianne
    You, Liang
    Zhang, Zhenjie
    2018 IEEE 38TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2018, : 1017 - 1028
  • [45] A Source-aware Interrupt Scheduling for Modern Parallel I/O Systems
    Zou, Hongbo
    Sun, Xian-He
    Ma, Siyuan
    Duan, Xi
    2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2012, : 156 - 166
  • [46] Parallel I/O Aware Query Optimization
    Ghodsnia, Pedram
    Bowman, Ivan T.
    Nica, Anisoara
    SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2014, : 349 - 360
  • [47] Toward heterogeneity-aware device-to-device data dissemination over Wi-Fi networks
    Hamidouche, Lyes
    Monnet, Sebastien
    Sens, Pierre
    Refauvelet, Dimitri
    2017 IEEE 23RD INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2017, : 105 - 112
  • [48] AutoPipe-H: A Heterogeneity-Aware Data-Paralleled Pipeline Approach on Commodity GPU Servers
    Liu, Weijie
    Lu, Kai
    Lai, Zhiquan
    Li, Shengwei
    Ge, Keshi
    Li, Dongsheng
    Lu, Xicheng
    IEEE TRANSACTIONS ON COMPUTERS, 2025, 74 (04) : 1196 - 1209
  • [49] Heterogeneity-aware Group-based Semantic Overlay Network for P2P Systems
    Bo, Jin
    WISM: 2009 INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND MINING, PROCEEDINGS, 2009, : 701 - 704
  • [50] A HYBRID SHARED MEMORY EXECUTION MODEL FOR A DATA PARALLEL LANGUAGE WITH I/O
    Grelck, Clemens
    Kuthe, Steffen
    Scholz, Sven-Bodo
    PARALLEL PROCESSING LETTERS, 2008, 18 (01) : 23 - 37