A distributed multi-storage I/O system for data intensive scientific computing

被引:1
|
作者
Shen, XH [1 ]
Choudhary, A [1 ]
机构
[1] Northwestern Univ, Dept Elect & Comp Engn, Ctr Parallel & Distributed Comp, Evanston, IL 60208 USA
关键词
multi-storage I/O system; access pattern; data intensive computing;
D O I
10.1016/j.parco.2003.05.009
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
More and more parallel applications are running in a distributed environment to take advantage of easily available and inexpensive commodity resources. For data intensive applications, employing multiple distributed storage resources has many advantages. In this paper, we present a Multi-Storage I/O System (MS-I/O) that cannot only effectively manage various distributed storage resources in the system, but also provide novel high performance storage access schemes. MS-I/O employs many state-of-the-art I/O optimizations such as collective I/O, asynchronous I/O etc. and a number of new techniques such as data location, data replication, subfile, superfile and data access history. In addition, many MS-I/O optimization schemes can work simultaneously within a single data access session, greatly improving the performance. Although I/O optimization techniques can help improve performance, it also complicates I/O system. In addition, most optimization techniques have their limitations. Therefore, selecting accurate optimization policies requires expert knowledge which is not suitable for end users who may have little knowledge of I/O techniques. So the task of I/O optimization decision should be left to the I/O system itself, that is, automatic from user's point of view. We present a User Access Pattern data structure which is associated with each dataset that can help MS-I/O easily make accurate I/O optimization decisions. (C) 2003 Elsevier B.V. All rights reserved.
引用
收藏
页码:1623 / 1643
页数:21
相关论文
共 50 条
  • [21] Data-Intensive Scalable Computing for Scientific Applications
    Bryant, Randal E.
    COMPUTING IN SCIENCE & ENGINEERING, 2011, 13 (06) : 25 - 33
  • [22] Parallel data intensive computing in scientific and commercial applications
    Cannataro, M
    Talia, D
    Srimani, PK
    PARALLEL COMPUTING, 2002, 28 (05) : 673 - 704
  • [23] A Distributed Graph Data Storage and Computing Framework
    Zhou, Wei
    Gao, Yun
    Han, Jizhong
    Yue, Yinliang
    Xu, Zhiyong
    2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 896 - 899
  • [24] MataNui - A Distributed Storage Infrastructure for Scientific Data
    Kloss, Guy K.
    2013 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2013, 18 : 2607 - 2610
  • [25] Scalable Distributed Storage for Big Scientific Data
    Kokoulin, Andrey N.
    Yuzhakov, Aleksandr A.
    Kiryanov, Dmitriy A.
    PROCEEDINGS OF THE 2018 IEEE CONFERENCE OF RUSSIAN YOUNG RESEARCHERS IN ELECTRICAL AND ELECTRONIC ENGINEERING (EICONRUS), 2018, : 1099 - 1103
  • [26] Optimal Intraday Rolling Operation Strategy of Integrated Energy System with Multi-Storage
    Xu, Jieyan
    Chen, Zheng
    Hao, Tianyi
    Zhu, Shaojie
    Tang, Yu
    Liu, Haoming
    2018 2ND IEEE CONFERENCE ON ENERGY INTERNET AND ENERGY SYSTEM INTEGRATION (EI2), 2018,
  • [27] A Linear Programming Model for Optimizing about Compensative Regulation in Multi-Storage System
    Ge Jiuyan
    Fu Qian
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 12678 - 12681
  • [28] Data-Intensive Computing Modules for Teaching Parallel and Distributed Computing
    Gowanlock, Michael
    Gallet, Benoit
    2021 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2021, : 350 - 357
  • [29] Dual channel parallel I/O: A fast data access for scientific computing
    High Performance Computing Center, Institute of Applied Physics and Computational Mathematics, Beijing
    100094, China
    Jisuanji Xuebao, 5 (1035-1043):
  • [30] Optimized data storage algorithm of IoT based on cloud computing in distributed system
    Wang, Mingzhe
    Zhang, Qiuliang
    COMPUTER COMMUNICATIONS, 2020, 157 : 124 - 131