Federated data storage and management infrastructure

被引:4
|
作者
Zarochentsev, A. [1 ]
Kiryanov, A. [2 ,3 ]
Klimentov, A. [3 ,4 ]
Krasnopevtsev, D. [3 ,5 ]
Hristov, P. [6 ]
机构
[1] St Petersburg State Univ, St Petersburg, Russia
[2] Petersburg Nucl Phys Inst, Gatchina, Leningrad Oblas, Russia
[3] Natl Res Ctr, Kurchatov Inst, Moscow, Russia
[4] Brookhaven Natl Lab, Upton, NY 11973 USA
[5] Natl Res Nucl Univ MEPhI, Moscow, Russia
[6] CERN, European Ctr Nucl Res, Geneva, Switzerland
关键词
D O I
10.1088/1742-6596/762/1/012016
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The Large Hadron Collider (LHC), operating at the international CERN Laboratory in Geneva, Switzerland, is leading Big Data driven scientific explorations. Experiments at the LHC explore the fundamental nature of matter and the basic forces that shape our universe. Computing models for the High Luminosity LHC era anticipate a growth of storage needs of at least orders of magnitude; it will require new approaches in data storage organization and data handling. In our project we address the fundamental problem of designing of architecture to integrate a distributed heterogeneous disk resources for LHC experiments and other data intensive science applications and to provide access to data from heterogeneous computing facilities. We have prototyped a federated storage for Russian T1 and T2 centers located in Moscow, St.-Petersburg and Gatchina, as well as Russian / CERN federation. We have conducted extensive tests of underlying network infrastructure and storage endpoints with synthetic performance measurement tools as well as with HENP-specific workloads, including the ones running on supercomputing platform, cloud computing and Grid for ALICE and ATLAS experiments. We will present our current accomplishments with running LHC data analysis remotely and locally to demonstrate our ability to efficiently use federated data storage experiment wide within National Academic facilities for High Energy and Nuclear Physics as well as for other data-intensive science applications, such as bio-infomatics.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] INFRASTRUCTURE FOR METAGENOME DATA MANAGEMENT AND ANALYSIS
    Tatusova, Tatiana
    BIOINFORMATICS 2011, 2011, : 357 - 362
  • [32] A Data Management Infrastructure for Bridge Monitoring
    Jeong, Seongwoon
    Byun, Jaewook
    Kim, Daeyoung
    Sohn, Hoon
    Bae, In Hwan
    Law, Kincho H.
    SENSORS AND SMART STRUCTURES TECHNOLOGIES FOR CIVIL, MECHANICAL, AND AEROSPACE SYSTEMS 2015, 2015, 9435
  • [33] DaltOn:: An infrastructure for scientific data management
    Jablonski, Stefan
    Cure, Olivier
    Rehman, M. Abdul
    Volz, Bernhard
    COMPUTATIONAL SCIENCE - ICCS 2008, PT 3, 2008, 5103 : 520 - +
  • [34] Data Management Infrastructure for the Mobile Web
    Jensen, Christian S.
    2009 FIFTH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRID (SKG 2009), 2009, : 1 - 1
  • [35] FedDS: Data Selection for Streaming Federated Learning with Limited Storage
    Wei, Yongquan
    Wang, Xijun
    Guo, Kun
    Yang, Howard H.
    Chen, Xiang
    2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
  • [36] A Storage Infrastructure for Heterogeneous and Multimedia Data in the Internet of Things
    Di Francesco, Mario
    Li, Na
    Raj, Mayank
    Das, Sajal K.
    2012 IEEE INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND COMMUNICATIONS, CONFERENCE ON INTERNET OF THINGS, AND CONFERENCE ON CYBER, PHYSICAL AND SOCIAL COMPUTING (GREENCOM 2012), 2012, : 26 - 33
  • [37] Storage and Analysis Infrastructure for Data Acquisition Systems with High Data Rates
    Sutter, M.
    Jejkal, T.
    Stotzka, R.
    Hartmann, V.
    Hardt, M.
    REMOTE INSTRUMENTATION SERVICES ON THE E-INFRASTRUCTURE: APPLICATIONS AND TOOLS, 2011, : 85 - 102
  • [38] Management of archival materials with Linked Data and federated queries
    Hidalgo-Delgado, Yusniel
    Senso, Jose A.
    Leiva-Mederos, Amed
    Hipola, Pedro
    REVISTA ESPANOLA DE DOCUMENTACION CIENTIFICA, 2016, 39 (03):
  • [39] Data Provenance Management of Bioinformatics Workflows in Federated Clouds
    Wercelens, Polyane
    da Silva, Waldeyr
    Castro, Klayton
    Araujo, Aleteia P. F.
    Lifschitz, Sergio
    Holanda, Maristela
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 750 - 754
  • [40] Integrating Identity Management With Federated Healthcare Data Models
    Hu, Jun
    Peyton, Liam
    E-TECHNOLOGIES-INNOVATION IN AN OPEN WORLD, 2009, 26 : 100 - +