PARALLEL PROCESSING OF LARGE DATA SETS IN PARTICLE PHYSICS

被引:0
|
作者
Rotaru, Marina [1 ]
Ciubancan, Mihai [1 ]
Stoicea, Gabriel [1 ]
机构
[1] Horia Hulubei Natl Inst Phys & Nucl Engn, Reactorului 30,POB MG6, RO-077125 Magurele, Romania
来源
ROMANIAN JOURNAL OF PHYSICS | 2016年 / 61卷 / 1-2期
关键词
particle physics; PROOF; protocols; data analysis;
D O I
暂无
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
The analysis of the LHC data aims to minimize the vast amounts of data and the number of observables used. After slimming and skimming the data, the remaining terabytes of ROOT files hold a selection of the events and a fiat structure for the variables needed that can be more easily inspected and traversed in the final stages of the analysis. PROOF has an efficient mechanism to distribute the analysis load by taking advantage of all the cores in modern CPUs through PROOF-Lite, PROOF Cluster or PROOF on Demand tools. In this paper we compared performance of different methods of file access (NFS, XROOTD, RFIO). The tests were done on Bucharest ATLAS Analysis Facility.
引用
收藏
页码:245 / 252
页数:8
相关论文
共 50 条
  • [1] Parallel Processing Strategies for Large SAR Image Data Sets in a Distributed Environment
    A. Goller
    Computing, 1999, 62 : 277 - 291
  • [2] Parallel processing strategies for large SAR image data sets in a distributed environment
    Goller, A
    COMPUTING, 1999, 62 (04) : 277 - 291
  • [3] Parallel visualization of large data sets
    Rosenberg, R
    Lanzagorta, M
    Chtchelkanova, A
    Khokhlov, A
    VISUAL DATA EXPLORATION AND ANALYSIS VII, 2000, 3960 : 135 - 143
  • [4] PCA for large data sets with parallel data summarization
    Ordonez, Carlos
    Mohanam, Naveen
    Garcia-Alvarado, Carlos
    DISTRIBUTED AND PARALLEL DATABASES, 2014, 32 (03) : 377 - 403
  • [5] PCA for large data sets with parallel data summarization
    Carlos Ordonez
    Naveen Mohanam
    Carlos Garcia-Alvarado
    Distributed and Parallel Databases, 2014, 32 : 377 - 403
  • [6] Parallel processing on large redundant biological data sets: Protein structures classification with CEPAR
    Pekurovsky, D
    Shindyalov, I
    Bourneb, P
    PARALLEL COMPUTING: SOFTWARE TECHNOLOGY, ALGORITHMS, ARCHITECTURES AND APPLICATIONS, 2004, 13 : 661 - 668
  • [7] Dealing with large particle counting data sets
    Ceronio, AD
    Haarhoff, J
    3RD WORLD WATER CONGRESS: DRINKING WATER TREATMENT, 2002, 2 (5-6): : 35 - 40
  • [8] Modern technologies for processing of large data sets
    Valova, Ivanka
    Noirhomme-Fraiture, Monique
    COMPTES RENDUS DE L ACADEMIE BULGARE DES SCIENCES, 2008, 61 (04): : 513 - 524
  • [9] Querying large physics data sets over an information grid
    Baker, N
    Brooks, P
    Kovacs, Z
    Le Goff, JM
    McClatchey, R
    PROCEEDINGS OF CHEP 2001, 2001, : 663 - 667
  • [10] SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets
    Chaiken, Ronnie
    Jenkins, Bob
    Larson, Per-Ake
    Ramsey, Bill
    Shakib, Darren
    Weaver, Simon
    Zhou, Jingren
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (02): : 1265 - 1276