PARALLEL PROCESSING OF LARGE DATA SETS IN PARTICLE PHYSICS

被引:0
|
作者
Rotaru, Marina [1 ]
Ciubancan, Mihai [1 ]
Stoicea, Gabriel [1 ]
机构
[1] Horia Hulubei Natl Inst Phys & Nucl Engn, Reactorului 30,POB MG6, RO-077125 Magurele, Romania
来源
ROMANIAN JOURNAL OF PHYSICS | 2016年 / 61卷 / 1-2期
关键词
particle physics; PROOF; protocols; data analysis;
D O I
暂无
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
The analysis of the LHC data aims to minimize the vast amounts of data and the number of observables used. After slimming and skimming the data, the remaining terabytes of ROOT files hold a selection of the events and a fiat structure for the variables needed that can be more easily inspected and traversed in the final stages of the analysis. PROOF has an efficient mechanism to distribute the analysis load by taking advantage of all the cores in modern CPUs through PROOF-Lite, PROOF Cluster or PROOF on Demand tools. In this paper we compared performance of different methods of file access (NFS, XROOTD, RFIO). The tests were done on Bucharest ATLAS Analysis Facility.
引用
收藏
页码:245 / 252
页数:8
相关论文
共 50 条
  • [41] Extracting large data sets using DB2 parallel edition
    Padmanabhan, S
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, 1996, : 582 - 582
  • [42] Parallel acceleration of CPU and GPU range queries over large data sets
    Mitchell Nelson
    Zachary Sorenson
    Joseph M. Myre
    Jason Sawin
    David Chiu
    Journal of Cloud Computing, 9
  • [43] Parallel acceleration of CPU and GPU range queries over large data sets
    Nelson, Mitchell
    Sorenson, Zachary
    Myre, Joseph M.
    Sawin, Jason
    Chiu, David
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2020, 9 (01):
  • [44] Massively parallel software rendering for visualizing large-scale data sets
    Ma, KL
    Parker, S
    IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2001, 21 (04) : 72 - 83
  • [45] P-AutoClass: Scalable parallel clustering for mining large data sets
    Pizzuti, C
    Talia, D
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2003, 15 (03) : 629 - 641
  • [46] A Multi-Agent Parallel Approach to Analyzing Large Climate Data Sets
    Woodring, Jason
    Sell, Matthew
    Fukuda, Munehiro
    Asuncion, Hazeline
    Salathe, Eric
    2017 IEEE 37TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2017), 2017, : 1639 - 1648
  • [47] A parallel algorithm for Bayesian network structure learning from large data sets
    Madsen, Anders L.
    Jensen, Frank
    Salmeron, Antonio
    Langseth, Helge
    Nielsen, Thomas D.
    KNOWLEDGE-BASED SYSTEMS, 2017, 117 : 46 - 55
  • [48] SOFTWARE ARCHITECTURE FOR PARALLEL PARTICLE TRACKING WITH THE DISTRIBUTION OF LARGE AMOUNT OF DATA
    Bonnier, Florent
    Emad, Nahid
    Juvigny, Xavier
    HIGH PERFORMANCE COMPUTING SYMPOSIUM (HPC 2018), 2018, 50 (04):
  • [49] Extension of Parallel Primitives and Their Applications to Large-Scale Data Processing
    Nakano, Masashi
    Chang, Qiong
    Miyazaki, Jun
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT II, DEXA 2024, 2024, 14911 : 248 - 253
  • [50] Particle Physics with the LHC Data
    Hagiwara, Kaoru
    GUT2012, 2012, 1467 : 86 - 92