An Intelligent Cache Management for Data Analysis at CMS

Cited by: 2
Authors
Tracolli, Mirco [1, 2, 3]
Baioletti, Marco [1]
Ciangottini, Diego [3]
Poggioni, Valentina [1]
Spiga, Daniele [3]
Affiliations
[1] Univ Perugia, Perugia, Italy
[2] Univ Firenze, Florence, Italy
[3] Ist Nazl Fis Nucl, Sez Perugia, Perugia, Italy
Keywords
Cache; Optimization; LRU; Intelligent system; Big data; Data science workflow
DOI
10.1007/978-3-030-58802-1_24
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
In this work, we explore a score-based approach to managing a cache system. With the proposed method, the cache can better discriminate among incoming requests and improve overall performance. We created a score-based discriminator that uses file statistics. The score represents the weight of a file. We tested several functions for computing the file weight, which determines whether or not a file is stored in the cache. We developed the solution by experimenting on a real cache manager, XCache, which is used within the Compact Muon Solenoid (CMS) data analysis workflow. The aim of this work is to reduce the maintenance costs of the cache system without compromising the user experience.
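The score-based admission idea described in the abstract can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration: the `FileStats` fields, the `file_weight` function (requests per megabyte), the `min_weight` threshold, and the LRU eviction order are hypothetical stand-ins, not the weight functions tested in the paper and not the XCache API.

```python
from collections import OrderedDict
from dataclasses import dataclass


@dataclass
class FileStats:
    """Per-file statistics (hypothetical fields, chosen for illustration)."""
    size_mb: float
    num_requests: int = 0


def file_weight(stats: FileStats) -> float:
    """Illustrative weight: requests per megabyte.

    This is an assumed function, not one of the functions from the paper.
    """
    return stats.num_requests / stats.size_mb


class ScoreBasedCache:
    """Cache that admits a file only if its weight reaches a threshold."""

    def __init__(self, capacity_mb: float, min_weight: float):
        self.capacity_mb = capacity_mb
        self.min_weight = min_weight  # assumed admission threshold
        self.used_mb = 0.0
        self.stats = {}             # file name -> FileStats, kept for all files seen
        self.files = OrderedDict()  # file name -> size, oldest entry first (LRU)

    def request(self, name: str, size_mb: float) -> bool:
        """Record a request and return True on a cache hit."""
        stats = self.stats.setdefault(name, FileStats(size_mb))
        stats.num_requests += 1

        if name in self.files:
            self.files.move_to_end(name)  # mark as most recently used
            return True

        # Miss: admit only if the file fits and its weight is high enough.
        if size_mb <= self.capacity_mb and file_weight(stats) >= self.min_weight:
            while self.used_mb + size_mb > self.capacity_mb:
                _, evicted_size = self.files.popitem(last=False)  # evict LRU file
                self.used_mb -= evicted_size
            self.files[name] = size_mb
            self.used_mb += size_mb
        return False


cache = ScoreBasedCache(capacity_mb=100, min_weight=0.04)
first = cache.request("a.root", 40)   # miss, weight 0.025 is below threshold
second = cache.request("a.root", 40)  # miss, weight 0.05 admits the file
third = cache.request("a.root", 40)   # hit
```

In this sketch a file requested only once is never written to the cache, which is one way an admission score can reduce write traffic relative to a plain LRU policy that stores every missed file.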
Pages: 320-332
Number of pages: 13