An Intelligent Cache Management for Data Analysis at CMS

被引:2
|
作者
Tracolli, Mirco [1 ,2 ,3 ]
Baioletti, Marco [1 ]
Ciangottini, Diego [3 ]
Poggioni, Valentina [1 ]
Spiga, Daniele [3 ]
机构
[1] Univ Perugia, Perugia, Italy
[2] Univ Firenze, Florence, Italy
[3] Ist Nazl Fis Nucl, Sez Perugia, Perugia, Italy
关键词
Cache; Optimization; LRU; Intelligent system; Big data; Data science workflow;
D O I
10.1007/978-3-030-58802-1_24
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this work, we explore a score-based approach to manage a cache system. With the proposed method, the cache can better discriminate the input requests and improve the overall performances. We created a score based discriminator using the file statistics. The score represents the weight of a file. We tested several functions to compute the file weight used to determine whether a file has to be stored in the cache or not. We developed a solution experimenting on a real cache manager named XCache, that is used within the Compact Muon Solenoid (CMS) data analysis workflow. The aim of this work is optimizing to reduce maintaining costs of the cache system without compromising the user experience.
引用
收藏
页码:320 / 332
页数:13
相关论文
共 50 条
  • [1] Intelligent cache management for mobile data warehouse systems
    Huang, SM
    Lin, BS
    Deng, QS
    JOURNAL OF DATABASE MANAGEMENT, 2005, 16 (02) : 46 - 65
  • [2] Intelligent Buffer cache management in multimedia data retrieval
    Ryu, Y
    Cho, K
    Won, Y
    Koh, K
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2002, 2366 : 462 - 471
  • [3] SortCache: Intelligent Cache Management for Accelerating Sparse Data Workloads
    Srikanth, Sriseshan
    Jain, Anirudh
    Conte, Thomas M.
    Debenedictis, Erik P.
    Cook, Jeanine
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2021, 18 (04)
  • [4] AsyncStageOut: Distributed user data management for CMS Analysis
    Riahi, H.
    Wildish, T.
    Ciangottini, D.
    Hernandez, J. M.
    Andreeva, J.
    Balcas, J.
    Karavakis, E.
    Mascheroni, M.
    Tanasijczuk, A. J.
    Vaandering, E. W.
    21ST INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP2015), PARTS 1-9, 2015, 664
  • [5] AsyncStageOut: Distributed user data management for CMS analysis
    20161402188307
    (1) European Organization for Nuclear Research, IT Department, Geneva; CH-1211-23, Switzerland; (2) Princeton University, Princeton; NJ; 08544, United States; (3) Universitá and INFN Perugia, Via Alessandro Pascoli, Perugia; 06123, Italy; (4) CIEMAT, Madrid; 28040, Spain; (5) DiSCC, Vilnius University, Vilnius; LT-01513, Lithuania; (6) INFN Milano-Bicocca, Piazza della Scienza, Milan 3; I-20126, Italy; (7) University of California, San Diego; CA; 92093-0354, United States; (8) Fermi National Laboratory, Batavia; IL; 60510, United States, 1600, (IOP Publishing Ltd):
  • [6] BIG DATA PSYCHOLOGICAL ANALYSIS BASED ON CACHE MANAGEMENT
    Ma, Y.
    Chen, Y. F.
    Su, J. J.
    Zou, L. D.
    Guo, Z. H.
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2016, 118 : 70 - 70
  • [7] The CMS Data Management System
    Giffels, M.
    Guo, Y.
    Kuznetsov, V.
    Magini, N.
    Wildish, T.
    20TH INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP2013), PARTS 1-6, 2014, 513
  • [8] Design of an Intelligent Data Cache with Replacement Policy
    Begum, B. Shameedha
    Ramasubramanian, N.
    INTERNATIONAL JOURNAL OF EMBEDDED AND REAL-TIME COMMUNICATION SYSTEMS (IJERTCS), 2019, 10 (02): : 87 - 107
  • [9] WATCHMAN: A data warehouse intelligent cache manager
    Scheuermann, P
    Shim, JH
    Vingralek, R
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, 1996, : 51 - 62
  • [10] Efficient cooperative cache management for latency-aware data intelligent processing in edge environment
    Li, Chunlin
    Liu, Jun
    Zhang, Qingchuan
    Luo, Youlong
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 123 : 48 - 67