Cluman: Advanced cluster management for the large-scale infrastructures

被引:0
|
作者
Babik, Marian [1 ]
Fedorko, Ivan [1 ]
Rodrigues, David [1 ]
机构
[1] CERN, European Org Nucl Res, CH-1211 Geneva 23, Switzerland
关键词
D O I
10.1088/1742-6596/331/5/052002
中图分类号
O57 [原子核物理学、高能物理学];
学科分类号
070202 ;
摘要
The recent uptake of multi-core computing has produced a rapid growth of virtualisation and cloud computing services. With the increased use of the many-core processors this trend will likely accelerate and computing centres will be faced with the management of the tens of thousands of the virtual machines. Furthermore, these machines will likely be geographically distributed and need to be allocated on demand. In order to cope with such complexity we have designed and developed an advanced cluster management system that can execute administrative tasks targeting thousands of machines as well as provide an interactive high-density visualisation of the fabrics. The job management subsystem can perform complex tasks while following their progress and output and report aggregated information back to the system administrators. The visualisation subsystem can display tree maps of the infrastructure elements with data and monitoring information, thus providing a very detailed overview of the large clusters at a glance. The initial experience with development and testing of the system will be presented as well as an evaluation of its performance.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] CluMan - Cluster Management toolsuit
    Siket, Miroslav
    Babik, Marian
    Lopienski, Sebastian
    Manana, Filipe David Borba
    17TH INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP09), 2010, 219
  • [2] Large-scale Cluster Management at Google with Borg
    Zeng, Fengsheng
    2017 6TH EEM INTERNATIONAL CONFERENCE ON EDUCATION SCIENCE AND SOCIAL SCIENCE (EEM-ESSS 2017), 2017, 104 : 76 - 80
  • [3] Cybersecurity governance in large-scale infrastructures
    Stoleriu, Razvan
    Petre, Ionut
    Pop, Florin
    ROMANIAN JOURNAL OF INFORMATION TECHNOLOGY AND AUTOMATIC CONTROL-REVISTA ROMANA DE INFORMATICA SI AUTOMATICA, 2025, 35 (01): : 51 - 66
  • [4] Autonomous and Energy-Aware Management of Large-Scale Cloud Infrastructures
    Feller, Eugen
    Morin, Christine
    2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, : 2542 - 2545
  • [5] A novel management architecture for large-scale server cluster
    Xue, Zhenghua
    Dong, Xiaoshe
    Fan, Shengqun
    2008 IEEE 8TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2008, : 273 - 278
  • [6] Cost-Sensitive Security Risk Management for Large-Scale Computing Infrastructures
    Master, Neal
    Bambos, Nicholas
    2017 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), 2016, : 115 - 119
  • [7] LIFECYCLE MANAGEMENT, MONITORING AND ASSESSMENT FOR SAFE LARGE-SCALE INFRASTRUCTURES: CHALLENGES AND NEEDS
    Limongelli, M. P.
    Previtali, M.
    Cantini, L.
    Carosio, S.
    Matos, J. C.
    Isoird, J. M.
    Wenzel, H.
    Pellegrino, C.
    2ND INTERNATIONAL CONFERENCE OF GEOMATICS AND RESTORATION (GEORES 2019), 2019, 42-2 (W11): : 727 - 734
  • [8] Paving the road toward Smart Grids through large-scale advanced metering infrastructures
    Lopez, G.
    Moreno, J. I.
    Amaris, H.
    Salazar, F.
    ELECTRIC POWER SYSTEMS RESEARCH, 2015, 120 : 194 - 205
  • [9] Large-Scale Pairwise Sequence Alignments on a Large-Scale GPU Cluster
    Savran, Ibrahim
    Gao, Yang
    Bakos, Jason D.
    IEEE DESIGN & TEST, 2014, 31 (01) : 51 - 61
  • [10] Institutionalized Materials Research Based on Large-Scale Scientific Infrastructures——Preface of Special Issue for Materials Research with Large-Scale Scientific Infrastructures
    Li, Dianzhong
    Li, Bing
    Jinshu Xuebao/Acta Metallurgica Sinica, 2024, 60 (08):