Matchmaking: Distributed resource management for high throughput computing

被引:133
|
作者
Raman, R [1 ]
Livny, M [1 ]
Solomon, M [1 ]
机构
[1] Univ Wisconsin, Madison, WI 53703 USA
关键词
D O I
10.1109/HPDC.1998.709966
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Conventional resource management systems use a system model to describe resources and a centralized scheduler to control their allocation. We argue that this paradigm does not adapt well to distributed systems, particularly those built to support high-throughput computing. Obstacles include heterogeneity of resources, which make uniform allocation algorithms difficult to formulate, and distributed ownership, leading to widely varying allocation policies. Faced with these problems, we developed and implemented the classified advertisement (classad) matchmaking framework, a flexible and general approach to resource management in distributed environment with decentralized ownership of resources. Novel aspects of the framework include a semi-structured data model that combines schema, data, and query in a simple but powerful specification language, and a clean separation of the matching and claiming phases of resource allocation. The representation and protocols result in a robust, scalable and flexible framework that can evolve with changing resources. The framework was designed to solve real problems encountered in the deployment of Condor a high throughput computing system developed at the University of Wisconsin-Madison. Condor is heavily used by scientists at numerous sites around the world. It derives much of its robustness and efficiency from the matchmaking architecture.
引用
收藏
页码:140 / 146
页数:3
相关论文
共 50 条
  • [41] A Resource Management Model for Distributed Multi-Task Applications in Fog Computing Networks
    Hosseinpour, Farhoud
    Naebi, Ahmad
    Virtanen, Seppo
    Pahikkala, Tapio
    Tenhunen, Hannu
    Plosila, Juha
    IEEE ACCESS, 2021, 9 (09): : 152792 - 152802
  • [42] Optimized Distributed Resource Management in Fog Computing by Using Ant-Colony Optimization
    Mirtaheri, Seyedeh Leili
    Shirzad, Hamid Reza
    FUTURE TRENDS OF HPC IN A DISRUPTIVE SCENARIO, 2019, 34 : 206 - 219
  • [43] An efficient methodology for resource discovery on distributed computing
    Shang, Qinghong
    Zhou, Mingtian
    Fukuda, Munehiro
    Bic, Lubomir
    Dillencourt, Michael B.
    Advances in Information Sciences and Service Sciences, 2012, 4 (10): : 434 - 441
  • [44] A New Distributed Strategy to Schedule Computing Resource
    Wang, Qi
    Deng, Pan
    Yang, Qinghong
    Yuan, Wei
    Nie, Yaolong
    Bi, Chaofan
    Chao, Han-Chieh
    CLOUD COMPUTING (CLOUDCOMP 2014), 2015, 142 : 216 - 224
  • [45] Resource allocation to conserve energy in distributed computing
    Lynar, Timothy M.
    Herbert, Ric D.
    Simon
    Chivers, William J.
    INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2011, 2 (01) : 1 - 10
  • [46] The Gain of Resource Delegation in Distributed Computing Environments
    Foelling, Alexander
    Grimme, Christian
    Lepping, Joachim
    Papaspyrou, Alexander
    JOB SCHEDULING STRATEGIES FOR PARALLEL PROCESSING, 2010, 6253 : 77 - 92
  • [47] Global Resource Scheduling for Distributed Edge Computing
    Tan, Aiping
    Li, Yunuo
    Wang, Yan
    Yang, Yujie
    APPLIED SCIENCES-BASEL, 2023, 13 (22):
  • [48] Reactive framework for resource aware distributed computing
    Gupta, R
    Shyamasundar, RK
    ADVANCES IN COMPUTER SCIENCE - ASIAN 2004, PROCEEDINGS, 2004, 3321 : 452 - 467
  • [49] Proactive Resource Management for Failure Resilient High Performance Computing Clusters
    Fu, Song
    Xu, Cheng-Zhong
    2009 INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY, AND SECURITY (ARES), VOLS 1 AND 2, 2009, : 257 - +
  • [50] Reliability-oriented resource management for High-Performance Computing
    Massari, Giuseppe
    Peta, Miriam
    Campi, Alessandro
    Reghenzani, Federico
    Terraneo, Federico
    Agosta, Giovanni
    Fornaciari, William
    Ciesielski, Sebastian
    Kulczewski, Michal
    Piatek, Wojciech
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2023, 39