Lightweight Online Performance Monitoring and Tuning with Embedded Gossip

被引:1
|
作者
Zhu, Julie Wenbin [1 ]
Bridges, Patrick G. [2 ]
Maccabe, Arthur B. [2 ]
机构
[1] Xilinx Inc, Albuquerque, NM 87109 USA
[2] Univ New Mexico, Dept Comp Sci, Albuquerque, NM 87131 USA
关键词
Lightweight performance monitoring; dynamic performance tuning; support for adaptation; parallel systems;
D O I
10.1109/TPDS.2008.126
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Understanding and tuning the performance of large-scale long-running applications is difficult, with both standard trace-based and statistical methods having substantial shortcomings that limit their usefulness. This paper describes a new performance monitoring approach called Embedded Gossip (EG) designed to enable lightweight online performance monitoring and tuning. EG works by piggybacking performance information on existing messages and performing information correlation online, giving each process in a parallel application a weakly consistent global view of the behavior of the entire application. To demonstrate the viability of EG, this paper presents the design and experimental evaluation of two different online monitoring systems and an online global adaptation system driven by Embedded Gossiping. In addition, we present a metric system for evaluating the suitability of an application to EG-based monitoring and adaptation, a general architecture for implementing EG-based monitoring systems, and a modified global commit algorithm appropriate for use in EG-based global adaptation systems. Together, these results demonstrate that EG is an efficient low-overhead approach for addressing a wide range of parallel performance monitoring tasks and that results from these systems can effectively drive online global adaptation.
引用
收藏
页码:1038 / 1049
页数:12
相关论文
共 50 条
  • [41] Online performance monitoring and diagnosis of multivariate systems
    Moghbeli, Neshat
    Poshtan, Javad
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING, 2021, 235 (04) : 461 - 473
  • [42] Online Optimizations Driven by Hardware Performance Monitoring
    Schneider, Florian T.
    Payer, Mathias
    Gross, Thomas R.
    PLDI'07: PROCEEDINGS OF THE 2007 ACM SIGPLAN CONFERENCE ON PROGRAMMING LANGUAGE DESIGN AND IMPLEMENTATION, 2007, : 373 - 382
  • [43] Online Monitoring Systems for Performance Fault Detection
    Gioiosa, Roberto
    kestor, Gokcen
    Kerbyson, Darren J.
    PARALLEL PROCESSING LETTERS, 2014, 24 (04)
  • [44] A practical approach of online control performance monitoring
    Meng Qing-wei
    Zhong Zhen-fang
    Liu Ji-zhen
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2015, 142 : 107 - 116
  • [45] An Online Performance Monitoring Method for Analog Circuit
    Zhang, Aihua
    Huang, Kailun
    Huo, Xing
    Zhang, Zhiqiang
    2016 IEEE 14TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2016, : 452 - 456
  • [46] Dynamic behavior of high performance lightweight structures with composite-embedded actuators
    Hufenbach, W
    Täger, O
    Kroll, L
    Dannemann, M
    Proceedings of 2005 International Conference on Advanced Fibers and Polymer Materials (ICAFPM 2005), Vol 1 and 2: NEW CENTURY , NEW MATERIALS AND NEW LIFE, 2005, : 360 - 364
  • [47] Lightweight services for embedded systems
    Milanovic, N
    Richling, J
    Malek, M
    SECOND IEEE WORKSHOP ON SOFTWARE TECHNOLOGIES FOR FUTURE EMBEDDED AND UBIQUITOUS SYSTEMS, PROCEEDINGS, 2004, : 40 - 44
  • [48] Performance monitoring and assessment of embedded HW/SW systems
    Calvez, JP
    Pasquier, O
    DESIGN AUTOMATION FOR EMBEDDED SYSTEMS, 1998, 3 (01) : 5 - 22
  • [49] The use of embedded stress cells for monitoring pavement performance
    Gonçalves, FP
    Ceratti, JAP
    Bica, AVD
    GEOTECHNICAL TESTING JOURNAL, 2003, 26 (04): : 363 - 372
  • [50] Performance Monitoring and Assessment of Embedded HW/SW Systems
    J. P. Calvez
    O. Pasquier
    Design Automation for Embedded Systems, 1998, 3 : 5 - 22