Lightweight Online Performance Monitoring and Tuning with Embedded Gossip

被引:1
|
作者
Zhu, Julie Wenbin [1 ]
Bridges, Patrick G. [2 ]
Maccabe, Arthur B. [2 ]
机构
[1] Xilinx Inc, Albuquerque, NM 87109 USA
[2] Univ New Mexico, Dept Comp Sci, Albuquerque, NM 87131 USA
关键词
Lightweight performance monitoring; dynamic performance tuning; support for adaptation; parallel systems;
D O I
10.1109/TPDS.2008.126
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Understanding and tuning the performance of large-scale long-running applications is difficult, with both standard trace-based and statistical methods having substantial shortcomings that limit their usefulness. This paper describes a new performance monitoring approach called Embedded Gossip (EG) designed to enable lightweight online performance monitoring and tuning. EG works by piggybacking performance information on existing messages and performing information correlation online, giving each process in a parallel application a weakly consistent global view of the behavior of the entire application. To demonstrate the viability of EG, this paper presents the design and experimental evaluation of two different online monitoring systems and an online global adaptation system driven by Embedded Gossiping. In addition, we present a metric system for evaluating the suitability of an application to EG-based monitoring and adaptation, a general architecture for implementing EG-based monitoring systems, and a modified global commit algorithm appropriate for use in EG-based global adaptation systems. Together, these results demonstrate that EG is an efficient low-overhead approach for addressing a wide range of parallel performance monitoring tasks and that results from these systems can effectively drive online global adaptation.
引用
收藏
页码:1038 / 1049
页数:12
相关论文
共 50 条
  • [31] Hardware Observability Framework for Minimally Intrusive Online Monitoring of Embedded Systems
    Lee, Jong Chul
    Gardner, Andrew S.
    Lysecky, Roman
    18TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON ENGINEERING OF COMPUTER BASED SYSTEMS (ECBS 2011), 2011, : 52 - 60
  • [32] An embedded lightweight GUI component library and ergonomics optimization method for industry process monitoring
    Tan, Da-peng
    Chen, Shu-ting
    Bao, Guan-jun
    Zhang, Li-bin
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2018, 19 (05) : 604 - 625
  • [33] An Embedded Estimator for Online Harmonic Monitoring in Power-Electronic Grids
    Jin, Zongshuai
    Zhang, Hengxu
    Terzija, Vladimir
    IEEE TRANSACTIONS ON SMART GRID, 2022, 13 (06) : 4677 - 4689
  • [34] Online monitoring of cracking in concrete structures using embedded piezoelectric transducers
    Dumoulin, C.
    Karaiskos, G.
    Sener, J-Y
    Deraemaeker, A.
    SMART MATERIALS AND STRUCTURES, 2014, 23 (11)
  • [35] Embedded nanolamps in electrospun nanofibers enabling online monitoring and ratiometric measurements
    Buchner, Markus
    Ngoensawat, Umphan
    Schenck, Milena
    Fenzl, Christoph
    Wongkaew, Nongnoot
    Matlock-Colangelo, Lauren
    Hirsch, Thomas
    Duerkop, Axel
    Baeumner, Antje J.
    JOURNAL OF MATERIALS CHEMISTRY C, 2017, 5 (37) : 9712 - 9720
  • [36] Analysis of gossip performance with copulas
    Weber, Steven
    Veeraraghavan, Vilas
    Kini, Ananth
    Singhal, Nikhil
    2006 40TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS, VOLS 1-4, 2006, : 1212 - 1217
  • [37] Enabling Lightweight Network Performance Monitoring and Troubleshooting in Data Center
    Xun, Qinglin
    Li, Weichao
    Guo, Haorui
    Wang, Yi
    IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (IEEE INFOCOM WKSHPS 2021), 2021,
  • [38] Online Performance Monitoring of Neuromorphic Computing Systems
    Mishra, Abhishek Kumar
    Das, Anup
    Kandasamy, Nagarajan
    2023 IEEE EUROPEAN TEST SYMPOSIUM, ETS, 2023,
  • [39] Online optimizations driven by hardware performance monitoring
    Schneider, Florian T.
    Payer, Mathias
    Gross, Thomas R.
    ACM SIGPLAN NOTICES, 2007, 42 (06) : 373 - 382
  • [40] Online Monitoring System for Performance Fault Detection
    Gioiosa, Roberto
    Kestor, Gokcen
    Kerbyson, Darren J.
    PROCEEDINGS OF 2014 IEEE INTERNATIONAL PARALLEL & DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2014, : 1476 - 1484