Detecting performance anomalies in large-scale software systems using entropy

Cited by: 1
Authors
Malik, Haroon [1 ]
Shakshuki, Elhadi M. [2 ]
Affiliations
[1] Marshall Univ, Weisberg Div Comp Sci, Huntington, WV 25755 USA
[2] Acadia Univ, Jodrey Sch Comp Sci, Wolfville, NS, Canada
Keywords
Performance counters; Large-scale systems; Data center; Performance; Load test
DOI
10.1007/s00779-017-1036-y
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Large-scale software systems (LSSs) are composed of hundreds of subsystems that interact with each other in unforeseen and complex ways. The operators of these LSSs closely monitor thousands of metrics (performance counters) to quickly identify performance anomalies before a catastrophe occurs. Existing monitoring tools and methodologies have not kept pace with the rapid growth and inherent complexity of these LSSs and are therefore ineffective in helping practitioners pinpoint performance anomalies. We propose two methodologies that use an entropy measure to assist practitioners and operators of LSSs in quickly detecting both system-wide anomalies and the underlying localized subsystem anomalies. Our performance tests, conducted on an open-source benchmark system, reveal that the proposed methodologies are robust in pinpointing anomalies, do not require any domain knowledge to operate, and avoid information overload on practitioners.
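The abstract does not spell out the two methodologies, so the following is only an illustrative sketch of the general idea of applying an entropy measure to performance counters, not the authors' method. It assumes Shannon entropy over fixed-width bins of counter samples; the function names, the bin width, and the sample CPU values are all hypothetical.

```python
import math
from collections import Counter


def shannon_entropy(samples, bin_width=10.0):
    """Shannon entropy (in bits) of counter samples bucketed into fixed-width bins.

    Assumption: fixed-width binning is an illustrative choice, not taken from the paper.
    """
    if not samples:
        return 0.0
    bins = Counter(math.floor(s / bin_width) for s in samples)
    total = len(samples)
    return -sum((n / total) * math.log2(n / total) for n in bins.values())


def entropy_shift(baseline, current, bin_width=10.0):
    """Absolute change in entropy between a baseline window and a current window."""
    return abs(shannon_entropy(current, bin_width) - shannon_entropy(baseline, bin_width))


# Hypothetical CPU-utilization samples (percent) for one counter.
baseline_cpu = [41, 43, 42, 44, 45, 43, 42, 44]  # steady load, all in one bin
current_cpu = [41, 43, 95, 12, 44, 88, 5, 67]    # erratic load, spread over many bins

shift = entropy_shift(baseline_cpu, current_cpu)
print(f"entropy shift: {shift:.3f} bits")
```

A large shift suggests that the distribution of the counter has changed between the two windows. Any threshold for flagging an anomaly would have to be chosen empirically, and the abstract gives no such value.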
Pages: 1127-1137
Page count: 11
Related papers
50 in total
  • [1] Detecting performance anomalies in large-scale software systems using entropy
    Haroon Malik
    Elhadi M. Shakshuki
    Personal and Ubiquitous Computing, 2017, 21 : 1127 - 1137
  • [2] Detecting Discontinuities in Large-Scale Systems
    Malik, Haroon
    Davis, Ian J.
    Godfrey, Michael W.
    Neuse, Douglas
    Mankovskii, Serge
    2014 IEEE/ACM 7TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC), 2014, : 345 - 354
  • [3] USING PERFORMANCE MODELING TO DESIGN LARGE-SCALE SYSTEMS
    Barker, Kevin J.
    Davis, Kei
    Hoisie, Adolfy
    Kerbyson, Darren J.
    Lang, Michael
    Pakin, Scott
    Carlos Sancho, Jose
    COMPUTER, 2009, 42 (11) : 42 - 49
  • [4] The complexity nature of large-scale software systems
    Yan Dong
    Qi Guo-Ning
    Gu Xin-Jian
    CHINESE PHYSICS, 2006, 15 (11): : 2489 - 2495
  • [5] Architectural Complexity of Large-Scale Software Systems
    Lilienthal, Carola
    13TH EUROPEAN CONFERENCE ON SOFTWARE MAINTENANCE AND REENGINEERING: CSMR 2009, PROCEEDINGS, 2009, : 17 - 26
  • [6] Cooperative work in large-scale software systems
    Belkhatir, N
    Estublier, J
    Melo, WL
    JOURNAL OF SOFTWARE MAINTENANCE-RESEARCH AND PRACTICE, 1994, 6 (06): : 319 - 335
  • [7] ACHIEVING RELIABILITY IN LARGE-SCALE SOFTWARE SYSTEMS
    SCHICK, GJ
    WOLVERTON, RW
    PROCEEDINGS ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM, 1974, 7 (02): : 302 - 319
  • [8] Architectural complexity of large-scale software systems
    Software Engineering Group, University of Hamburg, C1 WPS GmbH, Germany
    Proc. Eur. Conf. Software Maint. Reeng., (17-26):
  • [9] Engineering Large-Scale Observation Software Systems
    Lamb, David
    Randles, Martin
    Taleb-Bendiab, A.
    2009 SECOND INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING (DESE 2009), 2009, : 266 - 272
  • [10] Load Testing Large-Scale Software Systems
    Jiang, Zhen Ming
    2015 IEEE/ACM 37th IEEE International Conference on Software Engineering, Vol 2, 2015, : 955 - 956