Detecting performance anomalies in large-scale software systems using entropy

被引：1

作者：

Malik, Haroon ^{[1
]}

Shakshuki, Elhadi M. ^{[2
]}

机构：

[1] Marshall Univ, Weisberg Div Comp Sci, Huntington, WV 25755 USA

[2] Acadia Univ, Jodrey Sch Comp Sci, Wolfville, NS, Canada

来源：

PERSONAL AND UBIQUITOUS COMPUTING | 2017年 / 21卷 / 06期

关键词：

Performance counters; Large-scale systems; Data center; Performance; Load test;

D O I：

10.1007/s00779-017-1036-y

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Large-scale software systems (LSSs) are composed of hundreds of subsystems that interact with each other in an unforeseen and complex ways. The operators of these LSSs strictly monitor thousands of metrics (performance counters) to quickly identify performance anomalies before a catastrophe. The existing monitoring tools and methodologies have not kept in pace with the rapid growth and inherit complexity of these LSSs; hence are ineffective in assisting practitioners to effectively pinpoint performance anomalies. We propose two methodologies that use entropy measure to assist practitioners/operators of LSSs in quickly detecting both system-wide and underlying localized subsystem anomalies. Our performance tests conducted on an open-source benchmark system reveal that the proposed methodologies are robust in pinpointing anomalies, do not require any domain knowledge to operate, and avoid information overload on practitioners.

引用

页码：1127 / 1137

页数：11

共 50 条

[11] An Information-Theoretic Approach to Detecting Performance Anomalies and Changes for Large-scale Distributed Web Services
Ozonat, Kivanc
2008 IEEE INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS & NETWORKS WITH FTCS & DCC, 2008, : 522 - 531
[12] A viable system structure for large-scale software systems
Deubler, HH
SOFTWARE-PRACTICE & EXPERIENCE, 1999, 29 (12): : 1025 - 1047
[13] Understanding large-scale software systems - structure and flows
Levy, Omer
Feitelson, Dror G.
EMPIRICAL SOFTWARE ENGINEERING, 2021, 26 (03)
[14] The large-scale structure of software-intensive systems
Booch, Grady
INTERFACE FOCUS, 2012, 2 (01) : 91 - 100
[15] Analyzing Large-scale OO Software by Joining Fractal and Entropy Measures
Ma, Zhiyi
2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE & COMPUTATIONAL INTELLIGENCE (CSCI), 2016, : 1310 - 1314
[16] Understanding large-scale software systems – structure and flows
Omer Levy
Dror G. Feitelson
Empirical Software Engineering, 2021, 26
[17] SOFTWARE RELIABILITY AND MAINTAINABILITY IN LARGE-SCALE SYSTEMS.
Strong III, Edward J.
1978, : 755 - 760
[18] Practical and representative faultloads for large-scale software systems
Costa, Pedro
Silva, Joao Gabriel
Madeira, Henrique
JOURNAL OF SYSTEMS AND SOFTWARE, 2015, 103 : 182 - 197
[19] Viable system structure for large-scale software systems
Deubler, Hanns-Helmuth
Software - Practice and Experience, 1999, 29 (12): : 1025 - 1047
[20] Managing the concurrent development of large-scale software systems
Aoyama, M
INTERNATIONAL JOURNAL OF TECHNOLOGY MANAGEMENT, 1997, 14 (6-8) : 739 - 765

← 1 2 3 4 5 →