ViSRE: A Unified Visual Analysis Dashboard for Proactive Cloud Outage Management

被引:2
|
作者
Kayongo, Paula [1 ]
Hoffswell, Jane [2 ]
Saini, Shiv [3 ]
Garg, Shaddy [3 ]
Koh, Eunyee [4 ]
Wang, Haoliang [4 ]
Jacobs, Tom [4 ]
机构
[1] Northwestern Univ, Evanston, IL 60208 USA
[2] Adobe Res, Washington, DC USA
[3] Adobe Res, Bangalore, Karnataka, India
[4] Adobe Res, Santa Clara, CA USA
关键词
Cloud Outage Prediction; Root Cause Analysis; Software Visualization;
D O I
10.1109/VISSOFT55257.2022.00010
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Efficient outage detection and remediation is crucial for effectively operating cloud computing systems. To remediate outages, system engineers must quickly identify the causal relationships between metrics and correlate events across multiple monitoring tools. In practice, this process largely remains reactive due to the complexity and general lack of interpretability within such monitoring environments. This work presents ViSRE: an integrated visual analytics system that integrates causal and predictive models with interactive visualizations to aid in proactive cloud outage management. We develop enhanced node representations for our causal graph representation to support system engineers in performing root cause analysis and reasoning about causality chains in multi-dimensional temporal data. We report the results of a quantitative assessment of the proposed predictive models, which show good performance guarantees. To evaluate and refine our system, we conduct a study with six cloud system engineers who verify that our proposed techniques can support proactive cloud maintenance by intuitively displaying temporal relationships between predicted and raw data. By correlating and presenting data from disparate sources, ViSRE also reduces context switching costs and reduces the time spent on manually correlating events during remediation of time-critical outages.
引用
收藏
页码:5 / 16
页数:12
相关论文
共 50 条
  • [1] Dementia dashboard - A proactive risk reduction management guideline
    Dalsania, Parag
    TOPICS IN GERIATRIC REHABILITATION, 2006, 22 (03) : 228 - 242
  • [2] A Unified Dashboard for Collaborative Robot Management System
    Ahmad, Hishamadie
    Khalid, Mohammad Fairus
    Kandan, Rajendar
    Mydin, Mohd Nizam Mohd
    Ismail, Bukhary Ikhwan
    Hoe, Ong Hong
    2020 18TH IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2020, : 5 - 9
  • [3] Outage Management Systems real-time dashboard assessment study
    Nielsen, Terrance D.
    2007 IEEE POWER ENGINEERING SOCIETY GENERAL MEETING, VOLS 1-10, 2007, : 1543 - 1545
  • [4] A Unified Method for Asymptotic Outage Analysis
    Shi, Zheng
    Wang, Hong
    Fu, Yaru
    Lei, Hongjiang
    Alouini, Mohamed-Slim
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2024, 13 (02) : 545 - 549
  • [5] Proactive Autonomic Cloud Application Management
    Rozanska, Marta
    Horn, Geir
    2022 IEEE/ACM 15TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING, UCC, 2022, : 102 - 111
  • [6] A Visual Dashboard to Track Learning Analytics for Educational Cloud Computing
    Naranjo, Diana M.
    Prieto, Jose R.
    Molto, German
    Calatrava, Amanda
    SENSORS, 2019, 19 (13)
  • [7] Unified Cloud Application Management
    Kolb, Stefan
    Roeck, Cedric
    PROCEEDINGS 2016 IEEE WORLD CONGRESS ON SERVICES - SERVICES 2016, 2016, : 1 - 8
  • [8] Proactive resource management for cloud of services environments
    Marques, Goncalo
    Senna, Carlos
    Sargento, Susana
    Carvalho, Luis
    Pereira, Luis
    Matos, Ricardo
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 150 : 90 - 102
  • [9] Proactive Workload Management in Hybrid Cloud Computing
    Zhang, Hui
    Jiang, Guofei
    Yoshihira, Kenji
    Chen, Haifeng
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2014, 11 (01): : 90 - 100
  • [10] A Proactive Cloud Management Architecture for Private Clouds
    Dong, Dapeng
    Herbert, John
    2013 IEEE SIXTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2013), 2013, : 701 - 708