A proposal of event correlation for distributed network fault management and its evaluation

被引:0
|
作者
Kato, N [1 ]
Ohta, K
Ika, T
Mansfield, G
Nemoto, Y
机构
[1] Tohoku Univ, Grad Sch, Sendai, Miyagi 9808579, Japan
[2] Cyber Solut Inc, Sendai, Miyagi 9893204, Japan
关键词
event correlation; distributed network management; NMC (Network Management Clock);
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In a distributed network management environment, a NMS (Network Management Station) interacts with several agents in different sub-networks. In the network fault management context, the NMS detects symptoms that indicate some abnormality e.g. a surge in ICMP traffic, which may be caused by some network malfunction or misuse. The occurrence of a symptom is an event. Large number of events may be detected by an NMS. The sheer number of these events makes it difficult, if not impossible, for an NMS to diagnose these events. Generally, a fault may have a cascading effect which may, in turn, give rise to a very large number of events. The sequence of events and their correlation play an important role in fault management and diagnosis. In the distributed environment of todays networks, the absence of any uniform time for reference makes this a challenging task. In the present network management framework of SNMP, a Manager maintains a notion of the clock of the agent ii; interacts with. But this mechanism is inadequate to determine the sequence of events and their correlation, more so, in a distributed environment which may involve several managers. In this paper we propose a mechanism for ordering and correlating events detected in large-scale network which is managed in a distributed manner within the SNMP framework. Our algorithm uses the concept of a Network Management Clock (NMC). The NMC is a virtual clock maintained by a manager based on sysUpTime readings from each SNMP agent. In this paper, the algorithm, its implementation and evaluation will be discussed.
引用
收藏
页码:859 / 867
页数:9
相关论文
共 50 条
  • [1] Fault isolation and event correlation for integrated fault management
    Katker, S
    Paterok, M
    INTEGRATED NETWORK MANAGEMENT V: INTEGRATED MANAGEMENT IN A VIRTUAL WORLD, 1997, : 583 - 596
  • [2] Distributed Event Processing for Fault Management of Web Services
    Alam, Sazedul
    Deters, Ralph
    2009 IEEE ASIA-PACIFIC SERVICES COMPUTING CONFERENCE (APSCC 2009), 2009, : 310 - 315
  • [3] A network event correlation algorithm based on fault filtration
    Zheng, Qiuhua
    Qian, Yuntao
    Yao, Min
    PRICAI 2006: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4099 : 864 - 869
  • [4] Temporal and spatial distributed event correlation for network security
    Jiang, GF
    Cybenko, G
    PROCEEDINGS OF THE 2004 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2004, : 996 - 1001
  • [5] Distributed software agents for network fault management
    Hajji, H
    Far, BH
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2000, E83D (04): : 735 - 746
  • [6] Distributed software agents for network fault management
    Hajji, Hassan
    Homayoun, Behrouz
    IEICE Transactions on Information and Systems, 2000, E83-D (04) : 735 - 746
  • [7] The dynamic symptom isolation algorithm for network fault management and its evaluation
    Mori, T
    Ohta, K
    Kato, N
    Sone, H
    Mansfield, G
    Nemoto, Y
    IEICE TRANSACTIONS ON COMMUNICATIONS, 1998, E81B (12) : 2471 - 2480
  • [8] Robust event correlation scheme for fault identification in communications network
    Lo, CC
    Chen, SH
    GLOBECOM 98: IEEE GLOBECOM 1998 - CONFERENCE RECORD, VOLS 1-6: THE BRIDGE TO GLOBAL INTEGRATION, 1998, : 3745 - 3750
  • [10] Construction of network fault simulation platform and event samples acquisition techniques for event correlation
    Su, Y.-B.
    Wang, Z.
    Cao, Y.
    Huang, T.-X.
    Wang, L.-N.
    Wuhan University Journal of Natural Sciences, 2001, 6 (03) : 670 - 674