Anomaly Detection for Big Log Data Using a Hadoop Ecosystem

被引:0
|
作者
Son, Siwoon [1 ]
Gil, Myeong-Seon [1 ]
Moon, Yang-Sae [1 ]
机构
[1] Kangwon Natl Univ, Dept Comp Sci, Chunchon, Gangwon Do, South Korea
关键词
Anomaly Detection; Big Data; Log Data; Apache Hadoop; Apache Hive; Moving Average; 3-Sigma;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we address a novel method to efficiently manage and analyze a large amount of log data. First, we present a new Apache Hive-based data storage and analysis architecture to process a large volume of Hadoop log data, which rapidly occur in multiple nodes. Second, we design and implement three simple but efficient anomaly detection methods. These methods use moving average and 3-sigma techniques to detect anomalies in log data. Finally, we show that all the three methods detect abnormal intervals properly, and the weighted anomaly detection methods are more precise than the basic one. These results indicate that our research is an excellent and simple approach in detecting anomalies of log data on a Hadoop ecosystem.
引用
收藏
页码:377 / 380
页数:4
相关论文
共 50 条
  • [1] Hive-Based Anomaly Detection in Hadoop Log Data Management
    Son, Siwoon
    Gil, Myeong-Seon
    Yang, Seokwoo
    Moon, Yang-Sae
    ADVANCES IN COMPUTER SCIENCE AND UBIQUITOUS COMPUTING, 2017, 421 : 837 - 842
  • [2] Big Log Data Stream Processing: Adapting an Anomaly Detection Technique
    Dietz, Marietheres
    Pernul, Guenther
    DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA 2018), PT II, 2018, 11030 : 159 - 166
  • [3] Big Data: Mining of Log File through Hadoop
    Kotiyal, Bina
    Kumar, Ankit
    Pant, Bhaskar
    Goudar, R. H.
    2013 INTERNATIONAL CONFERENCE ON HUMAN COMPUTER INTERACTIONS (ICHCI), 2013,
  • [4] IoT Big Data provenance scheme using blockchain on Hadoop ecosystem
    Pajooh, Houshyar Honar
    Rashid, Mohammed A.
    Alam, Fakhrul
    Demidenko, Serge
    JOURNAL OF BIG DATA, 2021, 8 (01)
  • [5] IoT Big Data provenance scheme using blockchain on Hadoop ecosystem
    Houshyar Honar Pajooh
    Mohammed A. Rashid
    Fakhrul Alam
    Serge Demidenko
    Journal of Big Data, 8
  • [6] Development of Fault Detection Systems Based on Big Data Ecosystem in Semiconductor Manufacturing: The Hadoop Ecosystem Implementation
    Fu, HuiChu
    Qiao, Yan
    Bai, LiPing
    Wu, NaiQi
    Liu, Bin
    He, YunFang
    IEEE ROBOTICS & AUTOMATION MAGAZINE, 2023, 30 (02) : 22 - 33
  • [7] Big Data Management Performance Evaluation in Hadoop Ecosystem
    Liu, Qing
    Fu, Yinjin
    Ni, Guiqiang
    Mei, Jianmin
    2017 3RD INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS (BIGCOM), 2017, : 413 - 421
  • [8] Hybrid Big Data Architecture for High-Speed Log Anomaly Detection
    Tangsatjatham, Pittayut
    Nupairoj, Natawut
    2016 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE), 2016, : 538 - 543
  • [9] Hybrid Big Data Architecture for High-Speed Log Anomaly Detection
    Nupairoj, Natawut
    Tangsatjatham, Pittayut
    JOURNAL OF INTERNET TECHNOLOGY, 2017, 18 (07): : 1681 - 1688