Leveraging an Isolation Forest to Anomaly Detection and Data Clustering

被引:4
|
作者
Yepmo, Veronne [1 ]
Smits, Gregory [2 ]
Lesot, Marie -Jeanne [3 ]
Pivert, Olivier [1 ]
机构
[1] Univ Rennes, IRISA, Lannion, France
[2] Lab STICC, IMT Atlantique, Brest, France
[3] Sorbonne Univ, LIP6, Paris, France
关键词
Anomaly/outlier detection; Isolation forest; Clustering; FUZZY; ALGORITHM; NOISE;
D O I
10.1016/j.datak.2024.102302
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Understanding why some points in a data set are considered as anomalies cannot be done without taking into account the structure of the regular points. Whereas many machine learning methods are dedicated to the identification of anomalies on one side, or to the identification of the data inner -structure on the other side, a solution is introduced to answers these two tasks using a same data model, a variant of an isolation forest. The initial algorithm to construct an isolation forest is indeed revisited to preserve the data inner structure without affecting the efficiency of the outlier detection. Experiments conducted both on synthetic and real -world data sets show that, in addition to improving the detection of abnormal data points, the proposed variant of isolation forest allows for a reconstruction of the subspaces of high density. Therefore, the former can serve as a basis for a unified approach to detect global and local anomalies, which is a necessary condition to then provide users with informative descriptions of the data.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Hydrological Time Series Anomaly Pattern Detection based on Isolation Forest
    Qin, Yu
    Lou, YuanSheng
    PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 1706 - 1710
  • [42] DeepiForest: A Deep Anomaly Detection Framework with Hashing Based Isolation Forest
    Xiang, Haolong
    Hu, Hongsheng
    Zhang, Xuyun
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, : 1251 - 1256
  • [43] A novel unsupervised anomaly detection for gas turbine using Isolation Forest
    Zhong, Shisheng
    Fu, Song
    Lin, Lin
    Fu, Xuyun
    Cui, Zhiquan
    Wang, Rui
    2019 IEEE INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT (ICPHM), 2019,
  • [44] Anomaly Detection of Storage Battery Based on Isolation Forest and Hyperparameter Tuning
    Lee, Chun-Hsiang
    Lu, Xu
    Lin, Xiunao
    Tao, Hongfeng
    Xue, Yaolei
    Wu, Chao
    2020 5TH INTERNATIONAL CONFERENCE ON MATHEMATICS AND ARTIFICIAL INTELLIGENCE (ICMAI 2020), 2020, : 229 - 233
  • [45] Evaluating the Isolation Forest Method for Anomaly Detection in SoftwareDefined Networking Security
    Lakshmi, M. Sri
    Rajavikram, G.
    Dattatreya, V.
    Jyothi, B. Swarna
    Patil, Shruti
    Bhavsingh, M.
    JOURNAL OF ELECTRICAL SYSTEMS, 2023, 19 (04) : 279 - 297
  • [46] A parallel algorithm for network traffic anomaly detection based on Isolation Forest
    Tao, Xiaoling
    Peng, Yang
    Zhao, Feng
    Zhao, Peichao
    Wang, Yong
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2018, 14 (11)
  • [47] Improved Anomaly Detection by Using the Attention-Based Isolation Forest
    Utkin, Lev
    Ageev, Andrey
    Konstantinov, Andrei
    Muliukha, Vladimir
    ALGORITHMS, 2023, 16 (01)
  • [48] Anomaly Detection in Spacecraft Telemetry using Similarity Metrics and Isolation Forest
    Bollam, Mahesh
    Roy, Praful H.
    Jagtap, Anuj
    Mullapudi, Balaram
    Verma, Anjali
    2024 IEEE SPACE, AEROSPACE AND DEFENCE CONFERENCE, SPACE 2024, 2024, : 911 - 915
  • [49] Online Clustering for Evolving Data Streams with Online Anomaly Detection
    Chenaghlou, Milad
    Moshtaghi, Masud
    Leckie, Christopher
    Salehi, Mahsa
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2018, PT II, 2018, 10938 : 506 - 519
  • [50] Photovoltaic anomaly data detection method based on clustering iForest
    Han, Bitong
    Shan, Yu
    Xie, Hongbin
    Ge, Leyi
    THIRD INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION; NETWORK AND COMPUTER TECHNOLOGY (ECNCT 2021), 2022, 12167