Anomaly credit data detection based on enhanced Isolation Forest

被引:5
|
作者
Zhang, Xiaodong [1 ]
Yao, Yuan [1 ]
Lv, Congdong [1 ]
Wang, Tao [2 ]
机构
[1] Nanjing Audit Univ, Sch Informat Engn, Nanjing 211815, Peoples R China
[2] JUSFOUN BIG DATA, Beijing 10000, Peoples R China
基金
国家重点研发计划;
关键词
Credit evaluation; Anomaly detection; Class-imbalance; Cost-sensitive; EasyEnsemble; Isolation forest; SVM;
D O I
10.1007/s00170-022-09251-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In view of the real-world problem of falsity and errors credit data, and the performance degradation of the credit evaluation model caused by these problems, we proposed an outlier detection algorithm, which considered two characteristics of class-imbalance and cost-sensitive in credit data. We use an anomaly detection model called EIF to optimize the credit evaluation models. EIF uses the EasyEnsemble algorithm to construct balanced data sets, and train an Isolation Forest model for anomaly detection by the balanced datasets with different disturbances. On the one hand, the balanced dataset ensures that the class-imbalance problem is solved by undersampling, on the other hand, each sub-model learns from the overall minority class samples in order to solve the cost-sensitive problem. Experiments were performed on UCI German dataset, and the test set with fake data was constructed by correlation. Compared with other anomaly detection algorithms in common credit evaluation models, the EIF-optimized model has a higher F1 score and a lower cost-sensitive error rate. In conclusion, the EIF model is effective in enhancing the performance of the credit evaluation model for forged credit datasets.
引用
收藏
页码:185 / 192
页数:8
相关论文
共 50 条
  • [41] On the effectiveness of isolation-based anomaly detection in cloud data centers
    Calheiros, Rodrigo N.
    Ramamohanarao, Kotagiri
    Buyya, Rajkumar
    Leckie, Christopher
    Versteeg, Steve
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2017, 29 (18):
  • [42] Isolation-Based Anomaly Detection
    Liu, Fei Tony
    Ting, Kai Ming
    Zhou, Zhi-Hua
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2012, 6 (01)
  • [43] Interpretable Anomaly Detection with DIFFI: Depth-based feature importance of Isolation Forest
    Carletti, Mattia
    Terzi, Matteo
    Susto, Gian Antonio
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 119
  • [44] Network Anomaly Detection Algorithm Based on Fusion of Artificial Fish Swarming and Isolation Forest
    Zhang, Chen
    Shen, Wei
    Wang, ShengZhao
    Wu, Yue
    2024 5TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATION, ICCEA 2024, 2024, : 238 - 242
  • [45] Two-Stream Isolation Forest Based on Deep Features for Hyperspectral Anomaly Detection
    Cheng, Xi
    Zhang, Min
    Lin, Sheng
    Zhou, Kexue
    Zhao, Shaobo
    Wang, Hai
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [46] Anomaly detection model based on multi-grained cascade isolation forest algorithm
    Yang X.
    Zhang S.
    Tongxin Xuebao/Journal on Communications, 2019, 40 (08): : 133 - 142
  • [47] Isolation Forest-based semi-supervised Anomaly Detection of multiple classes
    Melquiades, Caio
    de Lima Neto, Fernando Buarque
    2022 17TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2022,
  • [48] OPHiForest: Order Preserving Hashing Based Isolation Forest for Robust and Scalable Anomaly Detection
    Xiang, Haolong
    Salcic, Zoran
    Dou, Wanchun
    Xu, Xiaolong
    Qi, Lianyong
    Zhang, Xuyun
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 1655 - 1664
  • [49] Epileptic Seizure Detection by Cascading Isolation Forest-Based Anomaly Screening and EasyEnsemble
    Guo, Yao
    Jiang, Xinyu
    Tao, Linkai
    Meng, Long
    Dai, Chenyun
    Long, Xi
    Wan, Feng
    Zhang, Yuan
    van Dijk, Johannes
    Aarts, Ronald M.
    Chen, Wei
    Chen, Chen
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2022, 30 : 915 - 924
  • [50] Insider Threat Detection Model Using Anomaly-Based Isolation Forest Algorithm
    Al-Shehari, Taher
    Al-Razgan, Muna
    Alfakih, Taha
    Alsowail, Rakan A.
    Pandiaraj, Saravanan
    IEEE ACCESS, 2023, 11 : 118170 - 118185