Sequential Ensemble Learning for Outlier Detection: A Bias-Variance Perspective

被引:0
|
作者
Rayana, Shebuti [1 ]
Zhong, Wen [1 ]
Akoglu, Leman [2 ]
机构
[1] SUNY Stony Brook, Stony Brook, NY 11794 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/ICDM.2016.117
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ensemble methods for classification have been effectively used for decades, while for outlier detection it has only been studied recently. In this work, we design a new ensemble approach for outlier detection in multi-dimensional point data, which provides improved accuracy by reducing error through both bias and variance by considering outlier detection as a binary classification task with unobserved labels. In this paper, we propose a sequential ensemble approach called CARE that employs a two-phase aggregation of the intermediate results in each iteration to reach the final outcome. Unlike existing outlier ensembles, our ensemble incorporates both the parallel and sequential building blocks to reduce bias as well as variance by (i) successively eliminating outliers from the original dataset to build a better data model on which outlierness is estimated (sequentially), and (ii) combining the results from individual base detectors and across iterations (parallelly). Through extensive experiments on 16 real-world datasets mainly from the UCI machine learning repository [1], we show that CARE performs significantly better than or at least similar to the individual baselines as well as the existing state-of-the-art outlier ensembles.
引用
收藏
页码:1167 / 1172
页数:6
相关论文
共 50 条
  • [31] Multiclass Learning with Margin: Exponential Rates with No Bias-Variance Trade-Off
    Vigogna, Stefano
    Meanti, Giacomo
    De Vito, Ernesto
    Rosasco, Lorenzo
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [32] Bias-Variance Tradeoff of Graph Laplacian Regularizer
    Chen, Pin-Yu
    Liu, Sijia
    IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (08) : 1118 - 1122
  • [33] Applications of the bias-variance decomposition to human forecasting
    Kane, Patrick Bodilly
    Broomell, Stephen B.
    JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2020, 98
  • [34] BIAS-VARIANCE TRADEOFFS IN FUNCTIONAL ESTIMATION PROBLEMS
    LOW, MG
    ANNALS OF STATISTICS, 1995, 23 (03): : 824 - 835
  • [35] A bias-variance analysis of a real world learning problem: The CoIL Challenge 2000
    Van der Putten, P
    Van Someren, M
    MACHINE LEARNING, 2004, 57 (1-2) : 177 - 195
  • [36] Are We Missing the Target? A Bias-Variance Perspective on Multi-Hazard Risk Assessment
    Dunant, Alexandre
    FRONTIERS IN EARTH SCIENCE, 2021, 9
  • [37] GBVSSL: Contrastive Semi-Supervised Learning Based on Generalized Bias-Variance Decomposition
    Li, Shu
    Han, Lixin
    Wang, Yang
    Zhu, Jun
    SYMMETRY-BASEL, 2024, 16 (06):
  • [38] Robust Ranking Model via Bias-Variance Optimization
    Li, Jinzhong
    Liu, Guanjun
    Xia, Jiewu
    INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2017, PT III, 2017, 10363 : 706 - 718
  • [39] A novel approach to the bias-variance problem in bump hunting
    Williams, M.
    JOURNAL OF INSTRUMENTATION, 2017, 12
  • [40] Robust multi-layer extreme learning machine using bias-variance tradeoff
    Yu, Tian-jun
    Yan, Xue-feng
    JOURNAL OF CENTRAL SOUTH UNIVERSITY, 2020, 27 (12) : 3744 - 3753