Sequential Ensemble Learning for Outlier Detection: A Bias-Variance Perspective

Citations: 0
Authors
Rayana, Shebuti [1]
Zhong, Wen [1]
Akoglu, Leman [2]
Affiliations
[1] SUNY Stony Brook, Stony Brook, NY 11794 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
Funding
National Science Foundation (USA)
DOI
10.1109/ICDM.2016.117
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Ensemble methods have been used effectively for classification for decades, whereas for outlier detection they have been studied only recently. In this work, we design a new ensemble approach for outlier detection in multi-dimensional point data that improves accuracy by reducing both the bias and the variance components of error, treating outlier detection as a binary classification task with unobserved labels. The proposed sequential ensemble, called CARE, employs a two-phase aggregation of the intermediate results in each iteration to reach the final outcome. Unlike existing outlier ensembles, it incorporates both parallel and sequential building blocks to reduce bias as well as variance by (i) successively eliminating outliers from the original dataset to build a better data model on which outlierness is estimated (the sequential component), and (ii) combining the results from individual base detectors and across iterations (the parallel component). Through extensive experiments on 16 real-world datasets, mainly from the UCI machine learning repository [1], we show that CARE performs significantly better than, or at least similarly to, the individual baselines as well as the existing state-of-the-art outlier ensembles.
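The general idea described in the abstract (score points with several base detectors, combine the scores, remove the most outlying points from the data model, and repeat) can be illustrated with a small sketch. This is not the published CARE algorithm: the k-nearest-neighbor distance detectors, the min-max normalization, the plain averaging in both phases, and the names `knn_score`, `min_max`, `sequential_ensemble`, `ks`, `n_iter`, and `drop_frac` are all assumptions made for illustration only.

```python
import math


def knn_score(i, data, reference, k):
    """Distance from data[i] to its k-th nearest neighbor among `reference` indices."""
    dists = sorted(math.dist(data[i], data[j]) for j in reference if j != i)
    return dists[min(k, len(dists)) - 1]


def min_max(scores):
    """Rescale scores to [0, 1] so that different detectors are comparable."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]


def sequential_ensemble(data, ks=(1, 2, 3), n_iter=3, drop_frac=0.1):
    """Toy sequential outlier ensemble (illustrative assumption, not CARE itself)."""
    n = len(data)
    reference = list(range(n))  # indices forming the current data model
    n_drop = max(1, int(drop_frac * n))
    per_iteration = []
    for _ in range(n_iter):
        # Phase 1: combine the base detectors within this iteration.
        detector_scores = [
            min_max([knn_score(i, data, reference, k) for i in range(n)])
            for k in ks
        ]
        combined = [sum(col) / len(ks) for col in zip(*detector_scores)]
        per_iteration.append(combined)
        # Sequential step: drop the most outlying points from the data model,
        # but only while enough points remain for every base detector.
        if len(reference) - n_drop > max(ks):
            ranked = sorted(reference, key=lambda i: combined[i], reverse=True)
            flagged = set(ranked[:n_drop])
            reference = [i for i in reference if i not in flagged]
    # Phase 2: combine the per-iteration scores across iterations.
    return [sum(col) / n_iter for col in zip(*per_iteration)]
```

Calling `sequential_ensemble(points)` on a list of equal-length numeric tuples returns one score per point, with higher values indicating stronger outlierness. All points are re-scored in every iteration, including those already removed from the data model, so removal only changes the reference set against which outlierness is estimated.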
Pages: 1167 - 1172
Page count: 6
Related papers
50 in total
  • [41] Bias-variance control via hard points shaving
    Merler, S
    Caprile, B
    Furlanello, C
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2004, 18 (05) : 891 - 903
  • [42] A bias-variance evaluation framework for information retrieval systems
    Zhang, Peng
    Gao, Hui
    Hu, Zeting
    Yang, Meng
    Song, Dawei
    Wang, Jun
    Hou, Yuexian
    Hu, Bin
    INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (01)
  • [43] Bias-variance analysis for controlling adaptive surface meshes
    Wilson, RC
    Hancock, ER
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2000, 77 (01) : 25 - 47
  • [44] There’s no such thing as a free lunch: The Bias-Variance dilemma
    Vivek S. Borkar
    Resonance, 1998, 3 (6) : 40 - 51
  • [45] Invariant operators, small samples, and the bias-variance dilemma
    Shi, X
    Manduchi, R
    PROCEEDINGS OF THE 2004 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, 2004, : 528 - 534
  • [46] Contrastive clustering based on generalized bias-variance decomposition
    Li, Shu
    Han, Lixin
    Wang, Yang
    Pu, Yonglin
    Zhu, Jun
    Li, Jingxian
    KNOWLEDGE-BASED SYSTEMS, 2024, 305
  • [47] Bias-variance tradeoff in machine learning: Theoretical formulation and implications to structural engineering applications
    Guan, Xingquan
    Burton, Henry
    STRUCTURES, 2022, 46 : 17 - 30
  • [48] Meta-Optimization of Bias-Variance Trade-Off in Stochastic Model Learning
    Aotani, Takumi
    Kobayashi, Taisuke
    Sugimoto, Kenji
    IEEE ACCESS, 2021, 9 : 148783 - 148799
  • [49] Bias-Variance Tradeoffs for Designing Simultaneous Temporal Experiments
    Xiong, Ruoxuan
    Chin, Alex
    Taylor, Sean
    KDD'23 WORKSHOP ON CAUSAL DISCOVERY, PREDICTION AND DECISION, VOL 218, 2023, 218 : 115 - 131
  • [50] Rethinking learning difficulty and uncertainty of samples with a target perturbation-aware bias-variance decomposition
    Yao, Rujing
    Wu, Ou
    Wang, Fang
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025,