Sequential Ensemble Learning for Outlier Detection: A Bias-Variance Perspective

Authors
Rayana, Shebuti [1]
Zhong, Wen [1]
Akoglu, Leman [2]
Affiliations
[1] SUNY Stony Brook, Stony Brook, NY 11794 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
Funding
National Science Foundation (USA)
DOI
10.1109/ICDM.2016.117
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Ensemble methods have been used effectively for classification for decades, whereas ensembles for outlier detection have been studied only recently. In this work, we design a new ensemble approach for outlier detection in multi-dimensional point data that improves accuracy by reducing both the bias and the variance components of the error, treating outlier detection as a binary classification task with unobserved labels. The proposed sequential ensemble, called CARE, employs a two-phase aggregation of the intermediate results in each iteration to reach the final outcome. Unlike existing outlier ensembles, CARE incorporates both parallel and sequential building blocks to reduce bias as well as variance by (i) successively eliminating outliers from the original dataset to build a better data model on which outlierness is estimated (the sequential part), and (ii) combining the results from individual base detectors and across iterations (the parallel part). Through extensive experiments on 16 real-world datasets, mainly from the UCI Machine Learning Repository [1], we show that CARE performs significantly better than, or at least comparably to, the individual baselines as well as existing state-of-the-art outlier ensembles.
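The two building blocks described in the abstract can be illustrated with a minimal Python sketch. This is not the published CARE algorithm: the abstract does not specify the base detectors, the aggregation rules, or the elimination criterion, so everything below is an assumption chosen for illustration. The base detector is a plain k-nearest-neighbour distance score, the combination rule is an average of rank-normalised scores, and the names `knn_scores`, `rank_normalize`, `sequential_ensemble`, `ks`, `n_iter`, and `drop_frac` are hypothetical.

```python
import numpy as np


def knn_scores(X, ref_idx, k):
    """Score every point in X by its mean distance to its k nearest
    neighbours among the reference points X[ref_idx]."""
    # Pairwise Euclidean distances, shape (n_points, n_reference).
    d = np.linalg.norm(X[:, None, :] - X[ref_idx][None, :, :], axis=2)
    # A reference point must not count itself as a neighbour.
    d[ref_idx, np.arange(len(ref_idx))] = np.inf
    return np.sort(d, axis=1)[:, :k].mean(axis=1)


def rank_normalize(scores):
    """Map scores to ranks in [0, 1] so detectors become comparable."""
    ranks = np.argsort(np.argsort(scores))
    return ranks / (len(scores) - 1)


def sequential_ensemble(X, ks=(3, 5, 10), n_iter=3, drop_frac=0.05):
    """Illustrative sequential + parallel outlier ensemble (assumed design)."""
    n = len(X)
    n_drop = int(drop_frac * n)
    ref_idx = np.arange(n)  # start with the full dataset as the data model
    per_iteration = []
    for _ in range(n_iter):
        # Parallel block: combine several base detectors in this iteration.
        base = [rank_normalize(knn_scores(X, ref_idx, k)) for k in ks]
        combined = np.mean(base, axis=0)
        per_iteration.append(combined)
        # Sequential block: remove the currently top-scored points from the
        # original dataset, so the next iteration models cleaner data.
        # All points, including removed ones, are still scored.
        ref_idx = np.sort(np.argsort(combined)[: n - n_drop])
    # Second aggregation phase: combine the results across iterations.
    return np.mean(per_iteration, axis=0)
```

Calling `sequential_ensemble(X)` on an `(n, d)` NumPy array returns one score per row, with higher values indicating more outlying points. The reference set must keep more than `max(ks)` points for the neighbour search to be well defined.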
Pages: 1167-1172
Number of pages: 6
Related papers
50 in total
  • [1] Ensemble Learning in Hyperspectral Image Classification: Toward Selecting a Favorable Bias-Variance Tradeoff
    Merentitis, Andreas
    Debes, Christian
    Heremans, Roel
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2014, 7 (04) : 1089 - 1102
  • [2] A bias-variance perspective of data-driven control
    Colin, Kevin
    Ju, Yue
    Bombois, Xavier
    Rojas, Cristian R.
    Hjalmarsson, Hakan
    IFAC PAPERSONLINE, 2024, 58 (15): : 85 - 90
  • [3] On the stability and bias-variance analysis of kernel matrix learning
    Saradhi, V. Vijaya
    Karnick, Harish
    ADVANCES IN ARTIFICIAL INTELLIGENCE, 2007, 4509 : 441 - +
  • [4] Bias-Variance Decomposition for Ranking
    Shivaswamy, Pannaga
    Chandrashekar, Ashok
    WSDM '21: PROCEEDINGS OF THE 14TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2021, : 472 - 480
  • [5] Generalized Emphatic Temporal Difference Learning: Bias-Variance Analysis
    Hallak, Assaf
    Tamar, Aviv
    Munos, Remi
    Mannor, Shie
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 1631 - 1637
  • [6] Enhanced Balancing of Bias-Variance Tradeoff in Stochastic Estimation: A Minimax Perspective
    Lam, Henry
    Zhang, Xinyu
    Zhang, Xuhui
    OPERATIONS RESEARCH, 2023, 71 (06) : 2352 - 2373
  • [7] Prefrontal solution to the bias-variance tradeoff during reinforcement learning
    Kim, Dongjae
    Jeong, Jaeseung
    Lee, Sang Wan
    CELL REPORTS, 2021, 37 (13):
  • [8] Towards Costless Model Selection in Contextual Bandits: A Bias-Variance Perspective
    Krishnamurthy, Sanath Kumar
    Propp, Adrienne Margaret
    Athey, Susan
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [9] Bias-variance decomposition in Genetic Programming
    Kowaliw, Taras
    Doursat, Rene
    OPEN MATHEMATICS, 2016, 14 : 62 - 80
  • [10] Bias-Variance Decomposition of IR Evaluation
    Zhang, Peng
    Song, Dawei
    Wang, Jun
    Hou, Yuexian
    SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, 2013, : 1021 - 1024