Sequential Ensemble Learning for Outlier Detection: A Bias-Variance Perspective

被引：0

作者：

Rayana, Shebuti ^{[1
]}

Zhong, Wen ^{[1
]}

Akoglu, Leman ^{[2
]}

机构：

[1] SUNY Stony Brook, Stony Brook, NY 11794 USA

[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

来源：

2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM) | 2016年

基金：

美国国家科学基金会;

关键词：

D O I：

10.1109/ICDM.2016.117

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Ensemble methods for classification have been effectively used for decades, while for outlier detection it has only been studied recently. In this work, we design a new ensemble approach for outlier detection in multi-dimensional point data, which provides improved accuracy by reducing error through both bias and variance by considering outlier detection as a binary classification task with unobserved labels. In this paper, we propose a sequential ensemble approach called CARE that employs a two-phase aggregation of the intermediate results in each iteration to reach the final outcome. Unlike existing outlier ensembles, our ensemble incorporates both the parallel and sequential building blocks to reduce bias as well as variance by (i) successively eliminating outliers from the original dataset to build a better data model on which outlierness is estimated (sequentially), and (ii) combining the results from individual base detectors and across iterations (parallelly). Through extensive experiments on 16 real-world datasets mainly from the UCI machine learning repository [1], we show that CARE performs significantly better than or at least similar to the individual baselines as well as the existing state-of-the-art outlier ensembles.

引用

页码：1167 / 1172

页数：6

共 50 条

[41] Bias-variance control via hard points shaving
Merler, S
Caprile, B
Furlanello, C
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2004, 18 (05) : 891 - 903
[42] A bias-variance evaluation framework for information retrieval systems
Zhang, Peng
Gao, Hui
Hu, Zeting
Yang, Meng
Song, Dawei
Wang, Jun
Hou, Yuexian
Hu, Bin
INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (01)
[43] Bias-variance analysis for controlling adaptive surface meshes
Wilson, RC
Hancock, ER
COMPUTER VISION AND IMAGE UNDERSTANDING, 2000, 77 (01) : 25 - 47
[44] There’s no such thing as a free lunchThe Bias-Variance dilemma
Vivek S. Borkar
Resonance, 1998, 3 (6) : 40 - 51
[45] Invariant operators, small samples, and the bias-variance dilemma
Shi, X
Manduchi, R
PROCEEDINGS OF THE 2004 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, 2004, : 528 - 534
[46] Contrastive clustering based on generalized bias-variance decomposition
Li, Shu
Han, Lixin
Wang, Yang
Pu, Yonglin
Zhu, Jun
Li, Jingxian
KNOWLEDGE-BASED SYSTEMS, 2024, 305
[47] Bias-variance tradeoff in machine learning: Theoretical formulation and implications to structural engineering applications
Guan, Xingquan
Burton, Henry
STRUCTURES, 2022, 46 : 17 - 30
[48] Meta-Optimization of Bias-Variance Trade-Off in Stochastic Model Learning
Aotani, Takumi
Kobayashi, Taisuke
Sugimoto, Kenji
IEEE ACCESS, 2021, 9 : 148783 - 148799
[49] Bias-Variance Tradeoffs for Designing Simultaneous Temporal Experiments
Xiong, Ruoxuan
Chin, Alex
Taylor, Sean
KDD'23 WORKSHOP ON CAUSAL DISCOVERY, PREDICTION AND DECISION, VOL 218, 2023, 218 : 115 - 131
[50] Rethinking learning difficulty and uncertainty of samples with a target perturbation-aware bias-variance decomposition
Yao, Rujing
Wu, Ou
Wang, Fang
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025,

← 1 2 3 4 5 →