HPOD: Hyperparameter Optimization for Unsupervised Outlier Detection

被引:0
|
作者
Zhao, Yue [1 ]
Akoglu, Leman [2 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given an unsupervised outlier detection (OD) algorithm, how can we optimize its hyperparameter(s) (HP) on a new dataset, without using any labels? In this work, we address this challenging hyperparameter optimization for unsupervised OD problem, and propose the first continuous HP search method called HPOD. It capitalizes on the prior performance of a large collection of HPs on existing OD benchmark datasets, and transfers this information to enable HP evaluation on a new dataset without labels. Also, HPOD adapts a prominent, (originally) supervised, sampling paradigm to efficiently identify promising HPs in iterations. Extensive experiments show that HPOD works for both deep (e.g., Robust AutoEncoder (RAE)) and shallow (e.g., Local Outlier Factor (LOF) and Isolation Forest (iForest)) algorithms on discrete and continuous HP spaces. HPOD outperforms a wide range of diverse baselines with 37% improvement on average over the minimal loss HPs of RAE, and 58% and 66% improvement on average over the default HPs of LOF and iForest.
引用
收藏
页数:24
相关论文
共 50 条
  • [41] Unsupervised Boosting-Based Autoencoder Ensembles for Outlier Detection
    Sarvari, Hamed
    Domeniconi, Carlotta
    Prenkaj, Bardh
    Stilo, Giovanni
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT I, 2021, 12712 : 91 - 103
  • [42] On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study
    Campos, Guilherme O.
    Zimek, Arthur
    Sander, Jorg
    Campello, Ricardo J. G. B.
    Micenkova, Barbora
    Schubert, Erich
    Assent, Ira
    Houle, Michael E.
    DATA MINING AND KNOWLEDGE DISCOVERY, 2016, 30 (04) : 891 - 927
  • [43] E3Outlier: a Self-Supervised Framework for Unsupervised Deep Outlier Detection
    Wang, Siqi
    Zeng, Yijie
    Yu, Guang
    Cheng, Zhen
    Liu, Xinwang
    Zhou, Sihang
    Zhu, En
    Kloft, Marius
    Yin, Jianping
    Liao, Qing
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 2952 - 2969
  • [44] Unsupervised Outlier Detection in Streaming Data Using Weighted Clustering
    Thakran, Yogita
    Toshniwal, Durga
    2012 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2012, : 947 - 952
  • [45] Unsupervised Outlier detection in sensor networks using aggregation tree
    Zhang, Kejia
    Shi, Shengfei
    Gao, Hong
    Li, Jianzhong
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2007, 4632 : 158 - +
  • [46] Robust and Explainable Autoencoders for Unsupervised Time Series Outlier Detection
    Kieu, Tung
    Yang, Bin
    Guo, Chenjuan
    Jensen, Christian S.
    Zhao, Yan
    Huang, Feiteng
    Zheng, Kai
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 3038 - 3050
  • [47] Hyperparameter Sensitivity in Deep Outlier Detection Analysis and a Scalable Hyper-Ensemble Solution
    Ding, Xueying
    Zhao, Lingxiao
    Akoglu, Leman
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [48] Interpretable Single-dimension Outlier Detection (ISOD): An Unsupervised Outlier Detection Method Based on Quantiles and Skewness Coefficients
    Huang, Yuehua
    Liu, Wenfen
    Li, Song
    Guo, Ying
    Chen, Wen
    APPLIED SCIENCES-BASEL, 2024, 14 (01):
  • [49] Hyperparameter Optimization of Ensemble Models for Spam Email Detection
    Omotehinwa, Temidayo Oluwatosin
    Oyewola, David Opeoluwa
    APPLIED SCIENCES-BASEL, 2023, 13 (03):
  • [50] L0-norm Constrained Autoencoders for Unsupervised Outlier Detection
    Ishii, Yoshinao
    Koide, Satoshi
    Hayakawa, Keiichiro
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT II, 2020, 12085 : 674 - 687