Hyperparameter Sensitivity in Deep Outlier Detection Analysis and a Scalable Hyper-Ensemble Solution

被引:0
|
作者
Ding, Xueying [1 ]
Zhao, Lingxiao [1 ]
Akoglu, Leman [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Outlier detection (OD) literature exhibits numerous algorithms as it applies to diverse domains. However, given a new detection task, it is unclear how to choose an algorithm to use, nor how to set its hyperparameter(s) (HPs) in unsupervised settings. HP tuning is an ever-growing problem with the arrival of many new detectors based on deep learning, which usually come with a long list of HPs. Surprisingly, the issue of model selection in the outlier mining literature has been "the elephant in the room"; a significant factor in unlocking the utmost potential of deep methods, yet little said or done to systematically tackle the issue. In the first part of this paper, we conduct the first large-scale analysis on the HP sensitivity of deep OD methods, and through more than 35,000 trained models, quantitatively demonstrate that model selection is inevitable. Next, we design a HP-robust and scalable deep hyper-ensemble model called ROBOD that assembles models with varying HP configurations, bypassing the choice paralysis. Importantly, we introduce novel strategies to speed up ensemble training, such as parameter sharing, batch/simultaneous training, and data subsampling, that allow us to train fewer models with fewer parameters. Extensive experiments on both image and tabular datasets show that ROBOD achieves and retains robust, state-of-the-art detection performance as compared to its modern counterparts, while taking only 2-10% of the time by the naive hyper-ensemble with independent training.
引用
收藏
页数:14
相关论文
共 34 条
  • [21] Champion-challenger analysis for credit card fraud detection: Hybrid ensemble and deep learning
    Kim, Eunji
    Lee, Jehyuk
    Shin, Hunsik
    Yang, Hoseong
    Cho, Sungzoon
    Nam, Seung-kwan
    Song, Youngmi
    Yoon, Jeong-a
    Kim, Jong-il
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 128 : 214 - 224
  • [22] Analysis of Scale Sensitivity of Ship Detection in an Anchor-Free Deep Learning Framework
    Jiang, Yongxin
    Huang, Li
    Zhang, Zhiyou
    Nie, Bu
    Zhang, Fan
    ELECTRONICS, 2023, 12 (01)
  • [23] Comparative Analysis of Machine Learning, Ensemble Learning and Deep Learning Classifiers for Parkinson’s Disease Detection
    Goyal P.
    Rani R.
    SN Computer Science, 5 (1)
  • [24] An Ensemble Deep Learning Model for Oral Squamous Cell Carcinoma Detection Using Histopathological Image Analysis
    Das, Madhusmita
    Dash, Rasmita
    Kumar Mishra, Sambit
    Kumar Dalai, Asish
    IEEE ACCESS, 2024, 12 : 127185 - 127197
  • [25] Is Fetal-Type Posterior Cerebral Artery a Risk Factor for Recurrence in Coiled Internal Carotid Artery-Incorporating Posterior Communicating Artery Aneurysms? Analysis of Conventional Statistics, Computational Fluid Dynamics, and Random Forest With Hyper-Ensemble Approach
    Chung, Jaewoo
    Cheong, Jin Hwan
    Kim, Jae Min
    Lee, Deok Hee
    Yi, Hyeong-Joong
    Choi, Kyu-Sun
    Ahn, Jae Sung
    Park, Jung Cheol
    Park, Wonhyoung
    NEUROSURGERY, 2023, 93 (03) : 611 - 621
  • [26] Microscopic image analysis in breast cancer detection using ensemble deep learning architectures integrated with web of things
    Sheeba, Adlin
    Kumar, P. Santhosh
    Ramamoorthy, M.
    Sasikala, S.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79
  • [27] Hyperparameter Optimization for Large-Scale Remote Sensing Image Analysis Tasks: A Case Study Based on Permafrost Landform Detection Using Deep Learning
    Perera, Amal S.
    Witharana, Chandi
    Manos, Elias
    Liljedahl, Anna K.
    IEEE ACCESS, 2024, 12 : 43062 - 43077
  • [28] MeMalDet: A memory analysis-based malware detection framework using deep autoencoders and stacked ensemble under temporal evaluations
    Maniriho, Pascal
    Mahmood, Abdun Naser
    Chowdhury, Mohammad Jabed Morshed
    COMPUTERS & SECURITY, 2024, 142
  • [29] Sensitivity analysis of scalable data on three PCA related fault detection methods considering data window and thermal load matching strategies
    Yang, Xuebin
    Chen, Jianfei
    Gu, Xuan
    He, Ruru
    Wang, Ji
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 234
  • [30] Analysis of Securing Edge-Cloud Computing and Network Based Deep Neural Intrusion Detection System as a Solution Model
    Girma, Anteneh
    Tamirat, Marshet
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, INTELLISYS 2024, 2024, 1065 : 438 - 451