Fast One-class Classification using Class Boundary-preserving Random Projections

Cited by: 4
Authors
Bhattacharya, Arindam [1]
Varambally, Sumanth [2]
Bagchi, Amitabha [1]
Bedathur, Srikanta [1]
Affiliations
[1] IIT Delhi, Dept Comp Sci, Delhi, India
[2] IIT Delhi, Dept Math, Delhi, India
Keywords
one-class classification; ensemble classifier; random projection; kernel-based method
DOI
10.1145/3447548.3467440
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Several applications, like malicious URL detection and web spam detection, require classification on very high-dimensional data. In such cases anomalous data is hard to find but normal data is easily available. As such, it is increasingly common to use a one-class classifier (OCC). Unfortunately, most OCC algorithms cannot scale to datasets with extremely high dimensions. In this paper, we present Fast Random projection-based One-Class Classification (FROCC), an extremely efficient, scalable and easily parallelizable method for one-class classification with provable theoretical guarantees. Our method is based on the simple idea of transforming the training data by projecting it onto a set of random unit vectors that are chosen uniformly and independently from the unit sphere, and bounding the regions based on separation of the data. FROCC can be naturally extended with kernels. We provide a new theoretical framework to prove that FROCC generalizes well in the sense that it is stable and has low bias for some parameter settings. We then develop a fast, scalable approximation of FROCC using vectorization, exploiting data sparsity and parallelism to develop a new implementation called ParDFROCC. ParDFROCC achieves up to 2 percentage points better ROC than the next best baseline, with up to 12x speedup in training and test times over a range of state-of-the-art benchmarks for the OCC task.
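The projection-and-bounding idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes that "bounding the regions based on separation" means merging sorted projections into intervals and starting a new interval whenever the gap exceeds a threshold, and that a test point is scored by the fraction of directions whose intervals contain its projection. The names `eps`, `num_vectors`, `fit`, and `score` are hypothetical, chosen only for illustration.

```python
import math
import random


def random_unit_vector(dim, rng):
    # Normalizing an i.i.d. Gaussian vector yields a direction that is
    # uniformly distributed on the unit sphere.
    v = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def build_intervals(values, eps):
    # Assumption: a gap larger than eps between consecutive sorted
    # projections separates two regions of normal data.
    values = sorted(values)
    intervals = [[values[0], values[0]]]
    for x in values[1:]:
        if x - intervals[-1][1] > eps:
            intervals.append([x, x])
        else:
            intervals[-1][1] = x
    return intervals


def fit(train, num_vectors, eps, seed=0):
    # One (direction, intervals) pair per random unit vector.
    rng = random.Random(seed)
    dim = len(train[0])
    model = []
    for _ in range(num_vectors):
        w = random_unit_vector(dim, rng)
        projections = [dot(w, x) for x in train]
        model.append((w, build_intervals(projections, eps)))
    return model


def score(model, x):
    # Fraction of directions whose intervals contain the projection of x;
    # higher means "more like the training (normal) class".
    hits = 0
    for w, intervals in model:
        p = dot(w, x)
        if any(lo <= p <= hi for lo, hi in intervals):
            hits += 1
    return hits / len(model)
```

In this sketch each direction is processed independently, which is what makes the approach easy to parallelize; a training point always scores 1.0 because its own projection lies inside one of the intervals built from it.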
Pages: 66-74
Number of pages: 9