Fast One-class Classification using Class Boundary-preserving Random Projections

Cited by: 4
Authors
Bhattacharya, Arindam [1]
Varambally, Sumanth [2]
Bagchi, Amitabha [1]
Bedathur, Srikanta [1]
Affiliations
[1] IIT Delhi, Dept Comp Sci, Delhi, India
[2] IIT Delhi, Dept Math, Delhi, India
Source
KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING | 2021
Keywords
one class classification; ensemble classifier; random projection; kernel based method;
DOI
10.1145/3447548.3467440
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Several applications, such as malicious URL detection and web spam detection, require classification on very high-dimensional data. In such cases, anomalous data is hard to find but normal data is easily available. As such, it is increasingly common to use a one-class classifier (OCC). Unfortunately, most OCC algorithms cannot scale to datasets with extremely high dimensions. In this paper, we present Fast Random projection-based One-Class Classification (FROCC), an extremely efficient, scalable and easily parallelizable method for one-class classification with provable theoretical guarantees. Our method is based on the simple idea of transforming the training data by projecting it onto a set of random unit vectors that are chosen uniformly and independently from the unit sphere, and bounding the regions based on the separation of the data. FROCC can be naturally extended with kernels. We provide a new theoretical framework to prove that FROCC generalizes well, in the sense that it is stable and has low bias for some parameter settings. We then develop a fast, scalable approximation of FROCC using vectorization, exploiting data sparsity and parallelism to develop a new implementation called ParDFROCC. ParDFROCC achieves up to 2 percentage points better ROC than the next best baseline, with up to 12x speedup in training and test times over a range of state-of-the-art benchmarks for the OCC task.
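The core idea in the abstract (project the training data onto random unit vectors, then bound regions along each direction according to how the projections are separated) can be illustrated with a short NumPy sketch. This is a simplified illustration under stated assumptions, not the authors' FROCC or ParDFROCC implementation: the function names, the gap threshold `eps`, and the scoring rule (fraction of directions whose intervals contain the point) are all hypothetical choices made for this example.

```python
import numpy as np


def sample_unit_vectors(m, d, rng):
    """Draw m directions uniformly from the unit sphere in R^d
    by normalizing standard Gaussian vectors."""
    w = rng.standard_normal((m, d))
    return w / np.linalg.norm(w, axis=1, keepdims=True)


def fit_intervals(X, W, eps):
    """Assumed rule: along each direction, sorted training projections
    whose neighbouring gap is at most eps are merged into one interval."""
    intervals = []
    for proj in (X @ W.T).T:                       # one row per direction
        p = np.sort(proj)
        breaks = np.flatnonzero(np.diff(p) > eps)  # gap after p[breaks]
        starts = np.concatenate(([p[0]], p[breaks + 1]))
        ends = np.concatenate((p[breaks], [p[-1]]))
        intervals.append((starts, ends))
    return intervals


def score(X, W, intervals):
    """Assumed scoring: fraction of directions along which a point's
    projection falls inside one of the training intervals."""
    P = X @ W.T
    hits = np.zeros(P.shape, dtype=bool)
    for j, (starts, ends) in enumerate(intervals):
        # Index of the last interval whose start is <= the projection.
        idx = np.searchsorted(starts, P[:, j], side="right") - 1
        valid = idx >= 0
        hits[valid, j] = P[valid, j] <= ends[idx[valid]]
    return hits.mean(axis=1)


# Toy usage on synthetic data (illustrative only).
rng = np.random.default_rng(0)
X_train = rng.standard_normal((200, 20))
W = sample_unit_vectors(50, 20, rng)
intervals = fit_intervals(X_train, W, eps=0.5)
train_scores = score(X_train, W, intervals)
outlier_scores = score(np.full((1, 20), 100.0), W, intervals)
```

In this sketch a point would be accepted as in-class when its score exceeds a chosen threshold. Each direction is processed independently, which is consistent with the abstract's remark that the method is easily parallelizable.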
Pages: 66 - 74
Number of pages: 9
Related papers
50 in total
  • [11] Overview of one-Class Classification
    Sun Wenzhu
    Hu Wenting
    Xue Zufeng
    Cao Jianping
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2019), 2019, : 6 - 10
  • [12] Deep One-Class Classification
    Ruff, Lukas
    Vandermeulen, Robert A.
    Goernitz, Nico
    Deecke, Lucas
    Siddiqui, Shoaib A.
    Binder, Alexander
    Mueller, Emmanuel
    Kloft, Marius
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [13] DEEP ONE-CLASS CLASSIFICATION USING INTRA-CLASS SPLITTING
    Schlachter, Patrick
    Liao, Yiwen
    Yang, Bin
    2019 IEEE DATA SCIENCE WORKSHOP (DSW), 2019, : 100 - 104
  • [14] REFINING IMAGE RETRIEVAL USING ONE-CLASS CLASSIFICATION
    Xiao, Jie
    Fu, Yun
    Lu, Yijuan
    Tian, Qi
    ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 314 - +
  • [16] Improved one-class classification using filled function
    Hamidzadeh, Javad
    Moradi, Mona
    APPLIED INTELLIGENCE, 2018, 48 (10) : 3263 - 3279
  • [17] Fusion of Biometric Systems using One-Class Classification
    Bergamini, Cheila
    Oliveira, Luiz S.
    Koerich, Alessandro L.
    Sabourin, Robert
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 1308 - 1313
  • [18] One-Class Classification Using Generative Adversarial Networks
    Yang, Yang
    Hou, Chunping
    Lang, Yue
    Yue, Guanghui
    He, Yuan
    IEEE ACCESS, 2019, 7 : 37970 - 37979
  • [19] One-class remote sensing classification: one-class vs. binary classifiers
    Deng, Xueqing
    Li, Wenkai
    Liu, Xiaoping
    Guo, Qinghua
    Newsam, Shawn
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2018, 39 (06) : 1890 - 1910
  • [20] ON SIMPLE ONE-CLASS CLASSIFICATION METHODS
    Noumir, Zineb
    Honeine, Paul
    Richard, Cedric
    2012 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS (ISIT), 2012,