Large-scale structural learning and predicting via hashing approximation

被引:0
|
作者
Dandan Chen
Yingjie Tian
机构
[1] University of Chinese Academy of Sciences,School of Mathematical Sciences
[2] Chinese Academy of Sciences,Research Center on Fictitious Economy and Data Science, Key Laboratory of Big Data Mining and Knowledge Management
来源
关键词
Nonparallel support vector machine; Structural information; Locality-sensitive hashing; Minwise hashing;
D O I
暂无
中图分类号
学科分类号
摘要
By combining the structural information with nonparallel support vector machine, structural nonparallel support vector machine (SNPSVM) can fully exploit prior knowledge to directly improve the algorithm’s generalization capacity. However, the scalability issue how to train SNPSVM efficiently on data with huge dimensions has not been studied. In this paper, we integrate linear SNPSVM with b-bit minwise hashing scheme to speedup the training phase for large-scale and high-dimensional statistical learning, and then we address the problem of speeding-up its prediction phase via locality-sensitive hashing. For one-against-one multi-class classification problems, a two-stage strategy is put forward: a series of hash-based classifiers are built in order to approximate the exact results and filter the hypothesis space in the first stage and then the classification can be refined by solving a multi-class SNPSVM on the remaining classes in the second stage. The proposed method can deal with large-scale classification problems with a huge number of features. Experimental results on two large-scale datasets (i.e., news20 and webspam) demonstrate the efficiency of structural learning via b-bit minwise hashing. Experimental results on the ImageNet-BOF dataset, and several large-scale UCI datasets show that the proposed hash-based prediction can be more than two orders of magnitude faster than the exact classifier with minor losses in quality.
引用
收藏
页码:2889 / 2903
页数:14
相关论文
共 50 条
  • [1] Large-scale structural learning and predicting via hashing approximation
    Chen, Dandan
    Tian, Yingjie
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (07): : 2889 - 2903
  • [2] Large-Scale Video Hashing via Structure Learning
    Ye, Guangnan
    Liu, Dong
    Wang, Jun
    Chang, Shih-Fu
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 2272 - 2279
  • [3] Large-Scale Unsupervised Hashing with Shared Structure Learning
    Liu, Xianglong
    Mu, Yadong
    Zhang, Danchen
    Lang, Bo
    Li, Xuelong
    IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (09) : 1811 - 1822
  • [4] Large-Scale Linear NPSVM via One Permutation Hashing
    Tang, Jingjing
    Tian, Yingjie
    Liu, Dalian
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [5] Large-Scale Distributed Learning via Private On-Device Locality-Sensitive Hashing
    Rabbani, Tahseen
    Bornstein, Marco
    Huang, Furong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [6] Embedding Compression with Hashing for Efficient Representation Learning in Large-Scale Graph
    Yeh, Chin-Chia Michael
    Gu, Mengting
    Zheng, Yan
    Chen, Huiyuan
    Ebrahimi, Javid
    Zhuang, Zhongfang
    Wang, Junpeng
    Wang, Liang
    Zhang, Wei
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4391 - 4401
  • [7] f-Fractional Bit Minwise Hashing for Large-Scale Learning
    Tang, Jingjing
    Tian, Yingjie
    2015 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT), VOL 3, 2015, : 60 - 63
  • [8] Approximation Vector Machines for Large-scale Online Learning
    Trung Le
    Tu Dinh Nguyen
    Vu Nguyen
    Dinh Phung
    JOURNAL OF MACHINE LEARNING RESEARCH, 2017, 18
  • [9] Large-scale high-dimensional indexing by sparse hashing with l 0 approximation
    Borges, Pedro
    Mourao, Andre
    Magalhaes, Joao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (22) : 24389 - 24412
  • [10] SLINGER: large-scale learning for predicting gene expression
    Vervier, Kevin
    Michaelson, Jacob J.
    SCIENTIFIC REPORTS, 2016, 6