Hashing for Adaptive Real-Time Graph Stream Classification With Concept Drifts

Cited by: 16
Authors
Chi, Lianhua [1]
Li, Bin [2]
Zhu, Xingquan [3, 4]
Pan, Shirui [5]
Chen, Ling [5]
Affiliations
[1] IBM Res Australia, Melbourne Res Lab, Southbank, Vic 3006, Australia
[2] CSIRO, Data61, Eveleigh, NSW 2015, Australia
[3] Florida Atlantic Univ, Dept Comp & Elect Engn & Comp Sci, Boca Raton, FL 33431 USA
[4] Fudan Univ, Sch Comp Sci, Shanghai 201203, Peoples R China
[5] Univ Technol Sydney, Broadway, NSW 2007, Australia
Keywords
Cliques; concept drifts; graph stream classification; hashing; kernels
DOI
10.1109/TCYB.2017.2708979
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Many applications involve processing networked streaming data in a timely manner. Graph stream classification aims to learn a classification model from a stream of graphs in a single pass over the data, which requires real-time processing in both training and prediction. This is a nontrivial task: many existing methods require multiple passes over the graph stream to extract subgraph structures as features for graph classification, and so cannot satisfy the "one-pass" and "real-time" requirements simultaneously. In this paper, we propose an adaptive real-time graph stream classification method to address this challenge. We partition the unbounded graph stream into consecutive graph chunks, each consisting of a fixed number of graphs and yielding a corresponding chunk-level classifier. We employ a random hashing function to compress the original node set of the graphs in each chunk for fast feature detection when training chunk-level classifiers. Furthermore, a differential hashing strategy maps the unbounded, ever-growing set of features (i.e., cliques) into a fixed-size feature space, which is then used as the feature vector for stochastic learning. Finally, the chunk-level classifiers are weighted in an ensemble learning model for graph classification. The proposed method substantially speeds up graph feature extraction and avoids unbounded growth of graph features. Moreover, it effectively offsets concept drifts in graph stream classification. Experiments on real-world and synthetic graph streams demonstrate that our method significantly outperforms existing methods in both classification accuracy and learning efficiency.
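The abstract does not spell out the hashing functions, so the following is only a rough sketch of the general idea, not the authors' method. It assumes an MD5-based bucket hash in place of the paper's random hashing, and substitutes the ordinary hashing trick (counting cliques per bucket) for the differential hashing strategy. All function names and parameters (`stable_hash`, `compress_nodes`, `triangles`, `hashed_feature_vector`, `num_node_buckets`, `dim`) are hypothetical, and only cliques of size 2 and 3 are enumerated.

```python
import hashlib


def stable_hash(key: str, buckets: int) -> int:
    """Deterministically map a string key to a bucket in [0, buckets)."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % buckets


def compress_nodes(edges, num_node_buckets):
    """Hash node labels into a smaller id space (assumed stand-in for the
    paper's random hashing); edges whose endpoints collide are dropped."""
    compressed = set()
    for u, v in edges:
        hu = stable_hash(f"node:{u}", num_node_buckets)
        hv = stable_hash(f"node:{v}", num_node_buckets)
        if hu != hv:
            compressed.add((min(hu, hv), max(hu, hv)))
    return compressed


def triangles(edges):
    """Enumerate 3-cliques of an undirected graph given as an edge set."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    found = set()
    for u, v in edges:
        for w in adj[u] & adj[v]:
            found.add(tuple(sorted((u, v, w))))
    return found


def hashed_feature_vector(edges, num_node_buckets=16, dim=8):
    """Count 2- and 3-cliques of the compressed graph in a fixed-size vector
    (plain hashing trick, not the paper's differential hashing)."""
    compressed = compress_nodes(edges, num_node_buckets)
    vec = [0] * dim
    for clique in list(compressed) + list(triangles(compressed)):
        key = "clique:" + "-".join(str(n) for n in clique)
        vec[stable_hash(key, dim)] += 1
    return vec


if __name__ == "__main__":
    graph = [("a", "b"), ("b", "c"), ("a", "c"), ("c", "d")]
    print(hashed_feature_vector(graph, num_node_buckets=1024, dim=8))
```

The vector length stays at `dim` no matter how many distinct cliques the stream produces, which is the property the abstract relies on to avoid unbounded feature growth. A chunk-level classifier could then be trained on such vectors, but that step is not sketched here.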
Pages: 1591-1604
Number of pages: 14