Semi-Supervised Classification of Network Data Using Very Few Labels

被引:67
|
作者
Lin, Frank [1 ]
Cohen, William W. [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
D O I
10.1109/ASONAM.2010.19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of semi-supervised learning (SSL) methods is to reduce the amount of labeled training data required by learning from both labeled and unlabeled instances. Macskassy and Provost [1] proposed the weighted-vote relational neighbor classifier (wvRN) as a simple yet effective baseline for semi-supervised learning on network data. It is similar to many recent graph-based SSL methods (e.g., [2], [3]) and is shown to be essentially the same as the Gaussian-field classifier proposed by Zhu et al. [4] and proves to be very effective on some benchmark network datasets. We describe another simple and intuitive semi-supervised learning method based on random graph walk that outperforms wvRN by a large margin on several benchmark datasets when very few labels are available. Additionally, we show that using authoritative instances as training seeds - instances that arguably cost much less to label - dramatically reduces the amount of labeled data required to achieve the same classification accuracy. For some existing state-of-the-art semi-supervised learning methods the labeled data needed is reduced by a factor of 50.
引用
收藏
页码:192 / 199
页数:8
相关论文
共 50 条
  • [41] Semi-supervised adaptive network for commutator defect detection with limited labels
    Wang, Zhenrong
    Li, Weifeng
    Wang, Miao
    Liu, Baohui
    Niu, Tongzhi
    Li, Bin
    JOURNAL OF MANUFACTURING SYSTEMS, 2024, 77 : 639 - 651
  • [42] Improved Classification with Semi-supervised Deep Belief Network
    Wang, Gongming
    Qiao, Junfei
    Li, Xiaoli
    Wang, Lei
    Qian, Xiaolong
    IFAC PAPERSONLINE, 2017, 50 (01): : 4174 - 4179
  • [43] Text Classification Using Semi-Supervised Clustering
    Zhang, Wen
    Yoshida, Taketoshi
    Tang, Xijin
    2009 INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING, PROCEEDINGS, 2009, : 197 - 200
  • [44] Network traffic classification based on semi-supervised clustering
    Information Security Center, State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China
    不详
    不详
    不详
    J. China Univ. Post Telecom., SUPPL. 2 (84-88):
  • [45] Improving Semi-Supervised Classification using Clustering
    Arora, J.
    Tushir, M.
    Kashyap, R.
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2020, 7 (25) : 1 - 9
  • [46] Community Attention Network for Semi-supervised Node Classification
    Yu, Zhongjing
    Wang, Han
    Liu, Yang
    Bohm, Christian
    Shao, Junming
    20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020, : 1382 - 1387
  • [47] SSBTCNet: Semi-Supervised Brain Tumor Classification Network
    Atha, Zubair
    Chaki, Jyotismita
    IEEE ACCESS, 2023, 11 : 141485 - 141499
  • [48] A Novel Approach for Semi-Supervised Network Traffic Classification
    Huo, Yonghua
    Song, Chunxiao
    Zhou, Meichao
    Lv, Rui
    Yang, Yang
    2022 IEEE 14TH INTERNATIONAL CONFERENCE ON ADVANCED INFOCOMM TECHNOLOGY (ICAIT 2022), 2022, : 64 - 69
  • [49] Using semi-supervised learning for question classification
    Tri, Nguyen Thanh
    Le, Nguyen Minh
    Shimazu, Akira
    COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD, 2006, 4285 : 31 - +
  • [50] Semi-supervised classification using multiple clusterings
    Yu G.X.
    Feng L.
    Yao G.J.
    Wang J.
    Wang, J. (kingjun@swu.edu.cn), 1600, Izdatel'stvo Nauka (26): : 681 - 687