Semi-Supervised Classification of Network Data Using Very Few Labels

被引:67
|
作者
Lin, Frank [1 ]
Cohen, William W. [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
关键词
D O I
10.1109/ASONAM.2010.19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of semi-supervised learning (SSL) methods is to reduce the amount of labeled training data required by learning from both labeled and unlabeled instances. Macskassy and Provost [1] proposed the weighted-vote relational neighbor classifier (wvRN) as a simple yet effective baseline for semi-supervised learning on network data. It is similar to many recent graph-based SSL methods (e.g., [2], [3]) and is shown to be essentially the same as the Gaussian-field classifier proposed by Zhu et al. [4] and proves to be very effective on some benchmark network datasets. We describe another simple and intuitive semi-supervised learning method based on random graph walk that outperforms wvRN by a large margin on several benchmark datasets when very few labels are available. Additionally, we show that using authoritative instances as training seeds - instances that arguably cost much less to label - dramatically reduces the amount of labeled data required to achieve the same classification accuracy. For some existing state-of-the-art semi-supervised learning methods the labeled data needed is reduced by a factor of 50.
引用
收藏
页码:192 / 199
页数:8
相关论文
共 50 条
  • [31] Barely-Supervised Learning: Semi-supervised Learning with Very Few Labeled Images
    Lucas, Thomas
    Weinzaepfel, Philippe
    Rogez, Gregory
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1881 - 1889
  • [32] A Data-Centric Approach for Improving Ambiguous Labels with Combined Semi-supervised Classification and Clustering
    Schmarje, Lars
    Santarossa, Monty
    Schroeder, Simon-Martin
    Zelenka, Claudius
    Kiko, Rainer
    Stracke, Jenny
    Volkmann, Nina
    Koch, Reinhard
    COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 363 - 380
  • [33] Semi-supervised clustering with soft labels
    Nebu, Cynthia Marea
    Joseph, Sumy
    2015 INTERNATIONAL CONFERENCE ON CONTROL COMMUNICATION & COMPUTING INDIA (ICCC), 2015, : 612 - 616
  • [34] Traffic sign classification via Semi-Supervised model with uncertain labels
    Yang, Luhui
    Liu, Qing
    Yang, Yun
    Yang, Po
    2020 IEEE 18TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), VOL 1, 2020, : 926 - 931
  • [35] Semi-supervised learning with pseudo-negative labels for image classification
    Xu, Hao
    Xiao, Hui
    Hao, Huazheng
    Dong, Li
    Qiu, Xiaojie
    Peng, Chengbin
    KNOWLEDGE-BASED SYSTEMS, 2023, 260
  • [36] Balanced and Accurate Pseudo-Labels for Semi-Supervised Image Classification
    Zhao, Jian
    Liu, Xianhui
    Zhao, Weidong
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (03)
  • [37] Learning image features with fewer labels using a semi-supervised deep convolutional network
    dos Santos, Fernando P.
    Zor, Cemre
    Kittler, Josef
    Ponti, Moacir A.
    NEURAL NETWORKS, 2020, 132 : 131 - 143
  • [38] Semi-supervised classification of iEEG using temporal autoencoder neural network
    Nejedly, P.
    Kremen, V.
    Lepkova, K.
    Mivalt, F.
    Sladky, V.
    Balzekas, I.
    Pridalova, T.
    Klimes, P.
    Plesinger, F.
    Brazdil, M.
    Jurak, P.
    Worrell, G.
    EPILEPSIA, 2022, 63 : 81 - 82
  • [39] Semi-supervised Fuzzy Min-Max Neural Network for Data Classification
    Liu, Jinhai
    Ma, Yanjuan
    Qu, Fuming
    Zang, Dong
    NEURAL PROCESSING LETTERS, 2020, 51 (02) : 1445 - 1464
  • [40] A Generative Bayesian Graph Attention Network for Semi-supervised Classification on Scarce Data
    Sun, Zhongtian
    Harit, Anoushka
    Yu, Jialin
    Cristea, Alexandra, I
    Al Moubayed, Noura
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,