Semi-Supervised Classification of Network Data Using Very Few Labels

Cited by: 67
Authors
Lin, Frank [1]
Cohen, William W. [1]
Affiliations
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
DOI
10.1109/ASONAM.2010.19
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The goal of semi-supervised learning (SSL) methods is to reduce the amount of labeled training data required by learning from both labeled and unlabeled instances. Macskassy and Provost [1] proposed the weighted-vote relational neighbor classifier (wvRN) as a simple yet effective baseline for semi-supervised learning on network data. It is similar to many recent graph-based SSL methods (e.g., [2], [3]), has been shown to be essentially the same as the Gaussian-field classifier proposed by Zhu et al. [4], and proves to be very effective on some benchmark network datasets. We describe another simple and intuitive semi-supervised learning method, based on random graph walks, that outperforms wvRN by a large margin on several benchmark datasets when very few labels are available. Additionally, we show that using authoritative instances as training seeds (instances that arguably cost much less to label) dramatically reduces the amount of labeled data required to achieve the same classification accuracy. For some existing state-of-the-art semi-supervised learning methods, the labeled data needed is reduced by a factor of 50.
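The abstract does not spell out the random-walk method, so the following is only a minimal sketch of one common way to classify nodes by random graph walk: run a random walk with restart from each class's labeled seeds and assign every node to the class whose walk scores it highest. The function name `random_walk_classify`, the damping value, the iteration count, and the toy graph are all assumptions made for illustration, not details taken from the paper.

```python
def random_walk_classify(adj, seeds, damping=0.85, iters=100):
    """Illustrative sketch (not the paper's exact algorithm).

    adj:   dict mapping each node to a list of its neighbors (undirected).
    seeds: dict mapping each class label to a list of labeled seed nodes.
    Returns a dict mapping each node to its predicted class label.
    """
    nodes = list(adj)
    scores = {}
    for label, seed_nodes in seeds.items():
        # Restart distribution: uniform over this class's seed nodes.
        restart = {n: 0.0 for n in nodes}
        for s in seed_nodes:
            restart[s] = 1.0 / len(seed_nodes)
        rank = dict(restart)
        for _ in range(iters):
            # With probability (1 - damping) the walker jumps back to a seed.
            nxt = {n: (1.0 - damping) * restart[n] for n in nodes}
            # Otherwise it moves to a uniformly chosen neighbor.
            for n in nodes:
                if adj[n]:
                    share = damping * rank[n] / len(adj[n])
                    for m in adj[n]:
                        nxt[m] += share
            rank = nxt
        scores[label] = rank
    # Each node takes the class whose walk gives it the highest score.
    return {n: max(scores, key=lambda c: scores[c][n]) for n in nodes}


# Toy graph (hypothetical): two triangles joined by the single edge c-d,
# with one labeled seed per class.
adj = {
    "a": ["b", "c"], "b": ["a", "c"], "c": ["a", "b", "d"],
    "d": ["c", "e", "f"], "e": ["d", "f"], "f": ["d", "e"],
}
seeds = {"A": ["a"], "B": ["f"]}
pred = random_walk_classify(adj, seeds)
```

With a single seed per class, each node in this toy graph is assigned to the class of the seed in its own triangle, which mirrors the "very few labels" setting the abstract describes.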
Pages: 192-199
Page count: 8
Related papers
50 in total
  • [1] SOIL ANALYSIS WITH VERY FEW LABELS USING SEMI-SUPERVISED HYPERSPECTRAL IMAGE CLASSIFICATION
    Grabowski, Bartosz
    Wijata, Agata M.
    Tulczyjew, Lukasz
    Le Saux, Bertrand
    Nalepa, Jakub
    IGARSS 2024-2024 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, IGARSS 2024, 2024, : 407 - 411
  • [2] Contrast-Enhanced Semi-supervised Text Classification with Few Labels
    Tsai, Austin Cheng-Yun
    Lin, Sheng-Ya
    Fu, Li-Chen
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 11394 - 11402
  • [3] SEMI-SUPERVISED HANDWRITTEN DIGIT RECOGNITION USING VERY FEW LABELED DATA
    Van Vaerenbergh, Steven
    Santamaria, Ignacio
    Barbano, Paolo Emilio
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 2136 - 2139
  • [4] Labels diffusion on graphs: Application to semi-supervised segmentation and data classification
    Diffusion de labels sur graphe: Application à la segmentation semi-supervisée et à la classification de données
    1600, Lavoisier (27): : 299 - 320
  • [5] Multi-MCCR: Multiple models regularization for semi-supervised text classification with few labels
    Zhou, Nai
    Yao, Nianmin
    Li, Qibin
    Zhao, Jian
    Zhang, Yanan
    KNOWLEDGE-BASED SYSTEMS, 2023, 272
  • [6] Semi-Supervised Network Traffic Classification
    Erman, Jeffrey
    Mahanti, Anirban
    Arlitt, Martin
    Cohen, Ira
    Williamson, Carey
    SIGMETRICS'07: PROCEEDINGS OF THE 2007 INTERNATIONAL CONFERENCE ON MEASUREMENT & MODELING OF COMPUTER SYSTEMS, 2007, 35 (01): : 369 - 370
  • [7] Data Augmentation for Graph Convolutional Network on Semi-supervised Classification
    Tang, Zhengzheng
    Qiao, Ziyue
    Hong, Xuehai
    Wang, Yang
    Dharejo, Fayaz Ali
    Zhou, Yuanchun
    Du, Yi
    WEB AND BIG DATA, APWEB-WAIM 2021, PT II, 2021, 12859 : 33 - 48
  • [8] Semi-Supervised Deep Learning Using Pseudo Labels for Hyperspectral Image Classification
    Wu, Hao
    Prasad, Saurabh
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (03) : 1259 - 1270
  • [9] Dual-Training-Based Semi-Supervised Learning with Few Labels
    Wu, Hao
    Sun, Jun
    Chen, Qidong
    APPLIED SCIENCES-BASEL, 2024, 14 (12):
  • [10] Few Labels are Enough! Semi-supervised Graph Learning for Social Interaction
    Corbellini, Nicola
    Giraldo, Jhony H.
    Varni, Giovanna
    Volpe, Gualtiero
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 3052 - 3060