Cluster-aware Semi-supervised Learning: Relational Knowledge Distillation Provably Learns Clustering

被引:0
|
作者
Dong, Yijun [1 ]
Miller, Kevin [2 ]
Lei, Qi [1 ,3 ]
Ward, Rachel [2 ]
机构
[1] NYU, Courant Inst Math Sci, New York, NY 10003 USA
[2] Univ Texas Austin, Oden Inst Computat Engn & Sci, Austin, TX USA
[3] NYU, Ctr Data Sci, New York, NY USA
关键词
MATRIX;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the empirical success and practical significance of (relational) knowledge distillation that matches (the relations of) features between teacher and student models, the corresponding theoretical interpretations remain limited for various knowledge distillation paradigms. In this work, we take an initial step toward a theoretical understanding of relational knowledge distillation (RKD), with a focus on semi-supervised classification problems. We start by casting RKD as spectral clustering on a population-induced graph unveiled by a teacher model. Via a notion of clustering error that quantifies the discrepancy between the predicted and ground truth clusterings, we illustrate that RKD over the population provably leads to low clustering error. Moreover, we provide a sample complexity bound for RKD with limited unlabeled samples. For semi-supervised learning, we further demonstrate the label efficiency of RKD through a general framework of cluster-aware semi-supervised learning that assumes low clustering errors. Finally, by unifying data augmentation consistency regularization into this cluster-aware framework, we show that despite the common effect of learning accurate clusterings, RKD facilitates a "global" perspective through spectral clustering, whereas consistency regularization focuses on a "local" perspective via expansion.
引用
收藏
页数:33
相关论文
共 50 条
  • [21] Semi-Supervised Blind Image Quality Assessment through Knowledge Distillation and Incremental Learning
    Pan, Wensheng
    Gao, Timin
    Zhang, Yan
    Zheng, Xiawu
    Shen, Yunhang
    Li, Ke
    Hu, Runze
    Liu, Yutao
    Dai, Pingyang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5, 2024, : 4388 - 4396
  • [22] SeDPGK: Semi-supervised software defect prediction with graph representation learning and knowledge distillation
    Liu, Wangshu
    Yue, Ye
    Chen, Xiang
    Gu, Qing
    Zhao, Pengzhan
    Liu, Xuejun
    Zhao, Jianjun
    INFORMATION AND SOFTWARE TECHNOLOGY, 2024, 174
  • [23] TOWARDS GENERALIZABLE DEEPFAKE FACE FORGERY DETECTION WITH SEMI-SUPERVISED LEARNING AND KNOWLEDGE DISTILLATION
    Lin, Yuzhen
    Chen, Han
    Li, Bin
    Wu, Junqiang
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 576 - 580
  • [24] Semi-supervised clustering using incomplete prior knowledge
    Wang, Chao
    Chen, Weijun
    Yin, Peipei
    Wang, Jianmin
    COMPUTATIONAL SCIENCE - ICCS 2007, PT 1, PROCEEDINGS, 2007, 4487 : 192 - +
  • [25] Semi-Supervised Knowledge Distillation for Cross-Modal Hashing
    Su, Mingyue
    Gu, Guanghua
    Ren, Xianlong
    Fu, Hao
    Zhao, Yao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 662 - 675
  • [26] Ensemble Knowledge Distillation for Federated Semi-Supervised Image Classification
    Shang, Ertong
    Liu, Hui
    Zhang, Jingyang
    Zhao, Runqi
    Du, Junzhao
    TSINGHUA SCIENCE AND TECHNOLOGY, 2025, 30 (01): : 112 - 123
  • [27] EEG-Oriented Self-Supervised Learning and Cluster-Aware Adaptation
    Ko, Wonjun
    Suk, Heung-Il
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 4143 - 4147
  • [28] Integrating distance metric learning and cluster-level constraints in semi-supervised clustering
    Nogueira, Bruno Magalhaes
    Benevides Tomas, Yuri Karan
    Marcacini, Ricardo Marcondes
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 4118 - 4125
  • [29] Learning sample-aware threshold for semi-supervised learning
    Wei, Qi
    Feng, Lei
    Sun, Haoliang
    Wang, Ren
    He, Rundong
    Yin, Yilong
    MACHINE LEARNING, 2024, 113 (08) : 5423 - 5445
  • [30] Uncertainty Aware Semi-Supervised Learning on Graph Data
    Zhao, Xujiang
    Chen, Feng
    Hu, Shu
    Cho, Jin-Hee
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33