Class-Aware Pseudo-Labeling for Non-Random Missing Labels in Semi-Supervised Learning

被引:0
|
作者
Gui, Qian [1 ]
Wu, Xinting [1 ]
Niu, Baoning [1 ]
机构
[1] Taiyuan Univ Technol, Sch Informat & Comp, Taiyuan, Peoples R China
关键词
Semi-supervised learning; missing label not at random;
D O I
10.1142/S1793351X23640018
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semi-supervised learning (SSL) is a classic missing label problem. Existing SSL algorithms always rely on the basic assumption, label missing completely at random (MCAR), where both labeled and unlabeled data share the same class distribution. Compared to MCAR, the label missing not at random (MNAR) problem is more realistic. In MNAR, the labeled and unlabeled data have different class distributions resulting in biased label imputation, which leads to the performance degradation of SSL models. Existing SSL algorithms can hardly perform well on tail classes (the classes with few training examples) in MNAR setting, since the pseudo-labels learned from unlabeled data tend to be biased toward head classes (the classes with a large number of training examples). To alleviate this issue, we propose a class-aware pseudo-labeling (CAPL) for non-random missing labels in SSL, which utilizes the unlabeled data by dynamically adjusting the threshold for selecting pseudo-labels. Under various MNAR settings, our method achieves up to 15.0% overall accuracy gain upon FixMatch in CIFAR-10 compared with existing baselines.
引用
收藏
页码:531 / 543
页数:13
相关论文
共 50 条
  • [41] Pseudo-Labeling Based Practical Semi-Supervised Meta-Training for Few-Shot Learning
    Dong, Xingping
    Ouyang, Tianran
    Liao, Shengcai
    Du, Bo
    Shao, Ling
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 5663 - 5675
  • [42] Semi-Supervised Learning on an Augmented Graph with Class Labels
    Li, Nan
    Latecki, Longin Jan
    ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 1571 - 1572
  • [43] CRMSP: A semi-supervised approach for key information extraction with Class-Rebalancing and Merged Semantic Pseudo-Labeling
    Zhang, Qi
    Song, Yonghong
    Guo, Pengcheng
    Hui, Yangyang
    NEUROCOMPUTING, 2025, 616
  • [44] Semi-Supervised Learning of Semantic Correspondence with Pseudo-Labels
    Kim, Jiwon
    Ryoo, Kwangrok
    Seo, Junyoung
    Lee, Gyuseong
    Kim, Daehwan
    Cho, Hansang
    Kim, Seungryong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19667 - 19677
  • [45] MAPLE: Masked Pseudo-Labeling autoEncoder for Semi-supervised Point Cloud Action Recognition
    Chen, Xiaodong
    Liu, Wu
    Liu, Xinchen
    Zhang, Yongdong
    Han, Jungong
    Mei, Tao
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
  • [46] Striving for Simplicity: Simple Yet Effective Prior-Aware Pseudo-labeling for Semi-supervised Ultrasound Image Segmentation
    Chen, Yaxiong
    Wang, Yujie
    Zheng, Zixuan
    Hu, Jingliang
    Shi, Yilei
    Xiong, Shengwu
    Zhu, Xiao Xiang
    Mou, Lichao
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT IX, 2024, 15009 : 604 - 614
  • [47] Graph Segmentation-Based Pseudo-Labeling for Semi-Supervised Pathology Image Classification
    Shin, Hong-Kyu
    Uhmn, Kwang-Hyun
    Choi, Kyuyeon
    Xu, Zhixin
    Jung, Seung-Won
    Ko, Sung-Jea
    IEEE ACCESS, 2022, 10 : 93960 - 93970
  • [48] SEMI-SUPERVISED 3D OBJECT DETECTION VIA ADAPTIVE PSEUDO-LABELING
    Xu, Hongyi
    Liu, Fengqi
    Zhou, Qianyu
    Hao, Jinkun
    Cao, Zhijie
    Feng, Zhengyang
    Ma, Lizhuang
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3183 - 3187
  • [49] Pseudo-Labeling Optimization Based Ensemble Semi-Supervised Soft Sensor in the Process Industry
    Li, Youwei
    Jin, Huaiping
    Dong, Shoulong
    Yang, Biao
    Chen, Xiangguang
    SENSORS, 2021, 21 (24)
  • [50] CAWM: Class-Aware Weight Map for Improved Semi-Supervised Nuclei Segmentation
    Lim, Seohoon
    Xu, Zhixin
    Chong, Yosep
    Jung, Seung-Won
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 81 - 85