Self-supervised learning of pseudo classes for generalized zero-shot fine-grained recognition

Authors
Chen Y.-H. [1]
Yeh M.-C. [1]
Affiliations
[1] Department of Computer Science and Information Engineering, National Taiwan Normal University, No. 88, Sec. 4, Tingzhou Rd., Taipei
Keywords
Generalized zero-shot learning; Generative modeling; Self-supervised learning; Visual-semantic embedding
DOI
10.1007/s11042-024-19266-w
Abstract
Generalized zero-shot learning (GZSL) attempts to recognize visual instances from both seen and unseen classes by transferring knowledge from seen classes to unseen classes through semantic information (e.g., attributes). Generative methods are commonly employed to alleviate the issue of extreme data imbalance in which visual samples from unseen classes are not available during training, by synthesizing training samples for unseen classes from class prototypes. However, in the context of GZSL applied to fine-grained recognition, a notable complication arises. Similar class prototypes among different categories lead to ambiguity when generating synthetic data for classification. In response, we present a novel solution: a self-supervised pseudo-labeling (SSPL) module designed to enhance the generation of discerning synthetic data. This enhancement is achieved through an unsupervised grouping of fake and real samples using pseudo classes. By doing so, the SSPL module addresses the challenge of generating discriminative fake data, ultimately improving the overall quality of synthesized samples for classification. Our experimental results, conducted on three widely recognized GZSL datasets, demonstrate the effectiveness of the proposed method. Notably, the SSPL module not only produces well-distributed synthetic samples, but also enhances the discriminative and generalizable visual features derived from both real and synthetic samples within the GZSL framework. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
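The abstract describes the SSPL module only at a high level: real and synthetic ("fake") samples are grouped without supervision, and the resulting groups serve as pseudo classes. The sketch below illustrates that general idea under loud assumptions. Plain k-means is swapped in as the grouping step, and the function name `assign_pseudo_classes` and its parameters are hypothetical. The abstract does not say which clustering or training objective the authors use, so this should not be read as their method.

```python
import numpy as np

def assign_pseudo_classes(real_feats, fake_feats, num_pseudo, iters=20, seed=0):
    """Jointly cluster real and synthetic features with plain k-means.

    Assumption: k-means stands in for the unspecified unsupervised grouping.
    Returns one pseudo-class id per real sample and per synthetic sample.
    """
    rng = np.random.default_rng(seed)
    feats = np.concatenate([real_feats, fake_feats], axis=0)
    # Initialise centroids from randomly chosen samples (fancy indexing copies).
    centroids = feats[rng.choice(len(feats), size=num_pseudo, replace=False)]

    def nearest(points, centers):
        # Squared Euclidean distance from every sample to every centroid.
        dists = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=-1)
        return dists.argmin(axis=1)

    for _ in range(iters):
        labels = nearest(feats, centroids)
        for k in range(num_pseudo):
            members = feats[labels == k]
            if len(members) > 0:  # keep the old centroid if a cluster is empty
                centroids[k] = members.mean(axis=0)

    labels = nearest(feats, centroids)
    n_real = len(real_feats)
    return labels[:n_real], labels[n_real:]

# Toy usage: two well-separated groups of 4-dimensional features.
demo_rng = np.random.default_rng(1)
real = np.concatenate([demo_rng.normal(0, 0.1, (10, 4)),
                       demo_rng.normal(10, 0.1, (10, 4))])
fake = np.concatenate([demo_rng.normal(0, 0.1, (5, 4)),
                       demo_rng.normal(10, 0.1, (5, 4))])
real_ids, fake_ids = assign_pseudo_classes(real, fake, num_pseudo=2)
```

In a setup like this, the pseudo-class ids could act as an auxiliary classification target for both real and generated features, which is one plausible way to encourage synthetic samples to fall into the same groups as real ones. Whether the paper uses the ids this way is not stated in the abstract.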
Pages: 7915-7930
Page count: 15