Boosting Zero-Shot Learning via Contrastive Optimization of Attribute Representations

被引:7
|
作者
Du, Yu [1 ]
Shi, Miaojing [2 ]
Wei, Fangyun [3 ]
Li, Guoqi [4 ]
机构
[1] Tsinghua Univ, Dept Precis Instrument, Beijing 100084, Peoples R China
[2] Tongji Univ, Coll Elect & Informat Engn, Shanghai 201804, Peoples R China
[3] Microsoft Res Asia, Beijing 100080, Peoples R China
[4] Chinese Acad Sci, Inst Automat, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Index Terms- Attributes; contrastive learning; prototype gen-eration; transformer; zero-shot learning (ZSL);
D O I
10.1109/TNNLS.2023.3297134
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot learning (ZSL) aims to recognize classes that do not have samples in the training set. One representative solution is to directly learn an embedding function associating visual features with corresponding class semantics for recognizing new classes. Many methods extend upon this solution, and recent ones are especially keen on extracting rich features from images, e.g., attribute features. These attribute features are normally extracted within each individual image; however, the common traits for features across images yet belonging to the same attribute are not emphasized. In this article, we propose a new framework to boost ZSL by explicitly learning attribute prototypes beyond images and contrastively optimizing them with attribute-level features within images. Besides the novel architecture, two elements are highlighted for attribute representations: a new prototype generation module (PM) is designed to generate attribute prototypes from attribute semantics; a hard-example-based contrastive optimization scheme is introduced to reinforce attribute-level features in the embedding space. We explore two alternative backbones, CNN-based and transformer-based, to build our framework and conduct experiments on three standard benchmarks, Caltech-UCSD Birds-200-2011 (CUB), SUN attribute database (SUN), and animals with attributes 2 (AwA2). Results on these benchmarks demonstrate that our method improves the state of the art by a considerable margin. Our codes will be available at https://github.com/dyabel/CoAR-ZSL.git.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 50 条
  • [41] A zero-shot learning boosting framework via concept-constrained clustering
    Yue, Qin
    Cui, Junbiao
    Bai, Liang
    Liang, Jianqing
    Liang, Jiye
    PATTERN RECOGNITION, 2024, 145
  • [42] Attribute Attention for Semantic Disambiguation in Zero-Shot Learning
    Liu, Yang
    Guo, Jishun
    Cai, Deng
    He, Xiaofei
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6697 - 6706
  • [43] Generating visual representations for zero-shot learning via adversarial learning and variational autoencoders
    Gull, Muqaddas
    Arif, Omar
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2023, 52 (05) : 636 - 651
  • [44] Model-Agnostic Zero-Shot Intent Detection via Contrastive Transfer Learning
    Maqbool, M. H.
    Fereidouni, Moghis
    Siddique, A. B.
    Foroosh, Hassan
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2024, 18 (01) : 5 - 24
  • [45] Learning Invariant Visual Representations for Compositional Zero-Shot Learning
    Zhang, Tian
    Liang, Kongming
    Du, Ruoyi
    Sun, Xian
    Ma, Zhanyu
    Guo, Jun
    COMPUTER VISION, ECCV 2022, PT XXIV, 2022, 13684 : 339 - 355
  • [46] Unleashing the Power of Contrastive Learning for Zero-Shot Video Summarization
    Pang, Zongshang
    Nakashima, Yuta
    Otani, Mayu
    Nagahara, Hajime
    JOURNAL OF IMAGING, 2024, 10 (09)
  • [47] Enhancing Zero-Shot Stance Detection with Contrastive and Prompt Learning
    Yao, Zhenyin
    Yang, Wenzhong
    Wei, Fuyuan
    ENTROPY, 2024, 26 (04)
  • [48] Dual Prototype Contrastive Network for Generalized Zero-Shot Learning
    Jiang, Huajie
    Li, Zhengxian
    Hu, Yongli
    Yin, Baocai
    Yang, Jian
    van den Hengel, Anton
    Yang, Ming-Hsuan
    Qi, Yuankai
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (02) : 1111 - 1122
  • [49] Contrastive visual feature filtering for generalized zero-shot learning
    Meng, Shixuan
    Jiang, Rongxin
    Tian, Xiang
    Zhou, Fan
    Chen, Yaowu
    Liu, Junjie
    Shen, Chen
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024,
  • [50] Siamese Contrastive Embedding Network for Compositional Zero-Shot Learning
    Li, Xiangyu
    Yang, Xu
    Wei, Kun
    Deng, Cheng
    Yang, Muli
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9316 - 9325