Transductive zero-shot learning with generative model-driven structure alignment

被引:3
|
作者
Liu, Yang [1 ,2 ]
Tao, Keda [1 ]
Tian, Tianhui [1 ]
Gao, Xinbo [3 ]
Han, Jungong [4 ]
Shao, Ling [5 ]
机构
[1] Xidian Univ, Sch Telecommun Engn, Xian 710071, Shaanxi, Peoples R China
[2] Shanghai Key Lab Comp Software Evaluating & Testin, Shanghai, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China
[4] Univ Sheffield, Dept Comp Sci, Sheffield S10 2TN, Yorks, England
[5] Univ Chinese Acad Sci, UCAS Terminus AI Lab, Beijing 100190, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Domain shift; Transductive zero-shot learning; Structure alignment;
D O I
10.1016/j.patcog.2024.110561
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero -shot learning (ZSL) facilitates the transfer of knowledge from seen to unseen categories through highdimensional vectors that capture both known and unknown class names. However it encounters challenges with domain shift arising from a lack of sufficient labeled data. Although transductive zero -shot learning (TZSL) addresses this bias by including samples from unseen classes, it still faces obstacles in enhancing TZSL performance. In this study, We introduce the Structure Alignment Variational Autoencoder Generative Adversarial Network (SA-VAEGAN), a novel approach that enhances the alignment between visual and auxiliary spaces. We delved into the underlying causes of domain shift and introduced a structural alignment (SA) strategy to tackle these challenges. The SA model thoroughly accounts for both inter -class and intra-class dynamics, designed to leverage the model's comprehension of high-level semantic relations to disambiguate confusion among similar classes and mitigate intra-class confusion by penalizing atypical visual samples within classes. Assessed across four benchmark datasets, SA-VAEGAN has established a new performance standard, underscoring its efficiency in addressing the domain shift challenge within TZSL tasks, and achieving high accuracy.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Bidirectional generative transductive zero-shot learning
    Xinpeng Li
    Dan Zhang
    Mao Ye
    Xue Li
    Qiang Dou
    Qiao Lv
    Neural Computing and Applications, 2021, 33 : 5313 - 5326
  • [2] Bidirectional generative transductive zero-shot learning
    Li, Xinpeng
    Zhang, Dan
    Ye, Mao
    Li, Xue
    Dou, Qiang
    Lv, Qiao
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (10): : 5313 - 5326
  • [3] Transductive Zero-Shot Learning with Visual Structure Constraint
    Wan, Ziyu
    Chen, Dongdong
    Li, Yan
    Yan, Xingguang
    Zhang, Junge
    Yu, Yizhou
    Liao, Jing
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [4] VMAN: A Virtual Mainstay Alignment Network for Transductive Zero-Shot Learning
    Xie, Guo-Sen
    Zhang, Xu-Yao
    Yao, Yazhou
    Zhang, Zheng
    Zhao, Fang
    Shao, Ling
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4316 - 4329
  • [5] Bi-directional Distribution Alignment for Transductive Zero-Shot Learning
    Wang, Zhicai
    Hao, Yanbin
    Mu, Tingting
    Li, Ouxiang
    Wang, Shuo
    He, Xiangnan
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19893 - 19902
  • [6] Visual Structure Constraint for Transductive Zero-Shot Learning in the Wild
    Ziyu Wan
    Dongdong Chen
    Jing Liao
    International Journal of Computer Vision, 2021, 129 : 1893 - 1909
  • [7] Visual Structure Constraint for Transductive Zero-Shot Learning in the Wild
    Wan, Ziyu
    Chen, Dongdong
    Liao, Jing
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (06) : 1893 - 1909
  • [8] A Joint Generative Model for Zero-Shot Learning
    Gao, Rui
    Hou, Xingsong
    Qin, Jie
    Liu, Li
    Zhu, Fan
    Zhang, Zhao
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 631 - 646
  • [9] Adversarial strategy for transductive zero-shot learning
    Liu, Youfa
    Du, Bo
    Ni, Fuchuan
    INFORMATION SCIENCES, 2021, 578 : 750 - 761
  • [10] Transductive Learning for Zero-Shot Object Detection
    Rahman, Shafin
    Khan, Salman
    Barnes, Nick
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6081 - 6090