ADAPTIVE MULTI-SCALE SEMANTIC FUSION NETWORK FOR ZERO-SHOT LEARNING

被引:0
|
作者
Song, Jing [1 ]
Peng, Peixi [2 ]
Zhai, Yunpeng [1 ]
Zhang, Chong [1 ]
Tian, Yonghong [2 ]
机构
[1] Peking Univ, Shenzhen Grad Sch, Shenzhen, Peoples R China
[2] Peking Univ, Beijing, Peoples R China
关键词
Multi-scale; attribute attention; Semantic fusion; global and local semantic attributes; class-center triplet loss;
D O I
10.1109/ICMEW53276.2021.9455945
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Zero-shot learning aims at accurately recognizing unseen objects by learning matrices that bridge the gap between visual information and semantic attributes. Existing approaches predominantly focus on learning the proper mapping function for visual-semantic embedding while neglecting the effect of learning discriminative semantic features, which leads to severe semantic ambiguity. We propose a practical Adaptive Multi-scale Semantic Fusion (AMSF) framework to perform object-based multi-scale attribute attention for semantic disambiguation. Considering both low-level visual information and global class-level features that relate to this ambiguity, the proposed method jointly learns cooperative global and local semantic attributes from different scales. Moreover, with the joint supervision of embedding softmax loss and class-center triplet loss, the model is encouraged to learn high discriminative semantic features and visual features with high interclass dispersion and infra-class compactness. The method is evaluated on CUB, AwA2, and SUN datasets, and the experimental results indicate the method achieves state-of-the-art performance.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Semantic guided knowledge graph for large-scale zero-shot learning
    Wei, Jiwei
    Sun, Haotian
    Yang, Yang
    Xu, Xing
    Li, Jingjing
    Shen, Heng Tao
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 88
  • [22] Multidomain Features Fusion for Zero-Shot Learning
    Liu, Zhihao
    Zeng, Zhigang
    Lian, Cheng
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2020, 4 (06): : 764 - 773
  • [23] Multi-Scale Adaptive Task Attention Network for Few-Shot Learning
    Chen, Haoxing
    Li, Huaxiong
    Li, Yaohui
    Chen, Chunlin
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4765 - 4771
  • [24] Augmented semantic feature based generative network for generalized zero-shot learning
    Li, Zhiqun
    Chen, Qiong
    Liu, Qingfa
    NEURAL NETWORKS, 2021, 143 : 1 - 11
  • [25] Visual-semantic consistency matching network for generalized zero-shot learning
    Zhang, Zhenqi
    Cao, Wenming
    NEUROCOMPUTING, 2023, 536 : 30 - 39
  • [26] Semantic-Guided Multi-Attention Localization for Zero-Shot Learning
    Zhu, Yizhe
    Xie, Jianwen
    Tang, Zhiqiang
    Peng, Xi
    Elgammal, Ahmed
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [27] Semantic combined network for zero-shot scene parsing
    Wang, Yinduo
    Zhang, Haofeng
    Wang, Shidong
    Long, Yang
    Yang, Longzhi
    IET IMAGE PROCESSING, 2020, 14 (04) : 757 - 765
  • [28] Adaptive Metric Learning For Zero-Shot Recognition
    Jiang, Huajie
    Wang, Ruiping
    Shan, Shiguang
    Chen, Xilin
    IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (09) : 1270 - 1274
  • [29] Zero-shot Semantic Segmentation Using Relation Network
    Zhang, Yindong
    Khriyenko, Oleksiy
    PROCEEDINGS OF THE 28TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION FRUCT, 2021, : 516 - 527
  • [30] Learning exclusive discriminative semantic information for zero-shot learning
    Mi, Jian-Xun
    Zhang, Zhonghao
    Tai, Debao
    Zhou, Li-Fang
    Jia, Wei
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (03) : 761 - 772