ADAPTIVE MULTI-SCALE SEMANTIC FUSION NETWORK FOR ZERO-SHOT LEARNING

Authors
Song, Jing [1]
Peng, Peixi [2]
Zhai, Yunpeng [1]
Zhang, Chong [1]
Tian, Yonghong [2]
Affiliations
[1] Peking Univ, Shenzhen Grad Sch, Shenzhen, Peoples R China
[2] Peking Univ, Beijing, Peoples R China
Keywords
Multi-scale; attribute attention; semantic fusion; global and local semantic attributes; class-center triplet loss
DOI
10.1109/ICMEW53276.2021.9455945
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Zero-shot learning aims at accurately recognizing unseen objects by learning matrices that bridge the gap between visual information and semantic attributes. Existing approaches predominantly focus on learning a proper mapping function for visual-semantic embedding while neglecting the effect of learning discriminative semantic features, which leads to severe semantic ambiguity. We propose a practical Adaptive Multi-scale Semantic Fusion (AMSF) framework that performs object-based multi-scale attribute attention for semantic disambiguation. Considering both the low-level visual information and the global class-level features that relate to this ambiguity, the proposed method jointly learns cooperative global and local semantic attributes from different scales. Moreover, under the joint supervision of an embedding softmax loss and a class-center triplet loss, the model is encouraged to learn highly discriminative semantic features and visual features with high inter-class dispersion and intra-class compactness. The method is evaluated on the CUB, AwA2, and SUN datasets, and the experimental results indicate that it achieves state-of-the-art performance.
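The abstract names a class-center triplet loss but does not give its formula. The following is a minimal sketch of one common formulation, not the authors' implementation: each feature is pulled toward the center of its own class and pushed away from the nearest other class center by at least a margin. The function names, the squared Euclidean distance, the hardest-negative choice, and the default margin of 1.0 are all assumptions made for illustration.

```python
from typing import Sequence


def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def class_center_triplet_loss(
    features: Sequence[Sequence[float]],
    labels: Sequence[int],
    centers: Sequence[Sequence[float]],
    margin: float = 1.0,  # assumed value, not taken from the paper
) -> float:
    """Mean hinge loss over samples (hypothetical formulation).

    For each sample: max(0, d(x, own center) - d(x, nearest other center) + margin).
    """
    if len(centers) < 2:
        raise ValueError("at least two class centers are required")
    if len(features) == 0 or len(features) != len(labels):
        raise ValueError("features and labels must be non-empty and aligned")

    total = 0.0
    for feature, label in zip(features, labels):
        # Distance to the sample's own class center (intra-class compactness).
        positive = squared_distance(feature, centers[label])
        # Distance to the closest center of any other class (inter-class dispersion).
        negative = min(
            squared_distance(feature, center)
            for index, center in enumerate(centers)
            if index != label
        )
        total += max(0.0, positive - negative + margin)
    return total / len(features)
```

With two centers at (0, 0) and (4, 0), samples that sit exactly on their own center incur no loss, while samples sitting on the wrong center are penalized by the full squared distance plus the margin. In training, a term of this kind would be added to the embedding softmax loss; how the two are weighted is not stated in the abstract.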
Pages: 6