Zero-Shot Object Recognition by Semantic Manifold Distance

Cited by: 0
Authors
Fu, Zhenyong [1]
Xiang, Tao [1]
Kodirov, Elyor [1]
Gong, Shaogang [1]
Affiliations
[1] Queen Mary Univ London, London E1 4NS, England
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Object recognition by zero-shot learning (ZSL) aims to recognise objects without seeing any visual examples, by learning to transfer knowledge between seen and unseen object classes. This is typically achieved by exploring a semantic embedding space such as an attribute space or a semantic word vector space. In such a space, both seen and unseen class labels, as well as image features, can be embedded (projected), and the similarity between them can thus be measured directly. Existing works differ in which embedding space is used and in how the visual data are projected into it. Yet they all measure similarity in the space using a conventional distance metric (e.g. cosine) that does not consider the rich intrinsic structure, i.e. the semantic manifold, of the semantic categories in the embedding space. In this paper we propose to model the semantic manifold in an embedding space using a semantic class label graph. The semantic manifold structure is used to redefine the distance metric in the semantic embedding space for more effective ZSL. The proposed semantic manifold distance is computed using a novel absorbing Markov chain process (AMP), which has a very efficient closed-form solution. The proposed model improves upon and seamlessly unifies various existing ZSL algorithms. Extensive experiments on both the large-scale ImageNet dataset and the widely used Animals with Attributes (AwA) dataset show that our model significantly outperforms the state of the art.
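The abstract does not spell out how the AMP is constructed, so the following is only a minimal sketch of the generic closed form for absorbing Markov chains, not the authors' formulation. It assumes (hypothetically) that seen-class nodes of the class label graph act as transient states, that unseen-class nodes act as absorbing states, and that a test image enters the chain with a start distribution over the seen classes. With Q the transient-to-transient and R the transient-to-absorbing transition blocks, the absorption probabilities are B = (I - Q)^{-1} R. All function names and the toy numbers are illustrative.

```python
def solve_linear(a, b):
    """Solve A X = B by Gauss-Jordan elimination with partial pivoting."""
    n = len(a)
    # Build the augmented matrix [A | B] as floats.
    aug = [list(map(float, a[i])) + list(map(float, b[i])) for i in range(n)]
    for col in range(n):
        # Pick the row with the largest pivot for numerical stability.
        pivot = max(range(col, n), key=lambda row: abs(aug[row][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        # Eliminate this column from every other row.
        for row in range(n):
            if row != col and aug[row][col] != 0.0:
                f = aug[row][col]
                aug[row] = [vr - f * vc for vr, vc in zip(aug[row], aug[col])]
    return [row[n:] for row in aug]


def absorption_probabilities(q, r):
    """Return B = (I - Q)^{-1} R for an absorbing Markov chain."""
    n = len(q)
    i_minus_q = [[(1.0 if i == j else 0.0) - q[i][j] for j in range(n)]
                 for i in range(n)]
    return solve_linear(i_minus_q, r)


def score_absorbing_states(start, b):
    """Weight the rows of B by a start distribution over transient states."""
    return [sum(start[i] * b[i][j] for i in range(len(start)))
            for j in range(len(b[0]))]


# Toy chain (assumed values): 2 transient nodes, 2 absorbing nodes.
Q = [[0.0, 0.5],
     [0.5, 0.0]]
R = [[0.5, 0.0],
     [0.0, 0.5]]
B = absorption_probabilities(Q, R)

# Hypothetical start distribution of one test image over the transient nodes.
start = [0.75, 0.25]
scores = score_absorbing_states(start, B)
predicted = max(range(len(scores)), key=scores.__getitem__)
print(B, scores, predicted)
```

In this toy chain the walker is more likely to be absorbed at the node adjacent to where it starts, which is the sense in which an absorption probability can serve as a graph-aware similarity instead of a plain cosine distance.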
Pages: 2635-2644
Number of pages: 10