SAUI: Scale-Aware Unseen Imagineer for Zero-Shot Object Detection

被引:0
|
作者
Wang, Jiahao [1 ]
Yan, Caixia [1 ]
Zhang, Weizhan [1 ]
Liu, Huan [1 ]
Sun, Hao [2 ]
Zheng, Qinghua [1 ]
机构
[1] Xi An Jiao Tong Univ, MOEKLINNS Lab, Sch Comp Sci & Technol, Xian, Peoples R China
[2] China Telecom Artificial Intelligence Technol Co, Hong Kong, Peoples R China
基金
国家重点研发计划; 中国博士后科学基金; 中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot object detection (ZSD) aims to localize and classify unseen objects without access to their training annotations. As a prevailing solution to ZSD, generation-based methods synthesize unseen visual features by taking seen features as reference and class semantic embeddings as guideline. Although previous works continuously improve the synthesis quality, they fail to consider the scale-varying nature of unseen objects. The generation process is preformed over a single scale of object features and thus lacks scale-diversity among synthesized features. In this paper, we reveal the scale-varying challenge in ZSD and propose a Scale-Aware Unseen Imagineer (SAUI) to lead the way of a novel scale-aware ZSD paradigm. To obtain multi-scale features of seen-class objects, we design a specialized coarse-to-fine extractor to capture features through multiple scale-views. To generate unseen features scale by scale, we innovate a Series-GAN synthesizer along with three scale-aware contrastive components to imagine separable, diverse and robust scale-wise unseen features. Extensive experiments on PASCAL VOC, COCO and DIOR datasets demonstrate SAUI's better performance in different scenarios, especially for scale-varying and small objects. Notably, SAUI achieves the new state-of-the-art performance on COCO and DIOR.
引用
收藏
页码:5445 / 5453
页数:9
相关论文
共 50 条
  • [31] Visual Language Based Succinct Zero-Shot Object Detection
    Zheng, Ye
    Huang, Xi
    Cui, Li
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5410 - 5418
  • [32] Learning unseen visual prototypes for zero-shot classification
    Li, Xiao
    Fang, Min
    Feng, Dazheng
    Li, Haikun
    Wu, Jinqiao
    KNOWLEDGE-BASED SYSTEMS, 2018, 160 : 176 - 187
  • [33] Zero-Shot Embedding for Unseen Entities in Knowledge Graph
    Zhao, Yu
    Gao, Sheng
    Gallinari, Patrick
    Guo, Jun
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (07): : 1440 - 1447
  • [34] Zero-shot Object Detection Based on Dynamic Semantic Vectors
    Li, Haoyu
    Mei, Jilin
    Zhou, Jiancong
    Hu, Yu
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9267 - 9273
  • [35] Zero-shot object rumor detection based on contrastive learning
    Chen, Ke
    Zhang, Wenhao
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (09): : 1790 - 1800
  • [36] Scale-Aware Automatic Augmentations for Object Detection With Dynamic Training
    Chen, Yukang
    Zhang, Peizhen
    Kong, Tao
    Li, Yanwei
    Zhang, Xiangyu
    Qi, Lu
    Sun, Jian
    Jia, Jiaya
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) : 2367 - 2383
  • [37] AdaZoom: Towards Scale-Aware Large Scene Object Detection
    Xu, Jingtao
    Li, Ya-Li
    Wang, Shengjin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4598 - 4609
  • [38] Scale-Aware Squeeze-and-Excitation for Lightweight Object Detection
    Xu, Zhihua
    Hong, Xiaobin
    Chen, Tianshui
    Yang, Zhijing
    Shi, Yukai
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (01) : 49 - 56
  • [39] Scale-aware feature pyramid architecture for marine object detection
    Xu, Fengqiang
    Wang, Huibing
    Peng, Jinjia
    Fu, Xianping
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (08): : 3637 - 3653
  • [40] Spatial-Aware Object Embeddings for Zero-Shot Localization and Classification of Actions
    Mettes, Pascal
    Snoek, Cees G. M.
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4453 - 4462