SAUI: Scale-Aware Unseen Imagineer for Zero-Shot Object Detection

被引:0
|
作者
Wang, Jiahao [1 ]
Yan, Caixia [1 ]
Zhang, Weizhan [1 ]
Liu, Huan [1 ]
Sun, Hao [2 ]
Zheng, Qinghua [1 ]
机构
[1] Xi An Jiao Tong Univ, MOEKLINNS Lab, Sch Comp Sci & Technol, Xian, Peoples R China
[2] China Telecom Artificial Intelligence Technol Co, Hong Kong, Peoples R China
基金
国家重点研发计划; 中国博士后科学基金; 中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot object detection (ZSD) aims to localize and classify unseen objects without access to their training annotations. As a prevailing solution to ZSD, generation-based methods synthesize unseen visual features by taking seen features as reference and class semantic embeddings as guideline. Although previous works continuously improve the synthesis quality, they fail to consider the scale-varying nature of unseen objects. The generation process is preformed over a single scale of object features and thus lacks scale-diversity among synthesized features. In this paper, we reveal the scale-varying challenge in ZSD and propose a Scale-Aware Unseen Imagineer (SAUI) to lead the way of a novel scale-aware ZSD paradigm. To obtain multi-scale features of seen-class objects, we design a specialized coarse-to-fine extractor to capture features through multiple scale-views. To generate unseen features scale by scale, we innovate a Series-GAN synthesizer along with three scale-aware contrastive components to imagine separable, diverse and robust scale-wise unseen features. Extensive experiments on PASCAL VOC, COCO and DIOR datasets demonstrate SAUI's better performance in different scenarios, especially for scale-varying and small objects. Notably, SAUI achieves the new state-of-the-art performance on COCO and DIOR.
引用
收藏
页码:5445 / 5453
页数:9
相关论文
共 50 条
  • [21] Robust Region Feature Synthesizer for Zero-Shot Object Detection
    Huang, Peiliang
    Han, Junwei
    Cheng, De
    Zhang, Dingwen
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7612 - 7621
  • [22] Zero-shot object detection with contrastive semantic association network
    Haohe Li
    Chong Wang
    Weijie Liu
    Yilin Gong
    Xinmiao Dai
    Applied Intelligence, 2023, 53 : 30056 - 30068
  • [23] A Multi-Space Approach to Zero-Shot Object Detection
    Gupta, Dikshant
    Anantharaman, Aditya
    Mamgain, Nehal
    Kamath, Sowmya S.
    Balasubramanian, Vineeth N.
    Jawahar, C., V
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1198 - 1206
  • [24] A dynamic semantic knowledge graph for zero-shot object detection
    Lv, Wen
    Shi, Hongbo
    Tan, Shuai
    Song, Bing
    Tao, Yang
    VISUAL COMPUTER, 2023, 39 (10): : 4513 - 4527
  • [25] Zero-shot object detection with contrastive semantic association network
    Li, Haohe
    Wang, Chong
    Liu, Weijie
    Gong, Yilin
    Dai, Xinmiao
    APPLIED INTELLIGENCE, 2023, 53 (24) : 30056 - 30068
  • [26] Zero-Shot Aerial Object Detection with Visual Description Regularization
    Zang, Zhengqing
    Lin, Chenyu
    Tang, Chenwei
    Wang, Tao
    Lv, Jiancheng
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 6926 - 6934
  • [27] A dynamic semantic knowledge graph for zero-shot object detection
    Wen Lv
    Hongbo Shi
    Shuai Tan
    Bing Song
    Yang Tao
    The Visual Computer, 2023, 39 : 4513 - 4527
  • [28] Learning Latent Semantic Attributes for Zero-Shot Object Detection
    Wang, Kang
    Zhang, Lu
    Tan, Yifan
    Zhao, Jiajia
    Zhou, Shuigeng
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 230 - 237
  • [29] Adaptive adjustment with semantic embedding for zero-shot object detection
    Lv, Wen
    Shi, Hongbo
    Tan, Shuai
    Song, Bing
    Tao, Yang
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (03)
  • [30] GTNet: Generative Transfer Network for Zero-Shot Object Detection
    Zhao, Shizhen
    Gao, Changxin
    Shao, Yuanjie
    Li, Lerenhan
    Yu, Changqian
    Ji, Zhong
    Sang, Nang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12967 - 12974