Zero-shot object detection with contrastive semantic association network

Cited by: 1
Authors
Li, Haohe [1]
Wang, Chong [1]
Liu, Weijie [1,2]
Gong, Yilin [1]
Dai, Xinmiao [1]
Affiliations
[1] Ningbo Univ, Fac Elect Engn & Comp Sci, Ningbo 315000, Zhejiang, Peoples R China
[2] Shenzhen Anker Innovat, Informat & Control Engn, Shenzhen 518055, Guangdong, Peoples R China
Keywords
Semantic association; Graph propagation; Contrastive learning; Zero-shot object detection
DOI
10.1007/s10489-023-05117-y
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Zero-shot object detection (ZSD) is dedicated to the task of precisely localizing and identifying unfamiliar objects that have not been encountered before. In this paper, a contrastive semantic association network is proposed to address the knowledge transfer challenge from seen classes to unseen ones in ZSD. It enables efficient information propagation through similarity-based connections, thereby establishing a clearer link between seen and unseen categories. Moreover, a visual-semantic contrastive learning technique is developed to mitigate the node convergence issue caused by the graph structure of the proposed network. By emphasizing the visual and semantic distinctiveness across different categories, the proposed model leverages semantic information and graph structure knowledge to enhance the generalization capability of seen and unseen feature projection. Extensive experiments demonstrate the superior performance of our model compared to other zero-shot object detection methods, showcasing notable improvement in mean average precision (mAP) on the MS-COCO dataset. The code and models are publicly available at: https://github.com/lihh1023/CSA-ZSD/tree/master.
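The two ideas named in the abstract, similarity-based propagation over a class graph and a visual-semantic contrastive objective, can be illustrated with a small sketch. This is not the authors' implementation (see the repository linked above for that). The toy embeddings, the function names, the top-k neighbour rule, and the InfoNCE-style loss are all assumptions chosen for illustration only.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def similarity_graph(embeddings, k=1):
    """Row-normalised adjacency (assumed rule): each class keeps a self-loop
    and links to its k most similar classes, weighted by cosine similarity."""
    n = len(embeddings)
    adj = [[0.0] * n for _ in range(n)]
    for i in range(n):
        sims = sorted(
            ((cosine(embeddings[i], embeddings[j]), j) for j in range(n) if j != i),
            reverse=True,
        )
        adj[i][i] = 1.0
        for s, j in sims[:k]:
            adj[i][j] = max(s, 0.0)
        total = sum(adj[i])
        adj[i] = [w / total for w in adj[i]]
    return adj

def propagate(adj, features):
    """One propagation step: every node becomes a weighted average of its
    neighbours. Repeating this pulls node features toward each other."""
    n, dim = len(features), len(features[0])
    return [
        [sum(adj[i][j] * features[j][d] for j in range(n)) for d in range(dim)]
        for i in range(n)
    ]

def info_nce(visual, semantics, target, temperature=0.1):
    """InfoNCE-style loss (assumed form): low when `visual` is closest to
    the semantic vector of class `target` and far from the others."""
    logits = [cosine(visual, s) / temperature for s in semantics]
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(l - m) for l in logits))
    return log_sum - logits[target]

# Hypothetical 2-D semantic embeddings for three classes; index 1 plays the
# role of an unseen class that is semantically close to seen class 0.
embeddings = [[1.0, 0.1], [0.9, 0.2], [0.0, 1.0]]
adj = similarity_graph(embeddings, k=1)
smoothed = propagate(adj, embeddings)
loss_match = info_nce(embeddings[0], smoothed, target=0)
loss_mismatch = info_nce(embeddings[0], smoothed, target=2)
```

In this toy setup the unseen class receives information from its most similar seen class through the graph, while the contrastive term penalises a visual feature that sits closer to the wrong class, which is one plausible way to counteract features converging after propagation.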
Pages: 30056-30068
Page count: 13