Zero-shot object detection with contrastive semantic association network

Cited by: 1

Authors:

Li, Haohe ^{[1]}

Wang, Chong ^{[1]}

Liu, Weijie ^{[1,2]}

Gong, Yilin ^{[1]}

Dai, Xinmiao ^{[1]}

Affiliations:

[1] Ningbo Univ, Fac Elect Engn & Comp Sci, Ningbo 315000, Zhejiang, Peoples R China

[2] Shenzhen Anker Innovat, Informat & Control Engn, Shenzhen 518055, Guangdong, Peoples R China

Source:

APPLIED INTELLIGENCE | 2023 / Vol. 53 / Issue 24

Keywords:

Semantic association; Graph propagation; Contrastive learning; Zero-shot object detection;

DOI:

10.1007/s10489-023-05117-y

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Zero-shot object detection (ZSD) is dedicated to the task of precisely localizing and identifying unfamiliar objects that have not been encountered before. In this paper, a contrastive semantic association network is proposed to address the knowledge transfer challenge from seen classes to unseen ones in ZSD. It enables efficient information propagation through similarity-based connections, thereby establishing a clearer link between seen and unseen categories. Moreover, a visual-semantic contrastive learning technique is developed to mitigate the node convergence issue caused by the graph structure of the proposed network. By emphasizing the visual and semantic distinctiveness across different categories, the proposed model leverages semantic information and graph structure knowledge to enhance the generalization capability of seen and unseen feature projection. Extensive experiments demonstrate the superior performance of our model compared to other zero-shot object detection methods, showcasing notable improvement in mean average precision (mAP) on the MS-COCO dataset. The code and models are publicly available at: https://github.com/lihh1023/CSA-ZSD/tree/master.
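The two ideas named in the abstract, propagation over similarity-based connections and a visual-semantic contrastive objective, can be illustrated with a minimal sketch. This is not the authors' implementation (see the repository linked above for that). The function names, the cosine threshold, the temperature, and the toy two-dimensional embeddings are all assumptions chosen for illustration, and the loss is a generic InfoNCE-style formulation rather than the loss defined in the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def similarity_graph(embeddings, threshold=0.5):
    """Row-normalized adjacency over class embeddings.

    The threshold is an assumed hyperparameter: two classes are linked
    only when their cosine similarity reaches it. Self-loops are kept.
    """
    adj = []
    for i, e_i in enumerate(embeddings):
        row = []
        for j, e_j in enumerate(embeddings):
            sim = cosine(e_i, e_j)
            row.append(sim if (i == j or sim >= threshold) else 0.0)
        total = sum(row)
        adj.append([w / total for w in row])
    return adj


def propagate(adj, features):
    """One propagation step: each node becomes a weighted mean of its neighbours."""
    n, dim = len(features), len(features[0])
    return [
        [sum(adj[i][j] * features[j][d] for j in range(n)) for d in range(dim)]
        for i in range(n)
    ]


def contrastive_loss(visual, semantics, label, temperature=0.1):
    """InfoNCE-style loss: pull a visual feature toward its own class node
    and push it away from the other class nodes."""
    logits = [cosine(visual, s) / temperature for s in semantics]
    m = max(logits)  # subtract the max for a numerically stable log-sum-exp
    log_sum_exp = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_sum_exp - logits[label]


# Toy class embeddings (hypothetical): classes 0 and 1 play the role of
# seen classes, class 2 the role of an unseen class.
class_embeddings = [[1.0, 0.0], [0.8, 0.6], [0.0, 1.0]]
adj = similarity_graph(class_embeddings)
propagated = propagate(adj, class_embeddings)

# A toy region feature that resembles class 0.
region_feature = [1.0, 0.1]
loss_match = contrastive_loss(region_feature, propagated, label=0)
loss_mismatch = contrastive_loss(region_feature, propagated, label=2)
```

In this toy setup the unseen class node has no direct edge to class 0 but still receives information through class 1. The contrastive term is smaller when the region feature is paired with the class it resembles, which is the kind of pressure that keeps propagated nodes from collapsing toward one another.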


Pages: 30056-30068

Page count: 13

50 items in total

[41] ChatNav: Leveraging LLM to Zero-Shot Semantic Reasoning in Object Navigation
Zhu, Yong
Wen, Zhenyu
Li, Xiong
Shi, Xiufang
Wu, Xiang
Dong, Hui
Chen, Jiming
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2369 - 2381
[42] Feature Enhanced Zero-Shot Stance Detection via Contrastive Learning
Zhao, Xuechen
Zou, Jiaying
Zhang, Zhong
Xie, Feng
Zhou, Bin
Tian, Lei
PROCEEDINGS OF THE 2023 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2023, : 900 - 908
[43] JointCL: A Joint Contrastive Learning Framework for Zero-Shot Stance Detection
Liang, Bin
Zhu, Qinglin
Li, Xiang
Yang, Min
Gui, Lin
He, Yulan
Xu, Ruifeng
PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 81 - 91
[44] Transformer-Based Zero-Shot Detection via Contrastive Learning
Liu, Wei
Chen, Hui
Ma, Yongqiang
Wang, Jianji
Zheng, Nanning
ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2022, PART I, 2022, 646 : 316 - 327
[45] Feature Enhanced Projection Network for Zero-shot Semantic Segmentation
Lu, Hongchao
Fang, Longwei
Lin, Matthieu
Deng, Zhidong
2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 14011 - 14017
[46] MSDN: Mutually Semantic Distillation Network for Zero-Shot Learning
Chen, Shiming
Hong, Ziming
Xie, Guo-Sen
Yang, Wenhan
Peng, Qinmu
Wang, Kai
Zhao, Jian
You, Xinge
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7602 - 7611
[47] Robust Zero-Shot Intent Detection via Contrastive Transfer Learning
Maqbool, M. H.
Khan, F. A.
Siddique, A. B.
Foroosh, Hassan
2023 IEEE 17TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, ICSC, 2023, : 49 - 56
[48] Contrastive Embedding for Generalized Zero-Shot Learning
Han, Zongyan
Fu, Zhenyong
Chen, Shuo
Yang, Jian
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2371 - 2381
[49] Generation-based contrastive model with semantic alignment for generalized zero-shot learning
Yang, Jingqi
Shen, Qi
Xie, Cheng
IMAGE AND VISION COMPUTING, 2023, 137
[50] Decoupling Zero-Shot Semantic Segmentation
Ding, Jian
Xue, Nan
Xia, Gui-Song
Dai, Dengxin
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11573 - 11582
