MSO-DETR: Metric space optimization for few-shot object detection

被引：0

作者：

Sima, Haifeng ^{[1
,2
]}

Wang, Manyang ^{[1
]}

Liu, Lanlan ^{[3
]}

Zhang, Yudong ^{[1
,4
]}

Sun, Junding ^{[1
]}

机构：

[1] Henan Polytech Univ, Sch Comp Sci & Technol, Jiaozuo, Peoples R China

[2] Henan Polytech Univ, Inst Quantitat Remote Sensing & Smart Agr, Jiaozuo, Peoples R China

[3] Henan Polytech Univ, Fac Arts & Law, Jiaozuo, Peoples R China

[4] Univ Leicester, Sch Comp & Math Sci, Leicester, England

来源：

CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY | 2024年 / 9卷 / 06期

基金：

中国国家自然科学基金;

关键词：

computer vision; deep learning; machine learning;

D O I：

10.1049/cit2.12342

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the metric-based meta-learning detection model, the distribution of training samples in the metric space has great influence on the detection performance, and this influence is usually ignored by traditional meta-detectors. In addition, the design of metric space might be interfered with by the background noise of training samples. To tackle these issues, we propose a metric space optimisation method based on hyperbolic geometry attention and class-agnostic activation maps. First, the geometric properties of hyperbolic spaces to establish a structured metric space are used. A variety of feature samples of different classes are embedded into the hyperbolic space with extremely low distortion. This metric space is more suitable for representing tree-like structures between categories for image scene analysis. Meanwhile, a novel similarity measure function based on Poincar & eacute; distance is proposed to evaluate the distance of various types of objects in the feature space. In addition, the class-agnostic activation maps (CCAMs) are employed to re-calibrate the weight of foreground feature information and suppress background information. Finally, the decoder processes the high-level feature information as the decoding of the query object and detects objects by predicting their locations and corresponding task encodings. Experimental evaluation is conducted on Pascal VOC and MS COCO datasets. The experiment results show that the effectiveness of the authors' method surpasses the performance baseline of the excellent few-shot detection models.

引用

页码：1515 / 1533

页数：19

共 50 条

[41] Temporal Speciation Network for Few-Shot Object Detection
Zhao, Xiaowei
Liu, Xianglong
Ma, Yuqing
Bai, Shihao
Shen, Yifan
Hao, Zeyu
Liu, Aishan
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8267 - 8278
[42] Orthogonal Progressive Network for Few-shot Object Detection
Wang, Bingxin
Yu, Dehong
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 264
[43] Generalized Few-Shot Object Detection without Forgetting
Fan, Zhibo
Ma, Yuchen
Li, Zeming
Sun, Jian
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4525 - 4534
[44] Open-World Few-Shot Object Detection
Chen, Wei
Zhang, Shengchuan
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT I, 2023, 14086 : 556 - 567
[45] Few-Shot Object Detection on Remote Sensing Images
Li, Xiang
Deng, Jingyu
Fang, Yi
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[46] Multiple knowledge embedding for few-shot object detection
Gong, Xiaolin
Cai, Youpeng
Wang, Jian
SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (05) : 2231 - 2240
[47] Explicit Margin Equilibrium for Few-Shot Object Detection
Liu, Chang
Li, Bohao
Shi, Mengnan
Chen, Xiaozhong
Ye, Qixiang
Ji, Xiangyang
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
[48] Restoring Negative Information in Few-Shot Object Detection
Yang, Yukuan
Wei, Fangyun
Shi, Miaojing
Li, Guoqi
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[49] Few-Shot Object Detection via Knowledge Transfer
Kim, Geonuk
Jung, Hong-Gyu
Lee, Seong-Whan
2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 3564 - 3569
[50] Few-shot object detection via baby learning
Vu, Anh-Khoa Nguyen
Nguyen, Nhat-Duy
Nguyen, Khanh-Duy
Nguyen, Vinh-Tiep
Ngo, Thanh Duc
Do, Thanh-Toan
Nguyen, Tam V.
IMAGE AND VISION COMPUTING, 2022, 120

← 1 2 3 4 5 →