MSO-DETR: Metric space optimization for few-shot object detection

被引:0
|
作者
Sima, Haifeng [1 ,2 ]
Wang, Manyang [1 ]
Liu, Lanlan [3 ]
Zhang, Yudong [1 ,4 ]
Sun, Junding [1 ]
机构
[1] Henan Polytech Univ, Sch Comp Sci & Technol, Jiaozuo, Peoples R China
[2] Henan Polytech Univ, Inst Quantitat Remote Sensing & Smart Agr, Jiaozuo, Peoples R China
[3] Henan Polytech Univ, Fac Arts & Law, Jiaozuo, Peoples R China
[4] Univ Leicester, Sch Comp & Math Sci, Leicester, England
基金
中国国家自然科学基金;
关键词
computer vision; deep learning; machine learning;
D O I
10.1049/cit2.12342
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the metric-based meta-learning detection model, the distribution of training samples in the metric space has great influence on the detection performance, and this influence is usually ignored by traditional meta-detectors. In addition, the design of metric space might be interfered with by the background noise of training samples. To tackle these issues, we propose a metric space optimisation method based on hyperbolic geometry attention and class-agnostic activation maps. First, the geometric properties of hyperbolic spaces to establish a structured metric space are used. A variety of feature samples of different classes are embedded into the hyperbolic space with extremely low distortion. This metric space is more suitable for representing tree-like structures between categories for image scene analysis. Meanwhile, a novel similarity measure function based on Poincar & eacute; distance is proposed to evaluate the distance of various types of objects in the feature space. In addition, the class-agnostic activation maps (CCAMs) are employed to re-calibrate the weight of foreground feature information and suppress background information. Finally, the decoder processes the high-level feature information as the decoding of the query object and detects objects by predicting their locations and corresponding task encodings. Experimental evaluation is conducted on Pascal VOC and MS COCO datasets. The experiment results show that the effectiveness of the authors' method surpasses the performance baseline of the excellent few-shot detection models.
引用
收藏
页码:1515 / 1533
页数:19
相关论文
共 50 条
  • [41] Temporal Speciation Network for Few-Shot Object Detection
    Zhao, Xiaowei
    Liu, Xianglong
    Ma, Yuqing
    Bai, Shihao
    Shen, Yifan
    Hao, Zeyu
    Liu, Aishan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8267 - 8278
  • [42] Orthogonal Progressive Network for Few-shot Object Detection
    Wang, Bingxin
    Yu, Dehong
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 264
  • [43] Generalized Few-Shot Object Detection without Forgetting
    Fan, Zhibo
    Ma, Yuchen
    Li, Zeming
    Sun, Jian
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 4525 - 4534
  • [44] Open-World Few-Shot Object Detection
    Chen, Wei
    Zhang, Shengchuan
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT I, 2023, 14086 : 556 - 567
  • [45] Few-Shot Object Detection on Remote Sensing Images
    Li, Xiang
    Deng, Jingyu
    Fang, Yi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [46] Multiple knowledge embedding for few-shot object detection
    Gong, Xiaolin
    Cai, Youpeng
    Wang, Jian
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (05) : 2231 - 2240
  • [47] Explicit Margin Equilibrium for Few-Shot Object Detection
    Liu, Chang
    Li, Bohao
    Shi, Mengnan
    Chen, Xiaozhong
    Ye, Qixiang
    Ji, Xiangyang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [48] Restoring Negative Information in Few-Shot Object Detection
    Yang, Yukuan
    Wei, Fangyun
    Shi, Miaojing
    Li, Guoqi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [49] Few-Shot Object Detection via Knowledge Transfer
    Kim, Geonuk
    Jung, Hong-Gyu
    Lee, Seong-Whan
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 3564 - 3569
  • [50] Few-shot object detection via baby learning
    Vu, Anh-Khoa Nguyen
    Nguyen, Nhat-Duy
    Nguyen, Khanh-Duy
    Nguyen, Vinh-Tiep
    Ngo, Thanh Duc
    Do, Thanh-Toan
    Nguyen, Tam V.
    IMAGE AND VISION COMPUTING, 2022, 120