MSFFAL: Few-Shot Object Detection via Multi-Scale Feature Fusion and Attentive Learning

被引:2
|
作者
Zhang, Tianzhao [1 ,2 ]
Sun, Ruoxi [1 ,3 ]
Wan, Yong [4 ]
Zhang, Fuping [1 ]
Wei, Jianming [1 ]
机构
[1] Chinese Acad Sci, Shanghai Adv Res Inst, Shanghai 201210, Peoples R China
[2] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100049, Peoples R China
[3] ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai 201210, Peoples R China
[4] Chinese Acad Sci, Inst Rock & Soil Mech, State Key Lab Geomech & Geotech Engn, Wuhan 430071, Peoples R China
基金
中国国家自然科学基金;
关键词
few-shot object detection; few-shot learning; attention mechanism; multi-scale feature fusion;
D O I
10.3390/s23073609
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Few-shot object detection (FSOD) is proposed to solve the application problem of traditional detectors in scenarios lacking training samples. The meta-learning methods have attracted the researchers' attention for their excellent generalization performance. They usually select the same class of support features according to the query labels to weight the query features. However, the model cannot possess the ability of active identification only by using the same category support features, and feature selection causes difficulties in the testing process without labels. The single-scale feature of the model also leads to poor performance in small object detection. In addition, the hard samples in the support branch impact the backbone's representation of the support features, thus impacting the feature weighting process. To overcome these problems, we propose a multi-scale feature fusion and attentive learning (MSFFAL) framework for few-shot object detection. We first design the backbone with multi-scale feature fusion and channel attention mechanism to improve the model's detection accuracy on small objects and the representation of hard support samples. Based on this, we propose an attention loss to replace the feature weighting module. The loss allows the model to consistently represent the objects of the same category in the two branches and realizes the active recognition of the model. The model no longer depends on query labels to select features when testing, optimizing the model testing process. The experiments show that MSFFAL outperforms the state-of-the-art (SOTA) by 0.7-7.8% on the Pascal VOC and exhibits 1.61 times the result of the baseline model in MS COCO's small objects detection.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Adaptive multi-scale transductive information propagation for few-shot learning
    Fu, Sichao
    Liu, Baodi
    Liu, Weifeng
    Zou, Bin
    You, Xinhua
    Peng, Qinmu
    Jing, Xiao-Yuan
    KNOWLEDGE-BASED SYSTEMS, 2022, 249
  • [32] Multi-Scale Adaptive Task Attention Network for Few-Shot Learning
    Chen, Haoxing
    Li, Huaxiong
    Li, Yaohui
    Chen, Chunlin
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4765 - 4771
  • [33] Matching Multi-Scale Feature Sets in Vision Transformer for Few-Shot Classification
    Song, Mingchen
    Yao, Fengqin
    Zhong, Guoqiang
    Ji, Zhong
    Zhang, Xiaowei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 12638 - 12651
  • [34] Few-shot wildlife detection based on multi-scale context extraction
    Liu, Ke
    Lin, Shanling
    Shi, Xinyu
    Lin, Jianpu
    Lu, Shanhong
    Lin, Zhixian
    Guo, Tailiang
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2025, 40 (03) : 516 - 526
  • [35] Few-Shot Object Detection via Transfer Learning and Contrastive Reweighting
    Wu, Zhen
    Li, Haowei
    Zhang, Dongyu
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VII, 2023, 14260 : 78 - 87
  • [36] Multi-Object Detection and Tracking Based on Few-Shot Learning
    Luo, Da-Peng
    Du, Guo-Qing
    Zeng, Zhi-Peng
    Wei, Long-Sheng
    Gao, Chang-Xin
    Cheng, Ying
    Xiao, Fei
    Luo, Chen
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2021, 49 (01): : 183 - 191
  • [37] Few-Shot Object Detection via Back Propagation and Dynamic Learning
    You, Dianlong
    Wang, Peng
    Zhang, Yi
    Wang, Ling
    Jin, Shunfu
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2903 - 2908
  • [38] Adaptive Multi-task Learning for Few-Shot Object Detection
    Ren, Yan
    Li, Yanling
    Kong, Adams Wai-Kin
    COMPUTER VISION-ECCV 2024, PT VII, 2025, 15065 : 297 - 314
  • [39] Multi-scale Self-attention-based Few-shot Object Detection for Remote Sensing Images
    Wang, Run
    Wang, Qiong
    Yu, Jiawei
    Tong, Jiaxing
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [40] Few-Shot Batch Incremental Road Object Detection via Detector Fusion
    Tambwekar, Anuj
    Agrawal, Kshitij
    Majee, Anay
    Subramanian, Anbumani
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3063 - 3070