Distilling object detectors with mask-guided feature and relation-based knowledge

被引:0
|
作者
Zeng, Liang [1 ]
Ma, Liyan [1 ]
Luo, Xiangfeng [1 ]
Guo, Yinsai [1 ]
Chen, Xue [1 ,2 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[2] State Key Lab Math Engn & Adv Comp, Wuxi 214083, Peoples R China
基金
中国国家自然科学基金;
关键词
knowledge distillation; multi-value mask; object detection;
D O I
10.1504/IJCSE.2024.137291
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Knowledge distillation (KD) is an effective technique for network compression and model accuracy enhancement in image classification, semantic segmentation, pre-trained language model, and so on. However, existing KD methods are specialised for image classification and cannot be used effectively for object detection tasks, with the following two limitations: the imbalance of foreground and background instances and the neglect distillation of relation-based knowledge. In this paper, we present a general mask-guided feature and relation-based knowledge distillation framework (MAR) consisting of two components, mask-guided distillation, and relation-based distillation, to address the above problems. The mask-guided distillation is designed to emphasise students' learning of close-to-object features via multi-value masks, while relation-based distillation is proposed to mimic the relational information between different feature pixels on the classification head. Extensive experiments show that our methods achieve excellent AP improvements on both one-stage and two-stage detectors. Specifically, faster R-CNN with ResNet50 backbone achieves 40.6% in mAP under 1 x schedule on the COCO dataset, which is 3.2% higher than the baseline and even surpasses the teacher detector.
引用
收藏
页码:195 / 203
页数:10
相关论文
共 50 条
  • [21] Relation-Based Knowledge Distillation for Anomaly Detection
    Cheng, Hekai
    Yang, Lu
    Liu, Zulong
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 105 - 116
  • [22] Mask-Guided Target Node Feature Learning and Dynamic Detailed Feature Enhancement for lncRNA-Disease Association Prediction
    Xuan, Ping
    Wang, Wei
    Cui, Hui
    Wang, Shuai
    Nakaguchi, Toshiya
    Zhang, Tiangang
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2024, 64 (16) : 6662 - 6675
  • [23] Development of an object relation-based typology of adolescent sex offenders
    Gamache, Dominick
    Diguer, Louis
    Laverdiere, Olivier
    Rousseau, Jean-Pierre
    BULLETIN OF THE MENNINGER CLINIC, 2012, 76 (04) : 329 - 364
  • [24] MGQFormer: Mask-Guided Query-Based Transformer for Image Manipulation Localization
    Zeng, Kunlun
    Cheng, Ri
    Tan, Weimin
    Yan, Bo
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 6944 - 6952
  • [25] Mask-guided generative adversarial network for MRI-based CT synthesis
    Luo, Yu
    Zhang, Shaowei
    Ling, Jie
    Lin, Zhiyi
    Wang, Zongming
    Yao, Shun
    KNOWLEDGE-BASED SYSTEMS, 2024, 295
  • [26] Mask-guided discriminative feature network for occluded person re-identification (vol 101, 104178, 2024)
    Zhong, Fujin
    Wang, Yunhe
    Yu, Hong
    Hu, Jun
    Yang, Yan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 104
  • [27] SMG-Diff: Adversarial Attack Method Based on Semantic Mask-Guided Diffusion
    Zhang, Yongliang
    Liu, Jing
    MULTIMEDIA MODELING, MMM 2025, PT IV, 2025, 15523 : 44 - 57
  • [28] Knowledge distilling based model compression and feature learning in fault diagnosis
    Zhang, Wenfeng
    Biswas, Gautam
    Zhao, Qi
    Zhao, Hongbo
    Feng, Wenquan
    APPLIED SOFT COMPUTING, 2020, 88
  • [29] Cosine similarity-guided knowledge distillation for robust object detectors
    Park, Sangwoo
    Kang, Donggoo
    Paik, Joonki
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [30] Mask-Guided Mamba Fusion for Drone-Based Visible-Infrared Vehicle Detection
    Wang, Simiao
    Wang, Chunpeng
    Shi, Chaoyi
    Liu, Yunan
    Lu, Mingyu
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62