DANet: Multi-scale UAV Target Detection with Dynamic Feature Perception and Scale-aware Knowledge Distillation

被引:1
|
作者
Fang, Houzhang [1 ]
Liao, Zikai [1 ]
Wang, Lu [1 ]
Li, Qingshan [1 ]
Chang, Yi [2 ]
Yan, Luxin [2 ]
Wang, Xuhua [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Unmanned aerial vehicle; multi-scale infrared target detection; attention mechanism; contrastive learning; knowledge distillation;
D O I
10.1145/3581783.3612146
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-scale infrared unmanned aerial vehicle ( UAV) targets (IRUTs) detection under dynamic scenarios remains a challenging task due to weak target features, varying shapes and poses, and complex background interference. Current detection methods find it difficult to address the above issues accurately and efficiently. In this paper, we design a dynamic attentive network (DANet) incorporating a scale-adaptive feature enhancement mechanism (SaFEM) and an attention-guided cross-weighting feature aggregator (ACFA). The SaFEM adaptively adjusts the network's receptive fields at hierarchical network levels leveraging separable deformable convolution (SDC), which enhances the network's multi-scale IRUT awareness. The ACFA, modulated by two crossing attention mechanisms, strengthens structural and semantic properties on neighboring levels for the accurate representation of multi-scale IRUT features from different levels. A plug-and-play anti-distractor contrastive regularization (ADCR) is also imposed on our DANet, which enforces similarity on features of targets and distractors from a new uncompressed feature projector (UFP) to increase the network's anti-distractor ability in complex backgrounds. To further increase the multi-scale UAV detection performance of DANet while maintaining its efficiency superiority, we propose a novel scale-specific knowledge distiller (SSKD) based on a divide-and-conquer strategy. For the "divide" stage, we intendedly construct three task-oriented teachers to learn tailored knowledge for small-, medium-, and largescale IRUTs. For the "conquer" stage, we propose a novel elementwise attentive distillation module (EADM), where we employ a pixel-wise attention mechanism to highlight teacher and student IRUT features, and incorporate IRUT-associated prior knowledge for the collaborative transfer of refined multi-scale IRUT features to our DANet. Extensive experiments on real infrared UAV datasets demonstrate that our DANet is able to detect multi-scale UAVs with a satisfactory balance between accuracy and efficiency.
引用
收藏
页码:2121 / 2130
页数:10
相关论文
共 50 条
  • [31] Multi-Scale and Occlusion Aware Network for Vehicle Detection and Segmentation on UAV Aerial Images
    Zhang, Wang
    Liu, Chunsheng
    Chang, Faliang
    Song, Ye
    REMOTE SENSING, 2020, 12 (11)
  • [32] Multi-scale Dynamic Human Fatigue Detection with Feature Level Fusion
    Fan, Xiao
    Sun, Yanfeng
    Yin, Baocai
    2008 8TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2008), VOLS 1 AND 2, 2008, : 134 - 139
  • [33] Multi-Scale Feature Attention-DEtection TRansformer: Multi-Scale Feature Attention for security check object detection
    Sima, Haifeng
    Chen, Bailiang
    Tang, Chaosheng
    Zhang, Yudong
    Sun, Junding
    IET COMPUTER VISION, 2024, 18 (05) : 613 - 625
  • [34] An Effective and Lightweight Full-Scale Target Detection Network for UAV Images Based on Deformable Convolutions and Multi-Scale Contextual Feature Optimization
    Yu, Wanwan
    Zhang, Junping
    Liu, Dongyang
    Xi, Yunqiao
    Wu, Yinhu
    REMOTE SENSING, 2024, 16 (16)
  • [35] Scale-Aware Regional Collective Feature Enhancement Network for Scene Object Detection
    Li, Yiyao
    Liu, Jin
    Gao, Zhenyu
    NEURAL PROCESSING LETTERS, 2023, 55 (05) : 6289 - 6310
  • [36] Scene text detection by adaptive feature selection with text scale-aware loss
    Qin Wu
    Wenli Luo
    Zhilei Chai
    Guodong Guo
    Applied Intelligence, 2022, 52 : 514 - 529
  • [37] Scene text detection by adaptive feature selection with text scale-aware loss
    Wu, Qin
    Luo, Wenli
    Chai, Zhilei
    Guo, Guodong
    APPLIED INTELLIGENCE, 2022, 52 (01) : 514 - 529
  • [38] Knowledge-aware Multi-scale Time Series Abnormal Segment Detection
    Huang, Zhen-Wei
    Gao, Yuefang
    Huang, Ling
    Zeng, Zhongjie
    Chen, Jiaquan
    Kuang, Yingjie
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2024, PT 5, 2024, 14854 : 426 - 441
  • [39] Scale-Aware Multi-branch Decoder for Salient Object Detection
    Lin, Yang
    Zhou, Huajun
    Xie, Xiaohua
    Lai, Jianhuang
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 280 - 292
  • [40] Learning region-guided scale-aware feature selection for object detection
    Liu, Liu
    Wang, Rujing
    Xie, Chengjun
    Li, Rui
    Wang, Fangyuan
    Zhou, Man
    Teng, Yue
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (11): : 6389 - 6403