Object detection is a fundamental computer vision task that plays a crucial role in a wide range of real-world applications. However, detecting small objects in complex scenes remains challenging due to their low resolution and noisy appearance caused by occlusion, distant viewpoints, etc. To tackle this issue, a novel transformer architecture, the Dual-Key Transformer Network (DKTNet), is proposed in this paper. To improve feature attention, the coherence between the linear-layer outputs Q and V is enhanced by a dual key integrated from K1 and K2, which are computed from Q and V, respectively. Instead of spatial-wise attention, a channel-wise self-attention mechanism is adopted to promote informative feature channels and suppress confusing ones. Moreover, 2D and 1D convolutional computations of Q, K, and V are proposed. Compared with the fully-connected computation in conventional transformer architectures, the 2D convolution better captures local details and global contextual information, and the 1D convolution significantly reduces network complexity. Experimental evaluation is conducted on both general and small-object detection datasets, and comparison against state-of-the-art approaches demonstrates the superiority of the proposed method.

© 2023 Elsevier B.V. All rights reserved.
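As a rough illustration only (not the authors' implementation), the dual-key, channel-wise attention described above could be sketched as follows. The abstract does not specify how K1 and K2 are integrated, so a simple sum is assumed here, and all weight matrices and shapes are hypothetical placeholders:

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def dual_key_channel_attention(x, Wq, Wv, Wk1, Wk2):
    """Sketch of channel-wise self-attention with a dual key.

    x : (n, c) array of n token features with c channels.
    K1 is computed from Q and K2 from V; their integration (a sum
    here) is an assumption, as is every projection shape.
    """
    Q = x @ Wq              # (n, c) query
    V = x @ Wv              # (n, c) value
    K = Q @ Wk1 + V @ Wk2   # (n, c) dual key integrating K1 and K2
    # Channel-wise attention: a (c, c) map over feature channels,
    # rather than the usual (n, n) map over spatial positions.
    attn = softmax((Q.T @ K) / np.sqrt(Q.shape[0]), axis=-1)
    # Reweight the channels of V, promoting informative ones.
    return V @ attn.T       # (n, c)
```

The channel-wise formulation keeps the attention map at size c-by-c regardless of the number of spatial positions, which is one way such a design can stay cheap on high-resolution feature maps.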