DKTNet: Dual-Key Transformer Network for small object detection

Cited by: 28
Authors
Xu, Shoukun [1]
Gu, Jianan [1]
Hua, Yining [2]
Liu, Yi [1]
Affiliations
[1] Changzhou Univ, Changzhou 213164, Jiangsu, Peoples R China
[2] Univ Aberdeen, Aberdeen, Scotland
Funding
National Natural Science Foundation of China
Keywords
Small object detection; Transformer; Dual-key
DOI
10.1016/j.neucom.2023.01.055
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Object detection is a fundamental computer vision task that plays a crucial role in a wide range of real-world applications. However, detecting small objects in complex scenes remains challenging because of their low resolution and the noisy appearance representations caused by occlusion, distant viewpoints, etc. To tackle this issue, this paper proposes a novel transformer architecture, the Dual-Key Transformer Network (DKTNet). To improve the feature attention ability, the coherence of the linear layer outputs Q and V is enhanced by a dual-K integrated from K1 and K2, which are computed along Q and V, respectively. Instead of spatial-wise attention, a channel-wise self-attention mechanism is adopted to promote the important feature channels and suppress the confusing ones. Moreover, 2D and 1D convolution computations for Q, K and V are proposed. Compared with the fully-connected computation in conventional transformer architectures, the 2D convolution can better capture local details and global contextual information, and the 1D convolution can reduce network complexity significantly. Experimental evaluation is conducted on both general and small object detection datasets. The superiority of the aforementioned features of the proposed approach is demonstrated by comparison against state-of-the-art approaches. © 2023 Elsevier B.V. All rights reserved.
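The dual-key, channel-wise attention described in the abstract can be illustrated with a rough sketch. This is not the authors' implementation: the abstract does not say how K1 and K2 are integrated, so summation is assumed here, and plain channel-mixing matrices stand in for the paper's 2D and 1D convolutions. The function name `dual_key_channel_attention`, the weight names, and the scaling factor are all hypothetical.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    e = np.exp(shifted)
    return e / e.sum(axis=axis, keepdims=True)


def dual_key_channel_attention(x, w_q, w_v, w_k1, w_k2):
    """Channel-wise self-attention with a key built from both Q and V.

    x has shape (C, H, W); every weight matrix has shape (C, C).
    Assumptions (not taken from the paper): K1 and K2 are combined
    by summation, and the projections are plain channel-mixing
    matrices rather than 2D/1D convolutions.
    """
    channels, height, width = x.shape
    n = height * width
    tokens = x.reshape(channels, n)          # (C, N): one row per channel

    q = w_q @ tokens                         # (C, N)
    v = w_v @ tokens                         # (C, N)
    k1 = w_k1 @ q                            # key computed along Q
    k2 = w_k2 @ v                            # key computed along V
    k = k1 + k2                              # assumed integration of the dual key

    # Channel-wise attention: the score matrix is (C, C), not (N, N),
    # so each channel attends to other channels rather than positions.
    scores = (q @ k.T) / np.sqrt(n)
    attn = softmax(scores, axis=-1)

    out = attn @ v                           # (C, N): reweighted channels
    return out.reshape(channels, height, width), attn


rng = np.random.default_rng(0)
C, H, W = 8, 4, 4
x = rng.standard_normal((C, H, W))
weights = [rng.standard_normal((C, C)) / np.sqrt(C) for _ in range(4)]
out, attn = dual_key_channel_attention(x, *weights)
print(out.shape, attn.shape)
```

Because the attention matrix is C x C, its size depends on the number of channels rather than on the spatial resolution, which is one reason a channel-wise formulation can be attractive for high-resolution feature maps.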
Pages: 29-41
Number of pages: 13