DKTNet: Dual-Key Transformer Network for small object detection

Cited by: 28
Authors
Xu, Shoukun [1]
Gu, Jianan [1]
Hua, Yining [2]
Liu, Yi [1]
Affiliations
[1] Changzhou Univ, Changzhou 213164, Jiangsu, Peoples R China
[2] Univ Aberdeen, Aberdeen, Scotland
Funding
National Natural Science Foundation of China
Keywords
Small object detection; Transformer; Dual-key
DOI
10.1016/j.neucom.2023.01.055
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Object detection is a fundamental computer vision task that plays a crucial role in a wide range of real-world applications. However, detecting small objects in complex scenes remains challenging because of their low resolution and the noisy appearance representation caused by occlusion, distant viewpoints, etc. To tackle this issue, this paper proposes a novel transformer architecture, the Dual-Key Transformer Network (DKTNet). To improve feature attention, the coherence of the linear-layer outputs Q and V is enhanced by a dual key K integrated from K1 and K2, which are computed along Q and V, respectively. Instead of spatial-wise attention, a channel-wise self-attention mechanism is adopted to promote the important feature channels and suppress the confusing ones. Moreover, 2D and 1D convolution computations for Q, K and V are proposed. Compared with the fully connected computation in conventional transformer architectures, the 2D convolution can better capture local details and global contextual information, and the 1D convolution can significantly reduce network complexity. Experimental evaluation is conducted on both general and small object detection datasets. Comparison against state-of-the-art approaches demonstrates the superiority of these features of the proposed approach. (c) 2023 Elsevier B.V. All rights reserved.
Pages: 29-41
Page count: 13
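
The channel-wise, dual-key attention described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not say how K1 and K2 are integrated, so the sum `k1 + k2`, the scaling by the square root of the number of positions, the plain matrix projections (standing in for the 2D and 1D convolutions), and all function and weight names below are assumptions made for illustration only.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def channel_attention(q, k, v):
    """Channel-wise self-attention.

    q, k, v have shape (C, N): C feature channels, N = H * W positions.
    The attention map is (C, C), so channels attend to channels,
    unlike spatial attention, whose map would be (N, N).
    """
    n = q.shape[1]
    scores = q @ k.T / np.sqrt(n)    # (C, C); the scaling is an assumption
    attn = softmax(scores, axis=-1)  # each row sums to 1
    return attn @ v, attn            # output keeps shape (C, N)


def dual_key_channel_attention(x, wq, wk1, wk2, wv):
    """Hypothetical dual-key variant: K1 is derived from Q, K2 from V.

    x is (C, N); every weight is a (C, C) matrix used as a stand-in
    for the convolutional projections mentioned in the abstract.
    """
    q = wq @ x
    v = wv @ x
    k1 = wk1 @ q   # key computed along Q
    k2 = wk2 @ v   # key computed along V
    k = k1 + k2    # assumed integration rule; not stated in the abstract
    return channel_attention(q, k, v)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.standard_normal((4, 6))  # 4 channels, 6 spatial positions
    weights = [rng.standard_normal((4, 4)) for _ in range(4)]
    out, attn = dual_key_channel_attention(x, *weights)
    print(out.shape, attn.shape)
```

Because the attention map is C x C rather than N x N, its size depends on the number of channels and not on the image resolution, which is one reason a channel-wise formulation can be attractive for high-resolution feature maps.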