DETR-SPP: a fine-tuned vehicle detection with transformer

被引:4
|
作者
Krishnendhu, S. P. [1 ]
Mohandas, Prabu [1 ]
机构
[1] Natl Inst Technol Calicut, Dept Comp Sci & Engn, Intelligent Comp Lab, Kattangal, Kerala, India
关键词
Detection transformer; Intelligent transportation systems (ITS); Real-time vehicle detection; Spatial pyramid pooling (SPP); Bipartite matching;
D O I
10.1007/s11042-023-16502-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Real-time vehicle detection is the most challenging and crucial task in intelligent transportation systems. Speed and accuracy are the most anticipated qualities for a vehicle detection model. The existing real-time vehicle detection models lack either one of these qualities, i.e., higher accuracy is achieved at the expense of speed and vice versa. This makes them unfit for real-time deployment, where both speed and accuracy are equally important. Also, occlusion is an inevitable factor that makes detection more complex and affects the system's accuracy. Furthermore, there is no dedicated model for vehicle detection. This study proposes a better one-stage vehicle detection network, DETR-SPP, based on bipartite matching and a transformer encoder-decoder architecture. The feature extraction network, the Convolutional Neural Network (CNN), of the DEtection TRansformer (DETR) object detection model is modified to increase the real-time detection speed and accuracy. The spatial pyramid pooling concept is added to remove the fixed-size constraint and increase the learning capacity of the network. The network is trained only with vehicle classes from the MS COCO 2017 dataset, such as bus, car, motorcycle, and truck. When compared with the other state-of-the-art models, DETR-SPP gives higher accuracy in real-time vehicle detection. On the MS COCO 2017 dataset, the proposed model achieves a better mAP of 51.31%, which is 5.19% higher as compared to the DETR baseline model. Moreover, the proposed DETR-SPP attained a p value of 0.03 while performing the Wilcoxon signed-rank test. Thus, the proposed DETR-SPP is a better model for vehicle detection.
引用
收藏
页码:25573 / 25594
页数:22
相关论文
共 50 条
  • [1] DETR-SPP: a fine-tuned vehicle detection with transformer
    Krishnendhu S P
    Prabu Mohandas
    Multimedia Tools and Applications, 2024, 83 : 25573 - 25594
  • [2] Fine-Tuned Transformer Model for Sentiment Analysis
    Liu, Sishun
    Shuai, Pengju
    Zhang, Xiaowu
    Chen, Shuang
    Li, Li
    Liu, Ming
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2020), PT II, 2020, 12275 : 336 - 343
  • [3] Fine-Tuned Understanding: Enhancing Social Bot Detection With Transformer-Based Classification
    Sallah, Amine
    Alaoui, El Arbi Abdellaoui
    Agoujil, Said
    Wani, Mudasir Ahmad
    Hammad, Mohamed
    Maleh, Yassine
    Abd El-Latif, Ahmed A.
    IEEE ACCESS, 2024, 12 : 118250 - 118269
  • [4] Multi-Level Fine-Tuned Transformer for Gait Recognition
    Wu, Huimin
    Zhao, Aite
    2022 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY, HUMAN-COMPUTER INTERACTION AND ARTIFICIAL INTELLIGENCE, VRHCIAI, 2022, : 83 - 89
  • [5] The fine-tuned universe
    Theunis, Andre
    Gidion, Gunnar
    STRAD, 2019, 130 (1550): : 66 - 67
  • [6] Fine-tuned canoes
    Logan, A
    NEW SCIENTIST, 2002, 174 (2337) : 51 - 51
  • [7] Fine-tuned kraft
    Papermaker, 1996, 59 (03):
  • [8] THE FINE-TUNED ORGANIZATION
    HAMMONS, C
    MADDUX, GA
    QUALITY PROGRESS, 1992, 25 (02) : 47 - 48
  • [9] Fine-tuned antifungals
    Naomi Attar
    Nature Reviews Microbiology, 2015, 13 (7) : 398 - 398
  • [10] FINE-TUNED OF NECESSITY?
    Page, Ben
    RES PHILOSOPHICA, 2018, 95 (04) : 663 - 692