DETR-SPP: a fine-tuned vehicle detection with transformer

被引:4
|
作者
Krishnendhu, S. P. [1 ]
Mohandas, Prabu [1 ]
机构
[1] Natl Inst Technol Calicut, Dept Comp Sci & Engn, Intelligent Comp Lab, Kattangal, Kerala, India
关键词
Detection transformer; Intelligent transportation systems (ITS); Real-time vehicle detection; Spatial pyramid pooling (SPP); Bipartite matching;
D O I
10.1007/s11042-023-16502-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Real-time vehicle detection is the most challenging and crucial task in intelligent transportation systems. Speed and accuracy are the most anticipated qualities for a vehicle detection model. The existing real-time vehicle detection models lack either one of these qualities, i.e., higher accuracy is achieved at the expense of speed and vice versa. This makes them unfit for real-time deployment, where both speed and accuracy are equally important. Also, occlusion is an inevitable factor that makes detection more complex and affects the system's accuracy. Furthermore, there is no dedicated model for vehicle detection. This study proposes a better one-stage vehicle detection network, DETR-SPP, based on bipartite matching and a transformer encoder-decoder architecture. The feature extraction network, the Convolutional Neural Network (CNN), of the DEtection TRansformer (DETR) object detection model is modified to increase the real-time detection speed and accuracy. The spatial pyramid pooling concept is added to remove the fixed-size constraint and increase the learning capacity of the network. The network is trained only with vehicle classes from the MS COCO 2017 dataset, such as bus, car, motorcycle, and truck. When compared with the other state-of-the-art models, DETR-SPP gives higher accuracy in real-time vehicle detection. On the MS COCO 2017 dataset, the proposed model achieves a better mAP of 51.31%, which is 5.19% higher as compared to the DETR baseline model. Moreover, the proposed DETR-SPP attained a p value of 0.03 while performing the Wilcoxon signed-rank test. Thus, the proposed DETR-SPP is a better model for vehicle detection.
引用
收藏
页码:25573 / 25594
页数:22
相关论文
共 50 条
  • [21] Camel detection using fine-tuned YOLOv8
    Gasmi, Rim
    Chetoui, Mohamed
    Fahem, Messilva
    Benslimane, Heythem
    Akhloufi, Moulay A.
    PROGRAM OF THE 2ND INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND AUTOMATIC CONTROL, ICEEAC 2024, 2024,
  • [22] Exploring Generalizability of Fine-Tuned Models for Fake News Detection
    Suprem, Abhijit
    Vaidya, Sanjyot
    Pu, Calton
    2022 IEEE 8TH INTERNATIONAL CONFERENCE ON COLLABORATION AND INTERNET COMPUTING, CIC, 2022, : 82 - 88
  • [23] Supervised fine-tuned approach for automated detection of diabetic retinopathy
    Ohri, Kriti
    Kumar, Mukesh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (05) : 14259 - 14280
  • [24] Potato Blight Detection Using Fine-Tuned CNN Architecture
    Al-Adhaileh, Mosleh Hmoud
    Verma, Amit
    Aldhyani, Theyazn H. H.
    Koundal, Deepika
    MATHEMATICS, 2023, 11 (06)
  • [25] Genealogical Relationship Extraction from Unstructured Text Using Fine-Tuned Transformer Models
    Parrolivelli, Carloangello
    Stanchev, Lubomir
    2023 IEEE 17TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, ICSC, 2023, : 167 - 174
  • [26] Classification of Cleft Lip and Palate Speech Using Fine-Tuned Transformer Pretrained Models
    Bhattacharjee, Susmita
    Shekhawat, H. S.
    Prasanna, S. R. M.
    INTELLIGENT HUMAN COMPUTER INTERACTION, IHCI 2023, PT I, 2024, 14531 : 55 - 61
  • [27] Facial Expression Recognition Based on Fine-Tuned Channel-Spatial Attention Transformer
    Yao, Huang
    Yang, Xiaomeng
    Chen, Di
    Wang, Zhao
    Tian, Yuan
    SENSORS, 2023, 23 (15)
  • [28] Few-Shot Tabular Data Enrichment Using Fine-Tuned Transformer Architectures
    Harari, Asaf
    Katz, Gilad
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 1577 - 1591
  • [29] Exploring the Power of Deep Learning: Fine-Tuned Vision Transformer for Accurate and Efficient Brain Tumor Detection in MRI Scans
    Asiri, Abdullah A. A.
    Shaf, Ahmad
    Ali, Tariq
    Shakeel, Unza
    Irfan, Muhammad
    Mehdar, Khlood M. M.
    Halawani, Hanan Talal
    Alghamdi, Ali H. H.
    Alshamrani, Abdullah Fahad A.
    Alqhtani, Samar M. M.
    DIAGNOSTICS, 2023, 13 (12)
  • [30] Robust Vulnerability Detection in Solidity-Based Ethereum Smart Contracts Using Fine-Tuned Transformer Encoder Models
    Le, Thi-Thu-Huong
    Kim, Jaehyun
    Lee, Sangmyeong
    Kim, Howon
    IEEE ACCESS, 2024, 12 : 154700 - 154717