HCFormer: A Lightweight Pest Detection Model Combining CNN and ViT

Cited by: 1
Authors
Zeng, Meiqi [1]
Chen, Shaonan [1]
Liu, Hongshan [1]
Wang, Weixing [2]
Xie, Jiaxing [1, 3]
Affiliations
[1] South China Agr Univ, Coll Elect Engn, Coll Artificial Intelligence, Guangzhou 510642, Peoples R China
[2] South China Agr Univ, Zhujiang Coll, Guangzhou 510900, Peoples R China
[3] Engn Res Ctr Monitoring Agr Informat Guangdong Pro, Guangzhou 510642, Peoples R China
Source
AGRONOMY-BASEL | 2024 / Volume 14 / Issue 09
Keywords
pest detection; image processing; deep learning; vision transformer; lightweight; INSECT PESTS
DOI
10.3390/agronomy14091940
Chinese Library Classification (CLC) number
S3 [Agricultural science (agronomy)]
Discipline classification code
0901
Abstract
Pests are widely distributed in nature, and their small size, together with environmental factors such as lighting conditions, makes them difficult to identify. This study proposes HCFormer, a lightweight pest detection network that combines convolutional neural networks (CNNs) with a vision transformer (ViT). Data preprocessing uses a bottleneck-structured convolutional network and a Stem module to reduce computational latency. CNNs with various kernel sizes capture local information at different scales, while the ViT's attention mechanism and global feature extraction enhance pest feature representation. A down-sampling method reduces the input image size, which decreases the computational load, helps prevent overfitting, and improves model robustness. Improved attention mechanisms capture feature relationships effectively, balancing detection accuracy and speed. Experimental results show that HCFormer achieves 98.17% accuracy, 91.98% recall, and a mean average precision (mAP) of 90.57%. Compared with SENet, CrossViT, and YOLOv8, HCFormer improves the average accuracy by 7.85%, 2.01%, and 3.55%, respectively, outperforming mainstream detection models overall. Ablation experiments indicate that the model has 26.5 M parameters, demonstrating advantages in both lightweight design and detection accuracy. HCFormer's efficiency and flexibility in deployment, combined with its high detection accuracy and precise classification, make it a valuable tool for identifying and classifying crop pests in complex environments and provide essential guidance for future pest monitoring and control.
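The abstract pairs convolutions of several kernel sizes (local, multi-scale features) with an attention mechanism (global relationships). The sketch below is only a toy illustration of that general idea on a 1D signal, not the HCFormer architecture: the function names, the kernel sizes (3, 5, 7), the fixed averaging kernels, and the identity query/key/value projections are all assumptions made for illustration, whereas a real model would learn its kernels and projections and operate on 2D images.

```python
import math

def conv1d_same(signal, kernel):
    """Zero-padded 1D cross-correlation that keeps the input length (odd kernel sizes)."""
    pad = len(kernel) // 2
    padded = [0.0] * pad + list(signal) + [0.0] * pad
    return [sum(k * padded[i + j] for j, k in enumerate(kernel))
            for i in range(len(signal))]

def multi_scale_features(signal, kernel_sizes=(3, 5, 7)):
    """Run one averaging kernel per size and stack the results per position.

    Assumption: fixed averaging kernels stand in for learned convolution weights.
    """
    branches = [conv1d_same(signal, [1.0 / k] * k) for k in kernel_sizes]
    return [list(feats) for feats in zip(*branches)]

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(tokens):
    """Scaled dot-product self-attention with identity projections (Q = K = V).

    Assumption: a real ViT block would apply learned linear projections first.
    """
    d = len(tokens[0])
    out = []
    for q in tokens:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in tokens]
        weights = softmax(scores)
        out.append([sum(w * v[c] for w, v in zip(weights, tokens))
                    for c in range(d)])
    return out

# Two isolated "small object" responses in an otherwise empty signal.
signal = [0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 2.0, 0.0]
local = multi_scale_features(signal)   # per-position features at three scales
mixed = self_attention(local)          # every position attends to all positions
```

Each row of `local` holds the responses of the three kernel sizes at one position, and each row of `mixed` is a weighted average of all rows of `local`, which is how attention lets distant positions inform one another.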
Pages: 21