HCFormer: A Lightweight Pest Detection Model Combining CNN and ViT

被引:1
|
作者
Zeng, Meiqi [1 ]
Chen, Shaonan [1 ]
Liu, Hongshan [1 ]
Wang, Weixing [2 ]
Xie, Jiaxing [1 ,3 ]
机构
[1] South China Agr Univ, Coll Elect Engn, Coll Artificial Intelligence, Guangzhou 510642, Peoples R China
[2] South China Agr Univ, Zhujiang Coll, Guangzhou 510900, Peoples R China
[3] Engn Res Ctr Monitoring Agr Informat Guangdong Pro, Guangzhou 510642, Peoples R China
来源
AGRONOMY-BASEL | 2024年 / 14卷 / 09期
关键词
pest detection; image processing; deep learning; vision transformer; lightweight; INSECT PESTS;
D O I
10.3390/agronomy14091940
中图分类号
S3 [农学(农艺学)];
学科分类号
0901 ;
摘要
Pests are widely distributed in nature, characterized by their small size, which, along with environmental factors such as lighting conditions, makes their identification challenging. A lightweight pest detection network, HCFormer, combining convolutional neural networks (CNNs) and a vision transformer (ViT) is proposed in this study. Data preprocessing is conducted using a bottleneck-structured convolutional network and a Stem module to reduce computational latency. CNNs with various kernel sizes capture local information at different scales, while the ViT network's attention mechanism and global feature extraction enhance pest feature representation. A down-sampling method reduces the input image size, decreasing computational load and preventing overfitting while enhancing model robustness. Improved attention mechanisms effectively capture feature relationships, balancing detection accuracy and speed. The experimental results show that HCFormer achieves 98.17% accuracy, 91.98% recall, and a mean average precision (mAP) of 90.57%. Compared with SENet, CrossViT, and YOLOv8, HCFormer improves the average accuracy by 7.85%, 2.01%, and 3.55%, respectively, outperforming the overall mainstream detection models. Ablation experiments indicate that the model's parameter count is 26.5 M, demonstrating advantages in lightweight design and detection accuracy. HCFormer's efficiency and flexibility in deployment, combined with its high detection accuracy and precise classification, make it a valuable tool for identifying and classifying crop pests in complex environments, providing essential guidance for future pest monitoring and control.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] A lightweight hybrid CNN-LSTM explainable model for ECG-based arrhythmia detection
    Alamatsaz, Negin
    Tabatabaei, Leyla
    Yazdchi, Mohammadreza
    Payan, Hamidreza
    Alamatsaz, Nima
    Nasimi, Fahimeh
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 90
  • [42] Lightweight convolutional neural network (CNN) model for obesity early detection using thermal images
    Leo, Hendrik
    Saddami, Khairun
    Roslidar, Rusdha
    Muharar, Rusdha
    Munadi, Khairul
    Arnia, Fitri
    DIGITAL HEALTH, 2024, 10
  • [43] A Lightweight Crop Pest Detection Method Based on Improved RTMDet
    Wang, Wanqing
    Fu, Haoyue
    INFORMATION, 2024, 15 (09)
  • [44] A Lightweight Quantized CNN Model for Plant Disease Recognition
    Abdul Fattah Rakib
    Rashik Rahman
    Alim Al Razi
    A. S. M. Touhidul Hasan
    Arabian Journal for Science and Engineering, 2024, 49 : 4097 - 4108
  • [45] Lightweight ViT Model for Micro-Expression Recognition Enhanced by Transfer Learning
    Liu, Yanju
    Li, Yange
    Yi, Xinhai
    Hu, Zuojin
    Zhang, Huiyu
    Liu, Yanzhong
    FRONTIERS IN NEUROROBOTICS, 2022, 16
  • [46] Facial Expression Recognition Based on A Lightweight CNN Model
    Chen, Qianqian
    Jing, Xiaojun
    Zhang, Fangpei
    Mu, Junsheng
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2022,
  • [47] A Lightweight Quantized CNN Model for Plant Disease Recognition
    Rakib, Abdul Fattah
    Rahman, Rashik
    Al Razi, Alim
    Hasan, A. S. M. Touhidul
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024, 49 (03) : 4097 - 4108
  • [48] Facial Expression Recognition Based on A Lightweight CNN Model
    Chen, Qianqian
    Jing, Xiaojun
    Zhang, Fangpei
    Mu, Junsheng
    IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, BMSB, 2022, 2022-June
  • [49] A Lightweight Hybrid Model with Location-Preserving ViT for Efficient Food Recognition
    Sheng, Guorui
    Min, Weiqing
    Zhu, Xiangyi
    Xu, Liang
    Sun, Qingshuo
    Yang, Yancun
    Wang, Lili
    Jiang, Shuqiang
    NUTRIENTS, 2024, 16 (02)
  • [50] Pest-YOLO: A YOLOv5-Based Lightweight Crop Pest Detection Algorithm
    Luo, Wanbo
    INTERNATIONAL JOURNAL OF ENGINEERING AND TECHNOLOGY INNOVATION, 2025, 15 (01) : 11 - 25