A wheat spike detection method based on Transformer

被引:17
|
作者
Zhou, Qiong [1 ,2 ,3 ]
Huang, Ziliang [1 ,2 ]
Zheng, Shijian [1 ,4 ]
Jiao, Lin [1 ,5 ]
Wang, Liusan [1 ]
Wang, Rujing [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Intelligent Machines, Hefei Inst Phys Sci, Hefei, Peoples R China
[2] Univ Sci & Technol China, Sci Isl Branch, Hefei, Peoples R China
[3] Anhui Agr Univ, Coll Informat & Comp, Hefei, Peoples R China
[4] Univ Sci & Technol, Dept Informat Engn Southwest, Mianyang, Peoples R China
[5] Anhui Univ, Sch Internet, Hefei, Peoples R China
来源
基金
国家重点研发计划;
关键词
deep learning; IoU loss function; transformer; wheat spike detection; agriculture; DENSITY;
D O I
10.3389/fpls.2022.1023924
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Wheat spike detection has important research significance for production estimation and crop field management. With the development of deep learning-based algorithms, researchers tend to solve the detection task by convolutional neural networks (CNNs). However, traditional CNNs equip with the inductive bias of locality and scale-invariance, which makes it hard to extract global and long-range dependency. In this paper, we propose a Transformer-based network named Multi-Window Swin Transformer (MW-Swin Transformer). Technically, MW-Swin Transformer introduces the ability of feature pyramid network to extract multi-scale features and inherits the characteristic of Swin Transformer that performs self-attention mechanism by window strategy. Moreover, bounding box regression is a crucial step in detection. We propose a Wheat Intersection over Union loss by incorporating the Euclidean distance, area overlapping, and aspect ratio, thereby leading to better detection accuracy. We merge the proposed network and regression loss into a popular detection architecture, fully convolutional one-stage object detection, and name the unified model WheatFormer. Finally, we construct a wheat spike detection dataset (WSD-2022) to evaluate the performance of the proposed methods. The experimental results show that the proposed network outperforms those state-of-the-art algorithms with 0.459 mAP (mean average precision) and 0.918 AP(50). It has been proved that our Transformer-based method is effective to handle wheat spike detection under complex field conditions.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Photovoltaic arc fault detection method based on transformer voltage signal
    Chen Y.
    Xiong L.
    Fan Y.
    Liu X.
    Guo K.
    Chen, Yonghui (chenyonghui426@163.com), 1600, Science Press (42): : 68 - 75
  • [42] A multi-label classification method based on transformer for deepfake detection
    Deng, Liwei
    Zhu, Yunlong
    Zhao, Dexu
    Chen, Fei
    IMAGE AND VISION COMPUTING, 2024, 152
  • [43] Tetrode Spike Detection Method Based on Quaternion Principle Component Feature Extraction
    Zhao, Yong
    Wang, Yibo
    Tan, Ailing
    PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON MECHATRONICS, ELECTRONIC, INDUSTRIAL AND CONTROL ENGINEERING, 2014, 5 : 456 - +
  • [44] An automatic detection method to the field wheat based on image processing
    Wang, Yu
    Cao, Zhiguo
    Bai, Xiaodong
    Yu, Zhenghong
    Li, Yanan
    MIPPR 2013: AUTOMATIC TARGET RECOGNITION AND NAVIGATION, 2013, 8918
  • [45] A novel detection method for wheat aging based on the delayed luminescence
    Gong Yue-hong
    Liu Yu-kun
    Gong Zhi-le
    Zhong Xiao-yan
    Zhao Wei-ting
    Li Bing
    Ge Hong-yi
    Lyu Qiong-shuai
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [46] A novel detection method for wheat aging based on the delayed luminescence
    Gong Yue-hong
    Liu Yu-kun
    Gong Zhi-le
    Zhong Xiao-yan
    Zhao Wei-ting
    Li Bing
    Ge Hong-yi
    Lyu Qiong-shuai
    Scientific Reports, 14
  • [48] Spike-driven Transformer
    Yao, Man
    Hu, Jiakui
    Zhou, Zhaokun
    Yuan, Li
    Tian, Yonghong
    Xu, Bo
    Li, Guoqi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [49] Automatic detection and counting of wheat spike based on DMseg-Count (vol 15, 4103 , 2025)
    Zang, Hecang
    Peng, Yilong
    Zhou, Meng
    Li, Guoqiang
    Zheng, Guoqing
    Shen, Hualei
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [50] A Hybrid Wheat Head Detection model with Incorporated CNN and Transformer
    Harada, Sho
    Han, Xian-Hua
    2023 18TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND APPLICATIONS, MVA, 2023,