A wheat spike detection method based on Transformer

被引:17
|
作者
Zhou, Qiong [1 ,2 ,3 ]
Huang, Ziliang [1 ,2 ]
Zheng, Shijian [1 ,4 ]
Jiao, Lin [1 ,5 ]
Wang, Liusan [1 ]
Wang, Rujing [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Intelligent Machines, Hefei Inst Phys Sci, Hefei, Peoples R China
[2] Univ Sci & Technol China, Sci Isl Branch, Hefei, Peoples R China
[3] Anhui Agr Univ, Coll Informat & Comp, Hefei, Peoples R China
[4] Univ Sci & Technol, Dept Informat Engn Southwest, Mianyang, Peoples R China
[5] Anhui Univ, Sch Internet, Hefei, Peoples R China
来源
基金
国家重点研发计划;
关键词
deep learning; IoU loss function; transformer; wheat spike detection; agriculture; DENSITY;
D O I
10.3389/fpls.2022.1023924
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Wheat spike detection has important research significance for production estimation and crop field management. With the development of deep learning-based algorithms, researchers tend to solve the detection task by convolutional neural networks (CNNs). However, traditional CNNs equip with the inductive bias of locality and scale-invariance, which makes it hard to extract global and long-range dependency. In this paper, we propose a Transformer-based network named Multi-Window Swin Transformer (MW-Swin Transformer). Technically, MW-Swin Transformer introduces the ability of feature pyramid network to extract multi-scale features and inherits the characteristic of Swin Transformer that performs self-attention mechanism by window strategy. Moreover, bounding box regression is a crucial step in detection. We propose a Wheat Intersection over Union loss by incorporating the Euclidean distance, area overlapping, and aspect ratio, thereby leading to better detection accuracy. We merge the proposed network and regression loss into a popular detection architecture, fully convolutional one-stage object detection, and name the unified model WheatFormer. Finally, we construct a wheat spike detection dataset (WSD-2022) to evaluate the performance of the proposed methods. The experimental results show that the proposed network outperforms those state-of-the-art algorithms with 0.459 mAP (mean average precision) and 0.918 AP(50). It has been proved that our Transformer-based method is effective to handle wheat spike detection under complex field conditions.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Research on transformer fault detection method based on a regression algorithm
    He X.
    Zhang Y.
    Cui G.
    Li B.
    Liang X.
    Wang H.
    Dianli Xitong Baohu yu Kongzhi/Power System Protection and Control, 2020, 48 (21): : 132 - 139
  • [22] Multiscale fire image detection method based on CNN and Transformer
    Shengbao Wu
    Buyun Sheng
    Gaocai Fu
    Daode Zhang
    Yuchao Jian
    Multimedia Tools and Applications, 2024, 83 : 49787 - 49811
  • [23] PCB Defect Detection Method Based on Transformer-YOLO
    Chen, Wei
    Huang, Zhongtian
    Mu, Qian
    Sun, Yi
    IEEE ACCESS, 2022, 10 : 129480 - 129489
  • [24] Broiler Behavior Detection and Tracking Method Based on Lightweight Transformer
    Qi, Haixia
    Chen, Zihong
    Liang, Guangsheng
    Chen, Riyao
    Jiang, Jinzhuo
    Luo, Xiwen
    APPLIED SCIENCES-BASEL, 2025, 15 (06):
  • [25] Transformer Oil Leakage Detection Based on Loop Training Method
    Bao W.
    Gu L.
    He J.
    Kuai Y.
    Liu W.
    Huang W.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (03): : 431 - 438
  • [26] Multiscale fire image detection method based on CNN and Transformer
    Wu, Shengbao
    Sheng, Buyun
    Fu, Gaocai
    Zhang, Daode
    Jian, Yuchao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (16) : 49787 - 49811
  • [27] Transformer learning-based efficient MIMO detection method
    Burera
    Ahmed, Saleem
    Kim, Sooyoung
    PHYSICAL COMMUNICATION, 2025, 70
  • [28] Wheat-Net: An Automatic Dense Wheat Spike Segmentation Method Based on an Optimized Hybrid Task Cascade Model
    Zhang, Jiajing
    Min, An
    Steffenson, Brian J.
    Su, Wen-Hao
    Hirsch, Cory D.
    Anderson, James
    Wei, Jian
    Ma, Qin
    Yang, Ce
    FRONTIERS IN PLANT SCIENCE, 2022, 13
  • [29] Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis
    Yan, Jiawei
    Zhao, Jianqing
    Cai, Yucheng
    Wang, Suwan
    Qiu, Xiaolei
    Yao, Xia
    Tian, Yongchao
    Zhu, Yan
    Cao, Weixing
    Zhang, Xiaohu
    PLANT METHODS, 2023, 19 (01)
  • [30] Improving multi-scale detection layers in the deep learning network for wheat spike detection based on interpretive analysis
    Jiawei Yan
    Jianqing Zhao
    Yucheng Cai
    Suwan Wang
    Xiaolei Qiu
    Xia Yao
    Yongchao Tian
    Yan Zhu
    Weixing Cao
    Xiaohu Zhang
    Plant Methods, 19