A methods guideline for deep learning for tabular data in agriculture with a case study to forecast cereal yield

被引:18
|
作者
Richetti, Jonathan [1 ]
Diakogianis, Foivos I. [2 ]
Bender, Asher [3 ]
Colaco, Andre F. [4 ]
Lawes, Roger A. [1 ]
机构
[1] CSIRO, 147 Underwood Ave, Floreat, WA 6014, Australia
[2] Data61, CSIRO, 26 Dick Perry Ave, Kensington, WA 6151, Australia
[3] Univ Sydney, Australian Ctr Field Robot, Camperdown, NSW 2006, Australia
[4] CSIRO, Waite Campus,Locked Bag 2, Glen Osmond, SA 5064, Australia
关键词
Machine learning; Artificial neural network; Multi -layer perceptron; Random forest; Xgboost; Tabnet; Wheat; Barley; NEURAL-NETWORKS; NITROGEN; PREDICTION; MODELS;
D O I
10.1016/j.compag.2023.107642
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Machine learning (ML) and its branch, deep learning (DL), is rapidly evolving and gaining popularity as it outperforms other, more traditional methods in different areas of agriculture. However, ML and DL techniques must be correctly applied to a problem to produce an acceptable solution. This article provides guidelines for using DL techniques with a case study using different models/methods to forecast yields in cereals; some of the concepts presented here are also applicable to ML more broadly. The objective is to provide clarity for new users around the use of DL techniques to solve agronomic problems. DL concepts are introduced; best practices for data pre-processing steps and metrics are recommended. Cross-validation is clarified, and its importance is high-lighted. It is shown that DL performance can vary with architecture and that the optimal choice is task -dependent. Emphasis on practical aspects for applying DL models for agricultural datasets is provided, such as dataset size (26 representative samples in each field sufficed) and cross-validation (indispensable on small datasets). Lastly, a standard guideline for DL applied to tabular data is recommended.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Recent deep learning methods for tabular data
    Hwang, Yejin
    Song, Jongwoo
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2023, 30 (02) : 215 - 226
  • [2] Predicting stroke outcome: A case for multimodal deep learning methods with tabular and CT Perfusion data
    Borsos, Balazs
    Allaart, Corinne G.
    van Halteren, Aart
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2024, 147
  • [3] MACHINE LEARNING APPROACHESIN AGRICULTURE: A TABULAR STUDY
    Kumar, Ashok
    Yadav, Rakesh Kumar
    INTERNATIONAL JOURNAL OF EARLY CHILDHOOD SPECIAL EDUCATION, 2022, 14 (05) : 3638 - 3644
  • [4] Revisiting Deep Learning Models for Tabular Data
    Gorishniy, Yury
    Rubachev, Ivan
    Khrulkov, Valentin
    Babenko, Artem
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [5] LocalGLMnet: interpretable deep learning for tabular data
    Richman, Ronald
    Wuethrich, Mario, V
    SCANDINAVIAN ACTUARIAL JOURNAL, 2023, 2023 (01) : 71 - 95
  • [6] Is Deep Learning on Tabular Data Enough? An Assessment
    Fayaz, Sheikh Amir
    Zaman, Majid
    Kaul, Sameer
    Butt, Muheet Ahmed
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (04) : 466 - 473
  • [7] Tabular data: Deep learning is not all you need
    Shwartz-Ziv, Ravid
    Armon, Amitai
    INFORMATION FUSION, 2022, 81 : 84 - 90
  • [8] Investigating Group Distributionally Robust Optimization for Deep Imbalanced Learning: A Case Study of Binary Tabular Data Classification
    Mustapha, Ismail. B.
    Hasan, Shafaatunnur
    Nabbus, Hatem S. Y.
    Montaser, Mohamed Mostafa Ali
    Olatunji, Sunday Olusanya
    Shamsuddin, Siti Maryam
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (02) : 739 - 748
  • [9] TableNN: Deep Learning Framework for Learning Domain Specific Tabular Data
    Sankhe, Pranav
    Khabiri, Elham
    Agrawal, Bhavna
    Li, Yingjie
    Proceedings - 2021 IEEE International Conference on Big Data, Big Data 2021, 2021, : 4097 - 4102
  • [10] TableNN: Deep Learning Framework for Learning Domain Specific Tabular Data
    Sankhe, Pranav
    Khabiri, Elham
    Agrawal, Bhavna
    Li, Yingjie
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 4097 - 4102