Using machine learning for crop yield prediction in the past or the future

Cited by: 27
Authors
Morales, Alejandro [1]
Villalobos, Francisco J. [2, 3]
Affiliations
[1] Wageningen Univ & Res, Ctr Crop Syst Anal, Plant Sci Grp, Wageningen, Netherlands
[2] Consejo Super Invest Cient IAS CSIC, Inst Agr Sostenible, Cordoba, Spain
[3] Univ Cordoba, Dept Agron, ETSIAM, Cordoba, Spain
Source
Keywords
machine learning; crop simulation model; wheat; sunflower; DSSAT; neural network; ARTIFICIAL NEURAL-NETWORKS; WHEAT; MODEL
DOI
10.3389/fpls.2023.1128388
Chinese Library Classification (CLC) number
Q94 [Botany]
Discipline classification code
071001
Abstract
The use of machine learning (ML) in agronomy has increased exponentially since the start of the century, including data-driven predictions of crop yields from farm-level information on soil, climate and management. However, little is known about the effect of data partitioning schemes on the actual performance of the models, especially when they are built for yield forecasting. In this study, we explore the effect of the choice of predictive algorithm, amount of data, and data partitioning strategy on predictive performance, using synthetic datasets from biophysical crop models. We simulated sunflower and wheat data using OilcropSun and Ceres-Wheat from DSSAT for the period 2001-2020 in five areas of Spain. Simulations were performed for farms differing in soil depth and management. The dataset of simulated farm yields was analyzed with different algorithms (regularized linear models, random forest, artificial neural networks) as a function of seasonal weather, management, and soil. The analysis was performed with Keras for neural networks and R packages for all other algorithms. Data were partitioned for training and testing in chronological order (i.e., older data for training, newest data for testing) in order to compare the algorithms' ability to predict future yields by extrapolating from past data. The random forest algorithm performed better (root mean square error, RMSE, of 35-38%) than artificial neural networks (37-141%) and regularized linear models (64-65%) and was easier to execute. However, even the best models showed a limited advantage over the predictions of a sensible baseline (the average yield of the farm in the training set), which had an RMSE of 42%. Errors in seasonal weather forecasting were not taken into account, so real-world performance is expected to be even closer to the baseline.
Applications of AI algorithms for yield prediction should always include a comparison with the best guess to evaluate whether the increase in predictive power compensates for the additional cost of the data required by the model. Random partitioning of data for training and validation should be avoided in models for yield forecasting. Crop models validated for the region and cultivars of interest may be used before actual data collection to establish the potential advantage, as illustrated in this study.
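The evaluation scheme described in the abstract (chronological partitioning plus a farm-average baseline) can be sketched in a few lines. This is only an illustrative sketch, not the authors' code: the toy data are randomly generated, the function names (`temporal_split`, `farm_mean_baseline`, `relative_rmse`) and the 2016 cutoff year are hypothetical, and expressing RMSE as a percentage of the mean observed yield is an assumption about how the percentages in the abstract are normalized.

```python
import math
import random
from collections import defaultdict


def relative_rmse(observed, predicted):
    """RMSE as a percentage of the mean observed value (assumed normalization)."""
    mse = sum((o - p) ** 2 for o, p in zip(observed, predicted)) / len(observed)
    return 100.0 * math.sqrt(mse) / (sum(observed) / len(observed))


def temporal_split(records, first_test_year):
    """Older seasons go to training, the newest seasons to testing."""
    train = [r for r in records if r["year"] < first_test_year]
    test = [r for r in records if r["year"] >= first_test_year]
    return train, test


def farm_mean_baseline(train):
    """Best-guess predictor: average training-set yield of each farm."""
    by_farm = defaultdict(list)
    for r in train:
        by_farm[r["farm"]].append(r["yield"])
    return {farm: sum(v) / len(v) for farm, v in by_farm.items()}


# Toy data (hypothetical): 10 farms, seasons 2001-2020, yields in kg/ha.
rng = random.Random(0)
records = []
for farm in range(10):
    farm_level = rng.uniform(1500, 4000)  # farm-specific yield level
    for year in range(2001, 2021):
        season_effect = rng.uniform(0.4, 1.6)  # stand-in for weather variability
        records.append({"farm": farm, "year": year,
                        "yield": farm_level * season_effect})

# Hypothetical cutoff: train on 2001-2015, test on 2016-2020.
train, test = temporal_split(records, first_test_year=2016)
baseline = farm_mean_baseline(train)

observed = [r["yield"] for r in test]
predicted = [baseline[r["farm"]] for r in test]
baseline_rrmse = relative_rmse(observed, predicted)
print(f"Baseline relative RMSE on future seasons: {baseline_rrmse:.1f}%")
```

Any candidate model would be scored on the same `test` seasons with `relative_rmse`, and its error compared against `baseline_rrmse` before deciding whether the extra data collection is worthwhile.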
Pages: 13
Related papers
50 in total
  • [41] CROP YIELD PREDICTION: AN OPERATIONAL APPROACH TO CROP YIELD MODELING ON FIELD AND SUBFIELD LEVEL WITH MACHINE LEARNING MODELS
    Helber, Patrick
    Bischke, Benjamin
    Habelitz, Peter
    Sanchez, Cristhian
    Pathak, Deepak
    Miranda, Miro
    Najjar, Hiba
    Mena, Francisco
    Siddamsetty, Jayanth
    Arenas, Diego
    Vollmer, Michaela
    Charfuelan, Marcela
    Nuske, Marlon
    Dengel, Andreas
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023: 2763-2766
  • [42] Crop Prediction Model Using Machine Learning Algorithms
    Elbasi, Ersin
    Zaki, Chamseddine
    Topcu, Ahmet E.
    Abdelbaki, Wiem
    Zreikat, Aymen I.
    Cina, Elda
    Shdefat, Ahmed
    Saker, Louai
    APPLIED SCIENCES-BASEL, 2023, 13 (16)
  • [43] Coupling machine learning and crop modeling improves crop yield prediction in the US Corn Belt
    Mohsen Shahhosseini
    Guiping Hu
    Isaiah Huber
    Sotirios V. Archontoulis
    Scientific Reports, 11
  • [44] Coupling machine learning and crop modeling improves crop yield prediction in the US Corn Belt
    Shahhosseini, Mohsen
    Hu, Guiping
    Huber, Isaiah
    Archontoulis, Sotirios V.
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [45] Enrichment of Crop Yield Prophecy Using Machine Learning Algorithms
    Grace, R. Kingsy
    Induja, K.
    Lincy, M.
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 31 (01): 279-296
  • [46] County-scale crop yield prediction by integrating crop simulation with machine learning models
    Sajid, Saiara Samira
    Shahhosseini, Mohsen
    Huber, Isaiah
    Hu, Guiping
    Archontoulis, Sotirios V.
    FRONTIERS IN PLANT SCIENCE, 2022, 13
  • [47] Recommendations of crop yield and fertilizers using machine learning algorithm
    Senapati, Biswa Ranjan
    Sanskar
    Trishna, Aditi
    Swain, Rakesh Ranjan
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2022, 43 (05): 1029-1037
  • [48] Comparative Analysis of Machine Learning Models for Crop Yield Prediction Across Multiple Crop Types
    Yashraj Patil
    Harikrishnan Ramachandran
    Sridhevi Sundararajan
    P. Srideviponmalar
    SN Computer Science, 6 (1)
  • [49] Crop Yield Management System Using Machine Learning Techniques
    Senthilnayaki, B.
    Narashiman, D.
    Mahalakshmi, G.
    Therese, Julie M.
    Devi, A.
    Dharanyadevi, P.
    2021 IEEE INTERNATIONAL CONFERENCE ON MOBILE NETWORKS AND WIRELESS COMMUNICATIONS (ICMNWC), 2021
  • [50] A proposed framework for crop yield prediction using hybrid feature selection approach and optimized machine learning
    Abdel-salam, Mahmoud
    Kumar, Neeraj
    Mahajan, Shubham
    Neural Computing and Applications, 2024, 36 (33): 20723-20750