Combining Multi-Source Data and Machine Learning Approaches to Predict Winter Wheat Yield in the Conterminous United States

Cited by: 102
Authors
Wang, Yumiao [1 ,2 ]
Zhang, Zhou [1 ]
Feng, Luwei [1 ,2 ]
Du, Qingyun [2 ,3 ,4 ,5 ]
Runge, Troy [1 ]
Affiliations
[1] Univ Wisconsin, Biol Syst Engn, Madison, WI 53706 USA
[2] Wuhan Univ, Sch Resources & Environm Sci, Wuhan 430079, Peoples R China
[3] Wuhan Univ, Key Lab GIS, Minist Educ, Wuhan 430079, Peoples R China
[4] Wuhan Univ, Key Lab Digital Mapping & Land Informat Applicat, Natl Adm Surveying Mapping & Geoinformat, Wuhan 430079, Peoples R China
[5] Wuhan Univ, Collaborat Innovat Ctr Geospatial Technol, Wuhan 430079, Peoples R China
Funding
National Institute of Food and Agriculture (U.S.);
Keywords
Winter wheat; yield prediction; machine learning; multi-source data; CONUS; MODIS-NDVI; VEGETATION INDEXES; NEURAL-NETWORKS; MAIZE YIELD; MODEL; SATELLITE; TEMPERATURE; PERFORMANCE; SIMULATION; RESPONSES;
DOI
10.3390/rs12081232
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science];
Subject classification code
08 ; 0830 ;
Abstract
Winter wheat (Triticum aestivum L.) is one of the most important cereal crops, supplying essential food for the world population. Because the United States (U.S.) is a major producer and exporter of wheat to the world market, accurate and timely forecasting of U.S. wheat yield is fundamental to national crop management as well as global food security. Previous studies have mainly focused on developing empirical models using only satellite remote sensing images, while other yield determinants have not yet been adequately explored. In addition, these models are based on traditional statistical regression algorithms, while more advanced machine learning approaches have not been explored. To address these issues, this study used advanced machine learning algorithms to establish within-season yield prediction models for winter wheat using multi-source data. Specifically, yield driving factors were extracted from four different data sources: satellite images, climate data, soil maps, and historical yield records. Subsequently, two linear regression methods, ordinary least squares (OLS) and least absolute shrinkage and selection operator (LASSO), and four well-known machine learning methods, support vector machine (SVM), random forest (RF), Adaptive Boosting (AdaBoost), and deep neural network (DNN), were applied and compared for estimating the county-level winter wheat yield in the Conterminous United States (CONUS) within the growing season. The models were trained on data from 2008 to 2016 and evaluated on data from 2017 and 2018. The results demonstrated that the machine learning approaches performed better than the linear regression models, with the best performance achieved by the AdaBoost model (R² = 0.86, RMSE = 0.51 t/ha, MAE = 0.39 t/ha). Additionally, combining data from multiple sources outperformed single-source satellite data, with the highest accuracy obtained when all four data sources were used in model development. Finally, the prediction accuracy was also evaluated against timeliness within the growing season: reliable predictions (R² > 0.84) were achievable 2.5 months before harvest when the multi-source data were combined.
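The evaluation protocol described in the abstract (training on earlier years, testing on later years, and scoring with R², RMSE, and MAE) can be illustrated with a minimal sketch. This is not the authors' code: the record layout, the field names (`year`, `yield_t_ha`, `pred_t_ha`), the helper `split_by_year`, and the toy values are all hypothetical, and the predictions are hard-coded rather than produced by any of the models named above.

```python
import math


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def rmse(y_true, y_pred):
    """Root mean squared error, in the units of the target (here t/ha)."""
    n = len(y_true)
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n)


def mae(y_true, y_pred):
    """Mean absolute error, in the units of the target (here t/ha)."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def split_by_year(records, train_years, test_years):
    """Temporal split: no test-year record leaks into the training set."""
    train = [r for r in records if r["year"] in train_years]
    test = [r for r in records if r["year"] in test_years]
    return train, test


# Hypothetical toy records (one per county-year); values are made up.
records = [
    {"year": 2015, "yield_t_ha": 3.0, "pred_t_ha": 3.2},
    {"year": 2016, "yield_t_ha": 4.0, "pred_t_ha": 3.9},
    {"year": 2017, "yield_t_ha": 2.0, "pred_t_ha": 2.5},
    {"year": 2017, "yield_t_ha": 4.0, "pred_t_ha": 3.5},
    {"year": 2018, "yield_t_ha": 3.0, "pred_t_ha": 3.0},
    {"year": 2018, "yield_t_ha": 5.0, "pred_t_ha": 5.0},
]

# Same year ranges as in the abstract: train 2008-2016, test 2017-2018.
train, test = split_by_year(records, range(2008, 2017), range(2017, 2019))

y_true = [r["yield_t_ha"] for r in test]
y_pred = [r["pred_t_ha"] for r in test]

print("R2  :", r2_score(y_true, y_pred))
print("RMSE:", rmse(y_true, y_pred))
print("MAE :", mae(y_true, y_pred))
```

Splitting by year rather than at random mirrors a forecasting setting, where a model is scored only on seasons it has not seen.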
Pages: 21
Related papers
50 in total
  • [21] Improving the spatial and temporal estimation of ecosystem respiration using multi-source data and machine learning methods in a rainfed winter wheat cropland
    Lu, Ruhua
    Zhang, Pei
    Fu, Zhaopeng
    Jiang, Jie
    Wu, Jiancheng
    Cao, Qiang
    Tian, Yongchao
    Zhu, Yan
    Cao, Weixing
    Liu, Xiaojun
    SCIENCE OF THE TOTAL ENVIRONMENT, 2023, 871
  • [22] Estimation of Winter Wheat Yield in Arid and Semiarid Regions Based on Assimilated Multi-Source Sentinel Data and the CERES-Wheat Model
    Liu, Zhengchun
    Xu, Zhanjun
    Bi, Rutian
    Wang, Chao
    He, Peng
    Jing, Yaodong
    Yang, Wude
    SENSORS, 2021, 21 (04) : 1 - 16
  • [23] A Method for Identifying Geospatial Data Sharing Websites by Combining Multi-Source Semantic Information and Machine Learning
    Cheng, Quanying
    Zhu, Yunqiang
    Zeng, Hongyun
    Song, Jia
    Wang, Shu
    Zhang, Jinqu
    Qian, Lang
    Qi, Yanmin
    APPLIED SCIENCES-BASEL, 2021, 11 (18)
  • [24] Winter wheat yield prediction in the conterminous United States using solar-induced chlorophyll fluorescence data and XGBoost and random forest algorithm
    Joshi, Abhasha
    Pradhan, Biswajeet
    Chakraborty, Subrata
    Behera, Mukunda Dev
    ECOLOGICAL INFORMATICS, 2023, 77
  • [25] Use of remote sensing data for estimation of winter wheat yield in the United States
    Salazar, L.
    Kogan, F.
    Roytman, L.
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2007, 28 (17) : 3795 - 3811
  • [26] Oil palm yield prediction across blocks from multi-source data using machine learning and deep learning
    Yuhao Ang
    Helmi Zulhaidi Mohd Shafri
    Yang Ping Lee
    Shahrul Azman Bakar
    Haryati Abidin
    Mohd Umar Ubaydah Mohd Junaidi
    Shaiful Jahari Hashim
    Nik Norasma Che’Ya
    Mohd Roshdi Hassan
    Hwee San Lim
    Rosni Abdullah
    Yusri Yusup
    Syahidah Akmal Muhammad
    Sin Yin Teh
    Mohd Na’aim Samad
    Earth Science Informatics, 2022, 15 : 2349 - 2367
  • [27] Oil palm yield prediction across blocks from multi-source data using machine learning and deep learning
    Ang, Yuhao
    Shafri, Helmi Zulhaidi Mohd
    Lee, Yang Ping
    Bakar, Shahrul Azman
    Abidin, Haryati
    Junaidi, Mohd Umar Ubaydah Mohd
    Hashim, Shaiful Jahari
    Che'Ya, Nik Norasma
    Hassan, Mohd Roshdi
    San Lim, Hwee
    Abdullah, Rosni
    Yusup, Yusri
    Muhammad, Syahidah Akmal
    Teh, Sin Yin
    Samad, Mohd Na'aim
    EARTH SCIENCE INFORMATICS, 2022, 15 (04) : 2349 - 2367
  • [28] Mapping Himalayan leucogranites by machine learning using multi-source data
    Wang Z.
    Zuo R.
    Earth Science Frontiers, 2023, 30 (05) : 216 - 226
  • [29] Predicting Maize Yield at the Plot Scale of Different Fertilizer Systems by Multi-Source Data and Machine Learning Methods
    Meng, Linghua
    Liu, Huanjun
    Ustin, Susan L.
    Zhang, Xinle
    REMOTE SENSING, 2021, 13 (18)
  • [30] A Machine Learning Approach for Convective Initiation Detection Using Multi-source Data
    Liu, Xuan
    Chen, Haonan
    Han, Lei
    Ge, Yurong
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022 : 6518 - 6521