Using Machine Learning and Feature Selection for Alfalfa Yield Prediction

被引:31
|
作者
Whitmire, Christopher D. D. [1 ]
Vance, Jonathan M. M. [2 ]
Rasheed, Hend K. K.
Missaoui, Ali [3 ]
Rasheed, Khaled M. M. [1 ,2 ]
Maier, Frederick W. W. [1 ]
机构
[1] Univ Georgia, Inst Artificial Intelligence, 515 Boyd Grad Studies,200 DW Brooks Dr, Athens, GA 30602 USA
[2] Univ Georgia, Dept Comp Sci, 415 Boyd Grad Studies,200 D W Brooks Dr, Athens, GA 30602 USA
[3] Univ Georgia, Inst Plant Breeding Genet & Genom, Dept Crop & Soil Sci, 4317 Miller Plant Sci, Athens, GA 30602 USA
关键词
alfalfa; cross validation; feature selection; machine learning; regression; yield prediction;
D O I
10.3390/ai2010006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predicting alfalfa biomass and crop yield for livestock feed is important to the daily lives of virtually everyone, and many features of data from this domain combined with corresponding weather data can be used to train machine learning models for yield prediction. In this work, we used yield data of different alfalfa varieties from multiple years in Kentucky and Georgia, and we compared the impact of different feature selection methods on machine learning (ML) models trained to predict alfalfa yield. Linear regression, regression trees, support vector machines, neural networks, Bayesian regression, and nearest neighbors were all developed with cross validation. The features used included weather data, historical yield data, and the sown date. The feature selection methods that were compared included a correlation-based method, the ReliefF method, and a wrapper method. We found that the best method was the correlation-based method, and the feature set it found consisted of the Julian day of the harvest, the number of days between the sown and harvest dates, cumulative solar radiation since the previous harvest, and cumulative rainfall since the previous harvest. Using these features, the k-nearest neighbor and random forest methods achieved an average R value over 0.95, and average mean absolute error less than 200 lbs./acre. Our top R-2 of 0.90 beats a previous work's best R-2 of 0.87. Our primary contribution is the demonstration that ML, with feature selection, shows promise in predicting crop yields even on simple datasets with a handful of features, and that reporting accuracies in R and R-2 offers an intuitive way to compare results among various crops.
引用
收藏
页码:71 / 88
页数:18
相关论文
共 50 条
  • [1] Sarcopenia feature selection and risk prediction using machine learning
    Yoo, Jun-Il
    Park, Chan-Ho
    Kim, Hyeonmok
    JOURNAL OF BONE AND MINERAL RESEARCH, 2019, 34 : 145 - 145
  • [2] Prediction of Heart Failure by using Machine Learning and Feature Selection
    Aslam, Muhammad Haseeb
    Hussain, Syed Fawad
    2022 17TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES (ICET'22), 2022, : 160 - 165
  • [3] Alfalfa yield prediction using machine learning and UAV multispectral remote sensing
    Yan H.
    Zhuo Y.
    Li M.
    Wang Y.
    Guo H.
    Wang J.
    Li C.
    Ding F.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2022, 38 (11): : 64 - 71
  • [4] A proposed framework for crop yield prediction using hybrid feature selection approach and optimized machine learning
    Abdel-salam, Mahmoud
    Kumar, Neeraj
    Mahajan, Shubham
    Neural Computing and Applications, 2024, 36 (33) : 20723 - 20750
  • [5] Ensemble Feature Selection Framework for Paddy Yield Prediction in Cauvery Basin using Machine Learning Classifiers
    Sathya, P.
    Gnanasekaran, P.
    COGENT ENGINEERING, 2023, 10 (02):
  • [6] Early Prediction of Diabetes Using Feature Selection and Machine Learning Algorithms
    Abdollahi J.
    Aref S.
    SN Computer Science, 5 (2)
  • [7] Solar Flare Prediction Using Advanced Feature Extraction, Machine Learning, and Feature Selection
    Omar W. Ahmed
    Rami Qahwaji
    Tufan Colak
    Paul A. Higgins
    Peter T. Gallagher
    D. Shaun Bloomfield
    Solar Physics, 2013, 283 : 157 - 175
  • [8] A Survey of Feature Selection for Vulnerability Prediction Using Feature-based Machine Learning
    Li, ZhanJun
    Shao, Yan
    ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2019, : 30 - 36
  • [9] Solar Flare Prediction Using Advanced Feature Extraction, Machine Learning, and Feature Selection
    Ahmed, Omar W.
    Qahwaji, Rami
    Colak, Tufan
    Higgins, Paul A.
    Gallagher, Peter T.
    Bloomfield, D. Shaun
    SOLAR PHYSICS, 2013, 283 (01) : 157 - 175
  • [10] Machine Learning- and Feature Selection-Enabled Framework for Accurate Crop Yield Prediction
    Gupta, Sandeep
    Geetha, Angelina
    Sankaran, K. Sakthidasan
    Zamani, Abu Sarwar
    Ritonga, Mahyudin
    Raj, Roop
    Ray, Samrat
    Mohammed, Hussien Sobahi
    JOURNAL OF FOOD QUALITY, 2022, 2022