Data-driven techniques for temperature data prediction: big data analytics approach

被引:0
|
作者
Adamson Oloyede
Simeon Ozuomba
Philip Asuquo
Lanre Olatomiwa
Omowunmi Mary Longe
机构
[1] National Space Research and Development Agency,Advanced Space Technology Applications Laboratory Uyo
[2] University of Uyo,Department of Computer Engineering
[3] University of Johannesburg,Department of Electrical and Electronic Engineering Science
[4] Federal University of Technology Minna,Department of Electrical and Electronics Engineering
来源
关键词
Weather prediction; Artificial neural network; Correlation analysis; Descriptive and diagnostic analyses; Predictive and prescriptive analytics;
D O I
暂无
中图分类号
学科分类号
摘要
For extrapolation, climate change and other meteorological analysis, a study of past and current weather events is a prerequisite. NASA (National Aeronautics and Space Administration) has been able to develop a model capable of predicting various weather data for any location on the Earth, including locations lacking weather stations, weather satellite coverage, and other weather measuring instruments. This paper evaluates the prediction accuracy of the NASA temperature data with respect to NiMet (Nigerian Meteorological Agency) ground truth measurement, using Akwa Ibom Airport as a case study. Exploratory data analysis (descriptive and diagnostic analyses) of temperature retrieved from NiMet and NASA was performed to give a clear path to follow for predictive and prescriptive analyses. Using 2783 days of weather data retrieved from NiMet as ground truth, the accuracy of NASA predictions with the corresponding resolution was calculated. Mean absolute error (MAE) of 2.184 °C and root mean square error (RMSE) of 2.579 °C, with a coefficient of determination (R2) of 0.710 for maximum temperature, then MAE of 0.876 °C, RMSE of 1.225 °C with a coefficient of determination (R2) of 0.620 for minimum temperature was discovered. There is a good correlation between the two datasets; hence, a model can be developed to generate more accurate predictions, using the NASA data as input. Predictive and prescriptive analyses were performed by employing five prediction algorithms: decision tree regression, XGBoost regression and MLP (multilayer perceptron) with LBFGS (limited-memory Broyden-Fletcher-Goldfarb-Shanno) optimizer, MLP with SGD (stochastic gradient) optimizer and MLP with Adam optimizer. The MLP LBFGS algorithm performed best, by significantly reducing the MAE by 35.35% and RMSE by 31.06% for maximum temperature, accordingly, MAE by 10.05% and RMSE by 8.00% for minimum temperature. Results obtained show that given sufficient data, plugging NASA predictions as input to an LBFGS-MLP model gives more accurate temperature predictions for the study area.
引用
收藏
相关论文
共 50 条
  • [21] A Data-Driven Approach for Event Prediction
    Yuen, Jenny
    Torralba, Antonio
    COMPUTER VISION-ECCV 2010, PT II, 2010, 6312 : 707 - 720
  • [22] Data-Driven Sustainability: Revolutionizing Hospital Supply Chains through Big Data Analytics
    Twinkle Singh
    Jeanne Poulose
    Vinod Sharma
    Operations Research Forum, 6 (1)
  • [23] Big Data Analytics in Healthcare: Data-Driven Methods for Typical Treatment Pattern Mining
    Guo, Chonghui
    Chen, Jingfeng
    JOURNAL OF SYSTEMS SCIENCE AND SYSTEMS ENGINEERING, 2019, 28 (06) : 694 - 714
  • [24] The Prediction of Flight Delay: Big Data-driven Machine Learning Approach
    Huo, Jiage
    Keung, K. L.
    Lee, C. K. M.
    Ng, Kam K. H.
    Li, K. C.
    2020 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEE IEEM), 2020, : 190 - 194
  • [25] A model-driven approach to automate data visualization in big data analytics
    Golfarelli, Matteo
    Rizzi, Stefano
    INFORMATION VISUALIZATION, 2020, 19 (01) : 24 - 47
  • [26] Big Data and Data-Driven Marketing in Brazil
    Finger, Vitor
    Reichelt, Valesca
    Capelli, Joao
    2ND INTERNATIONAL CONFERENCE ON ADVANCED RESEARCH METHODS AND ANALYTICS (CARMA 2018), 2018, : 71 - 78
  • [27] A Data-Driven Approach to Kinematic Analytics of Spinal Motion
    Gencdogmus, Aysenur
    Keskin, Seref Recep
    Dogan, Gulustan
    Ozturk, Yusuf
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 2222 - 2229
  • [28] Framework for Data Analytics in Data-Driven Product Planning
    Massmann, Melina
    Meyer, Maurice
    Frank, Maximilian
    von Enzberg, Sebastian
    Kuehn, Arno
    Dumitrescu, Roman
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON SYSTEM-INTEGRATED INTELLIGENCE (SYSINT 2020): SYSTEM-INTEGRATED INTELLIGENCE - INTELLIGENT, FLEXIBLE AND CONNECTED SYSTEMS IN PRODUCTS AND PRODUCTION, 2020, 52 : 350 - 355
  • [29] Big data analytics for data-driven industry: a review of data sources, tools, challenges, solutions, and research directions
    Ikegwu, Anayo Chukwu
    Nweke, Henry Friday
    Anikwe, Chioma Virginia
    Alo, Uzoma Rita
    Okonkwo, Obikwelu Raphael
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2022, 25 (05): : 3343 - 3387
  • [30] Big data analytics for data-driven industry: a review of data sources, tools, challenges, solutions, and research directions
    Anayo Chukwu Ikegwu
    Henry Friday Nweke
    Chioma Virginia Anikwe
    Uzoma Rita Alo
    Obikwelu Raphael Okonkwo
    Cluster Computing, 2022, 25 : 3343 - 3387