Machine Learning for Mapping and Forecasting Poverty in North Sumatera: A Data- Driven Approach

被引:0
|
作者
Arnita [1 ]
Arpaung, Faridawaty m [1 ]
Amadhani, Fanny r [1 ]
Inata, Dewan [1 ]
机构
[1] Univ Negeri Medan, Dept Math, Jl Williem Iskandar Pasar 5, Medan, Indonesia
来源
SAINS MALAYSIANA | 2024年 / 53卷 / 07期
关键词
Cross validation; grid search; K-Means; poverty; random forest regression; RANDOM FOREST ALGORITHM;
D O I
10.17576/jsm-2024-5307-18
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Discussing poverty is crucial because it affects many facets of society, including socioeconomic disparity, crime, and the inability to obtain high-quality education. One of the provinces with the highest poverty rate in Indonesia is North Sumatra. A strategy is required to gather accurate data to effectively reduce poverty. Poverty mapping and prediction were conducted in North Sumatra to get a precise spatial distribution of poverty, the operation of the poverty model, and forecasting using machine learning (ML). ML ). Poverty prediction was conducted using a random forest (RF) RF ) algorithm and poverty mapping was conducted using the K-Means algorithm. The poverty mapping showed a significant inertia value decline in the third and fourth clusters of the elbow graph. The third cluster (0.313) was superior to the fourth cluster (0.244) in the silhouette index. Thus, there were three poverty clusters- low, medium, and high- that were used in the model. The best model was created using the grid search cross-validation, while the best prediction results were created using the RF algorithm, with the following parameters: n-estimator = 50, max depth = 10, min samples split = 2, and min samples leaf = 1. The mean squared error ( MSE ) of the RF model's predictions was 0.002617, or satisfactory precision.
引用
收藏
页码:1715 / 1728
页数:14
相关论文
共 50 条
  • [21] Data Driven Approach for Eye Disease Classification with Machine Learning
    Malik, Sadaf
    Kanwal, Nadia
    Asghar, Mamoona Naveed
    Sadiq, Mohammad Ali A.
    Karamat, Irfan
    Fleury, Martin
    APPLIED SCIENCES-BASEL, 2019, 9 (14):
  • [22] A data-driven approach to mapping multidimensional poverty at residential block level in Mexico
    Zea-Ortiz, Marivel
    Vera, Pablo
    Salas, Joaquin
    Manduchi, Roberto
    Villasenor, Elio
    Figueroa, Alejandra
    Suarez, Ranyart R.
    ENVIRONMENT DEVELOPMENT AND SUSTAINABILITY, 2024,
  • [23] A Machine Learning Approach to Volatility Forecasting*
    Christensen, Kim
    Siggaard, Mathias
    Veliyev, Bezirgen
    JOURNAL OF FINANCIAL ECONOMETRICS, 2023, 21 (05) : 1680 - 1727
  • [24] Forecasting, Data Mining and Machine Learning
    OPERATIONS RESEARCH PROCEEDINGS 2010, 2011, : 1 - 1
  • [25] Data-driven topo-climatic mapping with machine learning methods
    A. Pozdnoukhov
    L. Foresti
    M. Kanevski
    Natural Hazards, 2009, 50 : 497 - 518
  • [26] Data-driven topo-climatic mapping with machine learning methods
    Pozdnoukhov, A.
    Foresti, L.
    Kanevski, M.
    NATURAL HAZARDS, 2009, 50 (03) : 497 - 518
  • [27] Data-Driven Trend Forecasting in Stock Market Using Machine Learning Techniques
    Misra, Puneet
    Chaurasia, Siddharth
    JOURNAL OF INFORMATION TECHNOLOGY RESEARCH, 2020, 13 (01) : 130 - 149
  • [28] Binning Based Data Driven Machine Learning Models for Solar Radiation Forecasting in India
    Munshi, Anuradha
    Moharil, R. M.
    IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF ELECTRICAL ENGINEERING, 2024, 48 (03) : 1249 - 1260
  • [29] Mapping Raw Acceleration Data on ActiGraph Counts: A Machine Learning Approach
    Martin-Gonzalez, Elena
    de-Luis-Garcia, Rodrigo
    Casaseca-de-la-Higuera, J. P.
    Garmendia-Leiza, J. R.
    Andres-de-LLano, J.
    Alberola-Lopez, Carlos
    SIXTH INTERNATIONAL CONFERENCE ON TECHNOLOGICAL ECOSYSTEMS FOR ENHANCING MULTICULTURALITY (TEEM'18), 2018, : 477 - 482
  • [30] A Machine Learning Approach for NDVI Forecasting based on Sentinel-2 Data
    Cavalli, Stefano
    Penzotti, Gabriele
    Amoretti, Michele
    Caselli, Stefano
    PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGIES (ICSOFT), 2021, : 473 - 480