Machine Learning for Mapping and Forecasting Poverty in North Sumatera: A Data- Driven Approach

被引:0
|
作者
Arnita [1 ]
Arpaung, Faridawaty m [1 ]
Amadhani, Fanny r [1 ]
Inata, Dewan [1 ]
机构
[1] Univ Negeri Medan, Dept Math, Jl Williem Iskandar Pasar 5, Medan, Indonesia
来源
SAINS MALAYSIANA | 2024年 / 53卷 / 07期
关键词
Cross validation; grid search; K-Means; poverty; random forest regression; RANDOM FOREST ALGORITHM;
D O I
10.17576/jsm-2024-5307-18
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Discussing poverty is crucial because it affects many facets of society, including socioeconomic disparity, crime, and the inability to obtain high-quality education. One of the provinces with the highest poverty rate in Indonesia is North Sumatra. A strategy is required to gather accurate data to effectively reduce poverty. Poverty mapping and prediction were conducted in North Sumatra to get a precise spatial distribution of poverty, the operation of the poverty model, and forecasting using machine learning (ML). ML ). Poverty prediction was conducted using a random forest (RF) RF ) algorithm and poverty mapping was conducted using the K-Means algorithm. The poverty mapping showed a significant inertia value decline in the third and fourth clusters of the elbow graph. The third cluster (0.313) was superior to the fourth cluster (0.244) in the silhouette index. Thus, there were three poverty clusters- low, medium, and high- that were used in the model. The best model was created using the grid search cross-validation, while the best prediction results were created using the RF algorithm, with the following parameters: n-estimator = 50, max depth = 10, min samples split = 2, and min samples leaf = 1. The mean squared error ( MSE ) of the RF model's predictions was 0.002617, or satisfactory precision.
引用
收藏
页码:1715 / 1728
页数:14
相关论文
共 50 条
  • [31] Forecasting Electricity Consumption Data from Paraguay Using a Machine Learning Approach
    Gallardo, Jose A.
    Garcia-Torres, Miguel
    Gomez-Vela, Francisco
    Morales, Felix
    Divina, Federico
    Becerra-Alonso, David
    Velazquez, Gustavo
    Daumas-Ladouce, Federico
    Vazquez Noguera, Jose Luis
    Sauer Ayala, Carlos
    Pinto-Roa, Diego P.
    Gardel-Sotomayor, Pedro E.
    Mello Roman, Julio C.
    16TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING MODELS IN INDUSTRIAL AND ENVIRONMENTAL APPLICATIONS (SOCO 2021), 2022, 1401 : 685 - 694
  • [32] Machine Learning based Psychology: Advocating for A Data-Driven Approach
    Velez, Jorge I.
    INTERNATIONAL JOURNAL OF PSYCHOLOGICAL RESEARCH, 2021, 14 (01): : 6 - 11
  • [33] Clustering suicides: A data-driven, exploratory machine learning approach
    Ludwig, Birgit
    Koenig, Daniel
    Kapusta, Nestor D.
    Blueml, Victor
    Dorffner, Georg
    Vyssoki, Benjamin
    EUROPEAN PSYCHIATRY, 2019, 62 : 15 - 19
  • [34] Prediction of casing damage: A data-driven, machine learning approach
    Zhao Y.
    Jiang H.
    Li H.
    International Journal of Circuits, Systems and Signal Processing, 2020, 14 : 1047 - 1053
  • [35] Data Driven Credit Risk Management Process: A Machine Learning Approach
    Chen, Mingrui
    Dautais, Yann
    Huang, LiGuo
    Ge, Jidong
    ICSSP'17: PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON SOFTWARE AND SYSTEM PROCESS, 2017, : 109 - 113
  • [36] Data Driven Prognostics of Milling Tool Wear :A Machine Learning Approach
    Vijay, S.
    Pillai, Madhusudanan, V
    Kuraichen, Basil
    2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2021), 2021, : 2 - 7
  • [37] Machine learning approach to handle data-driven model for simulation and forecasting of the cone crusher output in the stone crushing plant
    Abuhasel, Khaled Ali
    COMPUTATIONAL INTELLIGENCE, 2021, 37 (03) : 1098 - 1110
  • [38] Combining Data- and Knowledge-Driven AI with Didactics for Individualized Learning Recommendations
    Landes, Dieter
    Sedelmaier, Yvonne
    Boeck, Felix
    Lehmann, Alexander
    Fraas, Melanie
    Janusch, Sebastian
    2024 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE, EDUCON 2024, 2024,
  • [39] Understanding the performance of machine learning models from data- to patient-level
    Valeriano, Maria gabriela
    Matran-fernandez, Ana
    Kiffer, Carlos
    Lorena, Ana Carolina
    ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2024, 16 (04):
  • [40] A machine learning approach to geochemical mapping
    Kirkwood, Charlie
    Cave, Mark
    Beamish, David
    Grebby, Stephen
    Ferreira, Antonio
    JOURNAL OF GEOCHEMICAL EXPLORATION, 2016, 167 : 49 - 61