Understanding Residential Address Patterns in Urban and Rural Areas: A Machine Learning Approach

被引:0
|
作者
Cruz, Paula [1 ,2 ]
Vanneschi, Leonardo [1 ]
Painho, Marco [1 ]
机构
[1] Univ Nova Lisboa, Nova Informat Management Sch NOVA IMS, Lisbon, Portugal
[2] Stat Portugal, Methodol & Informat Syst Dept, Lisbon, Portugal
关键词
address validation; census; data quality; machine learning; multiclass classification; statistical operations; CLASSIFICATION; ALGORITHMS; VALIDATION;
D O I
10.1111/tgis.70003
中图分类号
P9 [自然地理学]; K9 [地理];
学科分类号
0705 ; 070501 ;
摘要
Address data quality has a direct impact on demographic and other spatial analyses, since it may lead to uncertainty and potential bias. Most of the existing studies measure address quality through matching with reference databases, which can be an expensive and time-consuming process. To bridge this gap, we propose a multiclass classification algorithm to evaluate the syntactic quality of residential addresses from a large database without using external databases. Namely, we adopt a multi-objective optimization approach, based on the NSGA-II algorithm and two modified k-NN algorithms. The objective is to find the address components as well as the optimal number of neighboring examples that help explain which class (good, incorrect or incomplete and anomalous) the quality of an address belongs to, by type of region (urban, medium urban, and rural). The presented results indicate that the proposed approach outperforms the best baseline algorithms on multiclass classification, while also providing descriptive information on the most relevant features and median local neighborhood of each instance. With this study, we further extend previous research in the field of address pattern extraction, by explicitly differentiating urban and rural areas as well as invalid and anomalous addresses.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Simulation of Land Surface Temperature Patterns Over Future Urban Areas—A Machine Learning Approach
    Sandeep Maithani
    Garima Nautiyal
    Archana Sharma
    Surendra Kumar Sharma
    Journal of the Indian Society of Remote Sensing, 2022, 50 : 2145 - 2162
  • [2] Abandoned rural residential land: Using machine learning techniques to identify rural residential land vulnerable to be abandoned in mountainous areas
    Xu, Feng
    Ho, Hung Chak
    Chi, Guangqing
    Wang, Zhanqi
    HABITAT INTERNATIONAL, 2019, 84 : 43 - 56
  • [3] Path Loss Prediction in Urban Areas: A Machine Learning Approach
    Rafie, Irfan Farhan Mohamad
    Lim, Soo Yong
    Chung, Michael Jenn Hwan
    IEEE ANTENNAS AND WIRELESS PROPAGATION LETTERS, 2023, 22 (04): : 809 - 813
  • [4] Residential art centres and their understanding of freedom in rural areas☆
    Houserova, Stela
    Pospech, Pavel
    JOURNAL OF RURAL STUDIES, 2025, 114
  • [5] Simulation of Land Surface Temperature Patterns Over Future Urban Areas-A Machine Learning Approach
    Maithani, Sandeep
    Nautiyal, Garima
    Sharma, Archana
    Sharma, Surendra Kumar
    JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2022, 50 (11) : 2145 - 2162
  • [6] Altruistic Behavior in Rural and Urban, Residential and Business Areas
    Goldman, Morton
    Lewandowski, Helen E.
    Carrill, Richard E.
    BASIC AND APPLIED SOCIAL PSYCHOLOGY, 1982, 3 (02) : 155 - 160
  • [7] SIMULATING RESIDENTIAL ENERGY DEMAND IN URBAN AND RURAL AREAS
    Thorve, Swapna
    Swarup, Samarth
    Marathe, Achla
    Chungbaek, Youngyun
    Nordberg, Eric K.
    Marathe, Madhav V.
    2018 WINTER SIMULATION CONFERENCE (WSC), 2018, : 548 - 559
  • [8] PATTERNS OF OFFENDING IN URBAN AND RURAL-AREAS
    LAUB, JH
    JOURNAL OF CRIMINAL JUSTICE, 1983, 11 (02) : 129 - 142
  • [9] Understanding the performance gap: a machine learning approach on residential buildings in Turin, Italy
    Boghetti, Roberto
    Fantozzi, Fabio
    Kampf, Jerome H.
    Salvadori, Giacomo
    CLIMATE RESILIENT CITIES - ENERGY EFFICIENCY & RENEWABLES IN THE DIGITAL ERA (CISBAT 2019), 2019, 1343
  • [10] Understanding Road Usage Patterns in Urban Areas
    Wang, Pu
    Hunter, Timothy
    Bayen, Alexandre M.
    Schechtner, Katja
    Gonzalez, Marta C.
    SCIENTIFIC REPORTS, 2012, 2