Understanding Residential Address Patterns in Urban and Rural Areas: A Machine Learning Approach

被引:0
|
作者
Cruz, Paula [1 ,2 ]
Vanneschi, Leonardo [1 ]
Painho, Marco [1 ]
机构
[1] Univ Nova Lisboa, Nova Informat Management Sch NOVA IMS, Lisbon, Portugal
[2] Stat Portugal, Methodol & Informat Syst Dept, Lisbon, Portugal
关键词
address validation; census; data quality; machine learning; multiclass classification; statistical operations; CLASSIFICATION; ALGORITHMS; VALIDATION;
D O I
10.1111/tgis.70003
中图分类号
P9 [自然地理学]; K9 [地理];
学科分类号
0705 ; 070501 ;
摘要
Address data quality has a direct impact on demographic and other spatial analyses, since it may lead to uncertainty and potential bias. Most of the existing studies measure address quality through matching with reference databases, which can be an expensive and time-consuming process. To bridge this gap, we propose a multiclass classification algorithm to evaluate the syntactic quality of residential addresses from a large database without using external databases. Namely, we adopt a multi-objective optimization approach, based on the NSGA-II algorithm and two modified k-NN algorithms. The objective is to find the address components as well as the optimal number of neighboring examples that help explain which class (good, incorrect or incomplete and anomalous) the quality of an address belongs to, by type of region (urban, medium urban, and rural). The presented results indicate that the proposed approach outperforms the best baseline algorithms on multiclass classification, while also providing descriptive information on the most relevant features and median local neighborhood of each instance. With this study, we further extend previous research in the field of address pattern extraction, by explicitly differentiating urban and rural areas as well as invalid and anomalous addresses.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] An urban informatics approach to understanding residential mobility in Metro Chicago
    Pan, Haozhi
    Chen, Si
    Gao, Yizhao
    Deal, Brian
    Liu, Jinfang
    ENVIRONMENT AND PLANNING B-URBAN ANALYTICS AND CITY SCIENCE, 2020, 47 (08) : 1456 - 1473
  • [32] Understanding different migrant selection patterns in rural and urban Mexico
    Fernandez-Huertas Moraga, Jesus
    JOURNAL OF DEVELOPMENT ECONOMICS, 2013, 103 : 182 - 201
  • [33] PrEP retention predictors among key populations in urban areas of Zimbabwe - a machine learning approach
    Bero, Bridgette
    Zidana, Chipo
    Showa, Sarudzai Portia
    AJAR-AFRICAN JOURNAL OF AIDS RESEARCH, 2023, 22 (03): : 157 - 164
  • [34] Forecasting Energy Utilization in Residential Areas through Machine Learning Techniques
    Vidhate, Kalpana D.
    Nema, Dr. Pragya
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (01) : 743 - 748
  • [35] A Machine Learning Based Approach to Detect Machine Learning Design Patterns
    Pan, Weitao
    Washizaki, Hironori
    Yoshioka, Nobukazu
    Fukazawa, Yoshiaki
    Khomh, Foutse
    Gueheneuc, Yann-Gael
    PROCEEDINGS OF THE 2023 30TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE, APSEC 2023, 2023, : 574 - 578
  • [36] An agent-based model of residential patterns and social structure in urban areas
    Lemoy, Remi
    Raux, Charles
    Jensen, Pablo
    CYBERGEO-EUROPEAN JOURNAL OF GEOGRAPHY, 2010,
  • [37] Classifying urban residential areas based on their exposure to crime: A constructivist approach
    Marques, Sara C. R.
    Ferreira, Fernando A. F.
    Meidute-Kavaliauskiene, Ieva
    Banaitis, Audrius
    SUSTAINABLE CITIES AND SOCIETY, 2018, 39 : 418 - 429
  • [38] Machine learning approach to uncovering residential energy consumption patterns based on socioeconomic and smart meter data
    Tang, Wenjun
    Wang, Hao
    Lee, Xian-Long
    Yang, Hong-Tzer
    ENERGY, 2022, 240
  • [39] Exploring attitudes and behavioral patterns in residential energy consumption: Data-driven by a machine learning approach
    Fakhr, Bahereh Vojdani
    Yeganeh, Mansour
    Walzberg, Julien
    Ghayehbashi, Ahad Rezayan
    ENERGY AND CLIMATE CHANGE, 2024, 5
  • [40] RESIDENTIAL APPLIANCE USAGE PATTERNS FROM OVERALL ENERGY CONSUMPTION DATA: A STATISTICAL MACHINE LEARNING APPROACH
    Bandyopadhyay, Arkasama
    Bhattacharya, Anirban
    PROCEEDINGS OF ASME 2021 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION (IMECE2021), VOL 8A, 2021,