Geospatial Entity Resolution

Cited by: 4
Authors
Balsebre, Pasquale [1]
Yao, Dezhong [2]
Cong, Gao [1]
Hai, Zhen [3]
Affiliations
[1] Nanyang Technol Univ, Singapore, Singapore
[2] Huazhong Univ Sci & Technol, Wuhan, Peoples R China
[3] Alibaba Grp, DAMO Acad, Singapore, Singapore
Funding
National Natural Science Foundation of China
Keywords
Entity resolution; neural networks; geospatial data; neighbourhood embedding; graph attention
DOI
10.1145/3485447.3512026
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
A geospatial database is today at the core of an ever-increasing number of services. Building and maintaining it remains challenging due to the need to merge information from multiple providers. Entity Resolution (ER) consists of finding entity mentions from different sources that refer to the same real-world entity. In geospatial ER, entities are often represented using different schemes and are subject to incomplete information and inaccurate location, making ER and deduplication daunting tasks. While tremendous advances have been made in traditional entity resolution and natural language processing, geospatial data integration approaches still rely heavily on static similarity measures and human-designed rules. To link geospatial data automatically, a unified representation of entities with heterogeneous attributes and their geographical context is needed. To this end, we propose Geo-ER, a joint framework that combines Transformer-based language models, which have been successfully applied in ER, with a novel learning-based architecture to represent the geospatial character of the entity. Unlike existing solutions, Geo-ER does not rely on pre-defined rules and is able to capture information from surrounding entities in order to make context-based, accurate predictions. Extensive experiments on eight real-world datasets demonstrate the effectiveness of our solution over state-of-the-art methods. Moreover, Geo-ER proves to be robust in settings where there is no available training data for a specific city.
Pages: 3061-3070
Number of pages: 10