Geospatial Entity Resolution

被引:4
|
作者
Balsebre, Pasquale [1 ]
Yao, Dezhong [2 ]
Cong, Gao [1 ]
Hai, Zhen [3 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
[2] Huazhong Univ Sci & Technol, Wuhan, Peoples R China
[3] Alibaba Grp, DAMO Acad, Singapore, Singapore
基金
中国国家自然科学基金;
关键词
Entity resolution; neural networks; geospatial data; neighbourhood embedding; graph attention;
D O I
10.1145/3485447.3512026
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A geospatial database is today at the core of an ever increasing number of services. Building and maintaining it remains challenging due to the need to merge information from multiple providers. Entity Resolution (ER) consists of finding entity mentions from different sources that refer to the same real world entity. In geospatial ER, entities are often represented using different schemes and are subject to incomplete information and inaccurate location, making ER and deduplication daunting tasks. While tremendous advances have been made in traditional entity resolution and natural language processing, geospatial data integration approaches still heavily rely on static similarity measures and human-designed rules. In order to achieve automatic linking of geospatial data, a unified representation of entities with heterogeneous attributes and their geographical context, is needed. To this end, we propose Geo-ER1, a joint framework that combines Transformer-based language models, that have been successfully applied in ER, with a novel learning-based architecture to represent the geospatial character of the entity. Different from existing solutions, Geo-ER does not rely on pre-defined rules and is able to capture information from surrounding entities in order to make context-based, accurate predictions. Extensive experiments on eight real world datasets demonstrate the effectiveness of our solution over state-of-the-art methods. Moreover, Geo-ER proves to be robust in settings where there is no available training data for a specific city.
引用
收藏
页码:3061 / 3070
页数:10
相关论文
共 50 条
  • [31] Entity resolution with Markov logic
    Singla, Parag
    Domingos, Pedro
    ICDM 2006: SIXTH INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2006, : 572 - +
  • [32] Entity resolution with weighted constraints
    1600, Springer Verlag (8716):
  • [33] Entity Resolution with Weighted Constraints
    Shen, Zeyu
    Wang, Qing
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS (ADBIS 2014), 2014, 8716 : 308 - 322
  • [34] Tutorial: Uncertain Entity Resolution Re-evaluating Entity Resolution in the Big Data Era
    Gal, Avigdor
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 7 (13): : 1711 - 1712
  • [35] Commercial applications for high resolution geospatial imagery
    Khuen, CA
    PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 1997, 63 (08): : 933 - &
  • [36] High Resolution Earth Imaging for Geospatial Information
    Heipke, Christian
    Jacobsen, Karsten
    Rottensteiner, Franz
    Mueller, Soenke
    Soergel, Uwe
    PHOTOGRAMMETRIE FERNERKUNDUNG GEOINFORMATION, 2012, (04): : 315 - 316
  • [37] Deep Sequence-to-Sequence Entity Matching for Heterogeneous Entity Resolution
    Nie, Hao
    Han, Xianpei
    He, Ben
    Sun, Le
    Chen, Bo
    Zhang, Wei
    Wu, Suhui
    Kong, Hao
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 629 - 638
  • [38] Query-time entity resolution
    Bhattacharya, Indrajit
    Getoor, Lise
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2007, 30 (621-657): : 621 - 657
  • [39] Adaptive Graphical Approach to Entity Resolution
    Chen, Zhaoqi
    Kalashnikov, Dmitri V.
    Mehrotra, Sharad
    PROCEEDINGS OF THE 7TH ACM/IEE JOINT CONFERENCE ON DIGITAL LIBRARIES: BUILDING & SUSTAINING THE DIGITAL ENVIRONMENT, 2007, : 204 - 213
  • [40] A Method for Implementing Probabilistic Entity Resolution
    Alsarkhi, Awaad
    Talburt, John R.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (11) : 7 - 15