Toponym matching through deep neural networks

被引:48
|
作者
Santos, Rui [1 ]
Murrieta-Flores, Patricia [2 ]
Calado, Pavel [1 ]
Martins, Bruno [1 ]
机构
[1] Univ Lisbon, Inst Super Tecn, INESC ID, Lisbon, Portugal
[2] Univ Chester, Digital Humanities Res Ctr, Chester, Cheshire, England
关键词
Toponym matching; duplicate detection; approximate string matching; deep neural networks; recurrent neural networks; geographic information retrieval;
D O I
10.1080/13658816.2017.1390119
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Toponym matching, i.e. pairing strings that represent the same real-world location, is a fundamental problemfor several practical applications. The current state-of-the-art relies on string similarity metrics, either specifically developed for matching place names or integrated within methods that combine multiple metrics. However, these methods all rely on common sub-strings in order to establish similarity, and they do not effectively capture the character replacements involved in toponym changes due to transliterations or to changes in language and culture over time. In this article, we present a novel matching approach, leveraging a deep neural network to classify pairs of toponyms as either matching or nonmatching. The proposed network architecture uses recurrent nodes to build representations from the sequences of bytes that correspond to the strings that are to be matched. These representations are then combined and passed to feed-forward nodes, finally leading to a classification decision. We present the results of a wide-ranging evaluation on the performance of the proposed method, using a large dataset collected from the GeoNames gazetteer. These results show that the proposed method can significantly outperform individual similarity metrics from previous studies, as well as previous methods based on supervised machine learning for combining multiple metrics.
引用
收藏
页码:324 / 348
页数:25
相关论文
共 50 条
  • [1] Stereo Matching through Squeeze Deep Neural Networks
    Caffaratti, Gabriel D.
    Marehetta, Martin G.
    Forradellas, Raymundo Q.
    INTELIGENCIA ARTIFICIAL-IBEROAMERICAN JOURNAL OF ARTIFICIAL INTELLIGENCE, 2019, 22 (63): : 16 - 38
  • [2] A deep neural network model for Chinese toponym matching with geographic pre-training model
    Qiu, Qinjun
    Zheng, Shiyu
    Tian, Miao
    Li, Jiali
    Ma, Kai
    Tao, Liufeng
    Xie, Zhong
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2024, 17 (01)
  • [3] Using Recurrent Neural Networks for Toponym Resolution in Text
    Cardoso, Ana Barbara
    Martins, Bruno
    Estima, Jacinto
    PROGRESS IN ARTIFICIAL INTELLIGENCE, PT II, 2019, 11805 : 769 - 780
  • [4] Positioning with Map Matching using Deep Neural Networks
    Bergkvist, Hannes
    Davidsson, Paul
    Exner, Peter
    PROCEEDINGS OF THE 17TH EAI INTERNATIONAL CONFERENCE ON MOBILE AND UBIQUITOUS SYSTEMS: COMPUTING, NETWORKING AND SERVICES (MOBIQUITOUS 2020), 2021, : 177 - 183
  • [5] Deep Belief Networks Based Toponym Recognition for Chinese Text
    Wang, Shu
    Zhang, Xueying
    Ye, Peng
    Du, Mi
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2018, 7 (06)
  • [6] Deep Neural Networks for Matching Online Social Networking Profiles
    Ciorbaru, Vicentiu-Marian
    Rebedea, Traian
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2017, PT I, 2017, 10448 : 192 - 201
  • [7] Deep Image Matching Based on Siamese Convolutional Neural Networks
    Dou, J.
    Tu, Z.
    PATTERN RECOGNITION AND IMAGE ANALYSIS, 2025, 35 (01) : 1 - 15
  • [8] History matching of petroleum reservoirs using deep neural networks
    Alguliyev, Rasim
    Aliguliyev, Ramiz
    Imamverdiyev, Yadigar
    Sukhostat, Lyudmila
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2022, 16
  • [9] Uncertainty Propagation through Deep Neural Networks
    Abdelaziz, Ahmed Hussen
    Watanabe, Shinji
    Hershey, John R.
    Vincent, Emanuel
    Kolossa, Dorothea
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3561 - 3565
  • [10] PROGRAM MATCHING THROUGH CODE ANALYSIS AND ARTIFICIAL NEURAL NETWORKS
    Nascimento, Tiago M.
    Boccardo, Davidson R.
    Prado, Charles B.
    Machado, Raphael C. S.
    Carmo, Luiz F. R. C.
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2012, 22 (02) : 225 - 241