Toponym matching through deep neural networks

Cited by: 48
Authors
Santos, Rui [1]
Murrieta-Flores, Patricia [2]
Calado, Pavel [1]
Martins, Bruno [1]
Affiliations
[1] Univ Lisbon, Inst Super Tecn, INESC ID, Lisbon, Portugal
[2] Univ Chester, Digital Humanities Res Ctr, Chester, Cheshire, England
Keywords
Toponym matching; duplicate detection; approximate string matching; deep neural networks; recurrent neural networks; geographic information retrieval
DOI
10.1080/13658816.2017.1390119
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Toponym matching, i.e. pairing strings that represent the same real-world location, is a fundamental problem for several practical applications. The current state-of-the-art relies on string similarity metrics, either specifically developed for matching place names or integrated within methods that combine multiple metrics. However, these methods all rely on common sub-strings in order to establish similarity, and they do not effectively capture the character replacements involved in toponym changes due to transliterations or to changes in language and culture over time. In this article, we present a novel matching approach, leveraging a deep neural network to classify pairs of toponyms as either matching or nonmatching. The proposed network architecture uses recurrent nodes to build representations from the sequences of bytes that correspond to the strings that are to be matched. These representations are then combined and passed to feed-forward nodes, finally leading to a classification decision. We present the results of a wide-ranging evaluation on the performance of the proposed method, using a large dataset collected from the GeoNames gazetteer. These results show that the proposed method can significantly outperform individual similarity metrics from previous studies, as well as previous methods based on supervised machine learning for combining multiple metrics.
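The data flow described in the abstract (byte sequences, recurrent encoding, combination, feed-forward classification) can be illustrated with a small, untrained sketch in plain Python. This is not the authors' implementation. The names `to_bytes`, `TinyRecurrentEncoder`, and `match_probability` are hypothetical, as are the padding length, the Elman-style cell, the hidden size, the choice to combine representations by concatenation plus absolute difference, and the single logistic output unit. The abstract does not specify any of these details.

```python
import math
import random


def to_bytes(name, max_len=30):
    """UTF-8 encode a toponym, then truncate or zero-pad to max_len bytes.

    The fixed length of 30 is an assumption made for this sketch.
    """
    data = list(name.encode("utf-8"))[:max_len]
    return data + [0] * (max_len - len(data))


class TinyRecurrentEncoder:
    """Untrained Elman-style recurrent cell over byte values (illustrative only)."""

    def __init__(self, hidden=8, seed=0):
        rng = random.Random(seed)  # fixed seed so the sketch is reproducible
        self.hidden = hidden
        self.w_in = [rng.uniform(-0.5, 0.5) for _ in range(hidden)]
        self.w_rec = [
            [rng.uniform(-0.5, 0.5) for _ in range(hidden)] for _ in range(hidden)
        ]

    def encode(self, byte_seq):
        """Return the final hidden state after reading the whole byte sequence."""
        h = [0.0] * self.hidden
        for b in byte_seq:
            x = b / 255.0  # scale each byte value into [0, 1]
            h = [
                math.tanh(
                    self.w_in[i] * x
                    + sum(self.w_rec[i][j] * h[j] for j in range(self.hidden))
                )
                for i in range(self.hidden)
            ]
        return h


def match_probability(a, b, encoder, w_out, bias=0.0):
    """Encode both strings, combine the two representations, apply a logistic unit.

    Combining by concatenation plus element-wise absolute difference is an
    assumption; w_out must have 3 * encoder.hidden entries.
    """
    ha = encoder.encode(to_bytes(a))
    hb = encoder.encode(to_bytes(b))
    features = ha + hb + [abs(x - y) for x, y in zip(ha, hb)]
    z = sum(w * f for w, f in zip(w_out, features)) + bias
    return 1.0 / (1.0 + math.exp(-z))


encoder = TinyRecurrentEncoder(hidden=8, seed=0)
w_out = [0.1] * (3 * encoder.hidden)  # placeholder weights, not learned
score = match_probability("Lisboa", "Lisbon", encoder, w_out)
```

Because the weights are random placeholders, `score` carries no meaning here. In a trained model of this shape, the weights would be learned from labeled matching and nonmatching pairs, such as the GeoNames-derived dataset mentioned in the abstract.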
Pages: 324-348
Page count: 25
Related papers
50 in total
  • [21] Complexity matching in neural networks
    Mafahim, Javad Usefie
    Lambert, David
    Zare, Marzieh
    Grigolini, Paolo
    NEW JOURNAL OF PHYSICS, 2015, 17
  • [22] An accurate toponym-matching measure based on approximate string matching
    Kilinc, Deniz
    JOURNAL OF INFORMATION SCIENCE, 2016, 42 (02) : 138 - 149
  • [23] Towards Explaining Deep Neural Networks Through Graph Analysis
    Horta, Vitor A. C.
    Mileo, Alessandra
    DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA 2019), 2019, 1062 : 155 - 165
  • [24] Estimating Confidence for Deep Neural Networks through Density modeling
    Subramanya, Akshayvarun
    Srinivas, Suraj
    Babu, R. Venkatesh
    2018 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM 2018), 2018, : 397 - 401
  • [25] Automated categorization of behavioral quality through deep neural networks
    Pagliuca, Paolo
    Milano, Nicola
    Nolfi, Stefano
    2022 IEEE INTERNATIONAL CONFERENCE ON METROLOGY FOR EXTENDED REALITY, ARTIFICIAL INTELLIGENCE AND NEURAL ENGINEERING (METROXRAINE), 2022, : 372 - 376
  • [26] Credit Assignment in Neural Networks through Deep Feedback Control
    Meulemans, Alexander
    Farinha, Matilde Tristany
    Ordonez, Javier Garcia
    Aceituno, Pau Vilimelis
    Sacramento, Joao
    Grewe, Benjamin F.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [27] System identification through Lipschitz regularized deep neural networks
    Negrini, Elisa
    Citti, Giovanna
    Capogna, Luca
    JOURNAL OF COMPUTATIONAL PHYSICS, 2021, 444
  • [28] Perceived Emotion from Images through Deep Neural Networks
    Hernandez-Garcia, Alex
    2017 SEVENTH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2017, : 566 - 570
  • [29] A novel approach to cloth classification through deep neural networks
    Li Fengxin
    Li Yueping
    Zhang Xiaofeng
    2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2017, : 368 - 371
  • [30] Exponential expressivity in deep neural networks through transient chaos
    Poole, Ben
    Lahiri, Subhaneil
    Raghu, Maithra
    Sohl-Dickstein, Jascha
    Ganguli, Surya
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29