Toponym matching through deep neural networks

被引:48
|
作者
Santos, Rui [1 ]
Murrieta-Flores, Patricia [2 ]
Calado, Pavel [1 ]
Martins, Bruno [1 ]
机构
[1] Univ Lisbon, Inst Super Tecn, INESC ID, Lisbon, Portugal
[2] Univ Chester, Digital Humanities Res Ctr, Chester, Cheshire, England
关键词
Toponym matching; duplicate detection; approximate string matching; deep neural networks; recurrent neural networks; geographic information retrieval;
D O I
10.1080/13658816.2017.1390119
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Toponym matching, i.e. pairing strings that represent the same real-world location, is a fundamental problemfor several practical applications. The current state-of-the-art relies on string similarity metrics, either specifically developed for matching place names or integrated within methods that combine multiple metrics. However, these methods all rely on common sub-strings in order to establish similarity, and they do not effectively capture the character replacements involved in toponym changes due to transliterations or to changes in language and culture over time. In this article, we present a novel matching approach, leveraging a deep neural network to classify pairs of toponyms as either matching or nonmatching. The proposed network architecture uses recurrent nodes to build representations from the sequences of bytes that correspond to the strings that are to be matched. These representations are then combined and passed to feed-forward nodes, finally leading to a classification decision. We present the results of a wide-ranging evaluation on the performance of the proposed method, using a large dataset collected from the GeoNames gazetteer. These results show that the proposed method can significantly outperform individual similarity metrics from previous studies, as well as previous methods based on supervised machine learning for combining multiple metrics.
引用
收藏
页码:324 / 348
页数:25
相关论文
共 50 条
  • [31] Impact of reverberation through deep neural networks on adversarial perturbations
    Cohendet, Romain
    Solinas, Miguel
    Bernhard, Remi
    Reyboz, Marina
    Moellic, Pierre-Alain
    Bourrier, Yannick
    Mermillod, Martial
    20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 840 - 846
  • [32] Synchrosqueezing voices through deep neural networks for horizon interpretation
    AlSalmi, Haifa
    Wang, Yanghua
    INTERPRETATION-A JOURNAL OF SUBSURFACE CHARACTERIZATION, 2024, 12 (03): : SE89 - SE102
  • [33] The face inversion effect through the lens of deep neural networks
    Tousi, Ehsan
    Mur, Marieke
    PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2024, 291 (2028)
  • [34] Autonomous exploration of mobile robots through deep neural networks
    Tai, Lei
    Li, Shaohua
    Liu, Ming
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2017, 14 (04): : 1 - 9
  • [35] Pruning of Deep Spiking Neural Networks through Gradient Rewiring
    Chen, Yanqi
    Yu, Zhaofei
    Fang, Wei
    Huang, Tiejun
    Tian, Yonghong
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1713 - 1721
  • [36] Full Approximation of Deep Neural Networks through Efficient Optimization
    De la Parra, Cecilia
    Guntoro, Andre
    Kumar, Akash
    2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
  • [37] Post-stack seismic inversion through probabilistic neural networks and deep forward neural networks
    Sotelo, Victor
    Almanza, Ovidio
    Montes, Luis
    EARTH SCIENCE INFORMATICS, 2024, 17 (03) : 1957 - 1966
  • [38] VCAM: Variation Compensation through Activation Matching for Analog Binarized Neural Networks
    Kim, Jaehyun
    Lee, Chaeun
    Kim, Jihun
    Kim, Yumin
    Hwang, Cheol Seong
    Choi, Kiyoung
    2019 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN (ISLPED), 2019,
  • [39] Research on the application of convolutional-deep neural networks in parallel fingerprint minutiae matching
    Wang, SuHua
    Cheng, MingJun
    Ma, ZhiQiang
    Sun, XiaoXin
    INTERNATIONAL JOURNAL OF BIOMETRICS, 2021, 13 (01) : 96 - 113
  • [40] Deceiving Deep Neural Networks-Based Binary Code Matching with Adversarial Programs
    Wong, Wai Kin
    Wang, Huaijin
    Ma, Pingchuan
    Wang, Shuai
    Jiang, Mingyue
    Chen, Tsong Yueh
    Tang, Qiyi
    Nie, Sen
    Wu, Shi
    2022 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2022), 2022, : 117 - 128