A Classification Model with Corpus Enrichment for Toponym Disambiguation

被引:0
|
作者
Priego Sanchez, Belem [1 ]
Somodevilla, Maria J. [1 ]
Guzman Cabrera, Rafael [2 ]
Pineda, Ivo H. [1 ]
Carrillo, Maya [1 ]
机构
[1] Benemerita Univ Autonoma Puebla, FCC, Av San Claudio & 14 S, Puebla, Mexico
[2] Univ Guanajuato, DICIS, Salamanca, Mexico
关键词
toponym disambiguation; geographic information retrieval; corpus; classification model; WORD SENSE DISAMBIGUATION; WEB;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a method based on information retrieval to enrich corpus using bootstrapping techniques. A supervised corpus manually validated is provided, and then snippets are obtained from Web in order to increase the size of the initial corpus. Although this technique has already been reported in the literature, the main objective of this work is to apply it under the specific task of GEO/NO-GEO toponym disambiguation. The disambiguation procedure is evaluated by a classification model observing favorable results.
引用
收藏
页码:472 / 480
页数:9
相关论文
共 50 条
  • [1] Toponym Disambiguation in Information Retrieval
    Buscaldi, Davide
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2011, (46): : 125 - 126
  • [2] An Evidence-based Approach for Toponym Disambiguation
    Wang, Xingguang
    Zhang, Yi
    Chen, Min
    Lin, Xing
    Yu, Hao
    Liu, Yu
    2010 18TH INTERNATIONAL CONFERENCE ON GEOINFORMATICS, 2010,
  • [3] Toponym Disambiguation in Online Social Network Profiles
    Ghufran, Mohammad
    Quercini, Gianluca
    Bennacer, Nacera
    23RD ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2015), 2015,
  • [4] User-Driven Toponym Disambiguation Using Dialogue
    Muron, Mikulas
    Darena, Frantisek
    Prochazka, David
    Kern, Roman
    JOURNAL OF MAP & GEOGRAPHY LIBRARIES, 2023, 19 (03) : 198 - 219
  • [5] Toponym Disambiguation in Historical Documents Using Network Analysis of Qualitative Relationships
    Moncla, Ludovic
    McDonough, Katherine
    Vigier, Denis
    Joliveau, Thierry
    Brenon, Alice
    GEOHUMANITIES 2019: PROCEEDINGS OF THE 3RD ACM SIGSPATIAL INTERNATIONAL WORKSHOP ON GEOSPATIAL HUMANITIES (GEOHUMANITIES 2019), 2019,
  • [6] How can voting mechanisms improve the robustness and generalizability of toponym disambiguation?
    Hu, Xuke
    Sun, Yeran
    Kersten, Jens
    Zhou, Zhiyong
    Klan, Friederike
    Fan, Hongchao
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2023, 117
  • [7] Discovery, Enrichment and Disambiguation of Acronyms
    Barua, Jayendra
    Patel, Dhaval
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2016, 2016, 9829 : 345 - 360
  • [8] Word Sense Disambiguation Using the Classification Information Model
    Ho Lee
    Hae-Chang Rim
    Hungyun Seo
    Computers and the Humanities, 2000, 34 : 141 - 146
  • [9] Word sense disambiguation using the Classification Information Model
    Lee, H
    Rim, HC
    Seo, J
    COMPUTERS AND THE HUMANITIES, 2000, 34 (1-2): : 141 - 146
  • [10] A Crowdsourced Frame Disambiguation Corpus with Ambiguity
    Dumitrache, Anca
    Aroyo, Lora
    Welty, Chris
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 2164 - 2170