A Classification Model with Corpus Enrichment for Toponym Disambiguation

Authors
Priego Sanchez, Belem [1]
Somodevilla, Maria J. [1]
Guzman Cabrera, Rafael [2]
Pineda, Ivo H. [1]
Carrillo, Maya [1]
Affiliations
[1] Benemerita Univ Autonoma Puebla, FCC, Av San Claudio & 14 S, Puebla, Mexico
[2] Univ Guanajuato, DICIS, Salamanca, Mexico
Keywords
toponym disambiguation; geographic information retrieval; corpus; classification model; word sense disambiguation; Web
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents an information-retrieval-based method for enriching a corpus using bootstrapping techniques. Starting from a manually validated supervised corpus, snippets are retrieved from the Web to increase the size of the initial corpus. Although this technique has already been reported in the literature, the main objective of this work is to apply it to the specific task of GEO/NO-GEO toponym disambiguation. The disambiguation procedure is evaluated with a classification model, with favorable results.
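The abstract gives no implementation details, so the following is only a minimal sketch of one common form of bootstrapped corpus enrichment (self-training), not the authors' procedure. Everything in it is an assumption made for illustration: the Naive Bayes classifier, the `threshold` and `rounds` parameters, the function names, and the hard-coded `SEED` and `SNIPPETS` lists, which stand in for the manually validated corpus and for snippets that would be retrieved from the Web.

```python
import math
from collections import Counter

LABELS = ("GEO", "NO-GEO")


def tokenize(text):
    """Lowercase, split on whitespace, strip surrounding punctuation."""
    tokens = (w.strip(".,;:!?()\"'").lower() for w in text.split())
    return [t for t in tokens if t]


class NaiveBayes:
    """Multinomial Naive Bayes with add-one smoothing (an assumed classifier).

    The training data must contain at least one example of each label.
    """

    def __init__(self, labeled):
        self.doc_counts = Counter()
        self.word_counts = {label: Counter() for label in LABELS}
        for text, label in labeled:
            self.doc_counts[label] += 1
            self.word_counts[label].update(tokenize(text))
        self.vocab = set()
        for counts in self.word_counts.values():
            self.vocab.update(counts)

    def predict(self, text):
        """Return (most likely label, posterior probability of that label)."""
        total_docs = sum(self.doc_counts.values())
        tokens = tokenize(text)
        log_scores = {}
        for label in LABELS:
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(self.doc_counts[label] / total_docs)
            for tok in tokens:
                score += math.log((counts[tok] + 1) / denom)
            log_scores[label] = score
        # Normalize in log space to obtain posterior probabilities.
        top = max(log_scores.values())
        weights = {l: math.exp(s - top) for l, s in log_scores.items()}
        best = max(weights, key=weights.get)
        return best, weights[best] / sum(weights.values())


def bootstrap(seed, snippets, threshold=0.6, rounds=3):
    """Grow the seed corpus with snippets the current model labels confidently."""
    corpus, pool = list(seed), list(snippets)
    for _ in range(rounds):
        model = NaiveBayes(corpus)
        predictions = [(s, *model.predict(s)) for s in pool]
        accepted = [(s, label) for s, label, p in predictions if p >= threshold]
        if not accepted:
            break  # nothing confident enough; stop early
        corpus.extend(accepted)
        pool = [s for s, _, p in predictions if p < threshold]
    return corpus


# Hypothetical seed corpus (stands in for the manually validated corpus).
SEED = [
    ("Paris is the capital city of France", "GEO"),
    ("The river flows through the city of Puebla", "GEO"),
    ("Paris Hilton attended the film premiere", "NO-GEO"),
    ("Washington signed the letter to his general", "NO-GEO"),
]

# Hypothetical snippets (stand in for text retrieved from the Web).
SNIPPETS = [
    "Flights to Puebla city in Mexico",
    "Hilton attended the premiere of his film",
    "Weather forecast for tomorrow",
]

if __name__ == "__main__":
    for text, label in bootstrap(SEED, SNIPPETS):
        print(f"{label:7s} {text}")
```

Snippets whose predicted label falls below the confidence threshold stay in the pool and are reconsidered in the next round, after the model has been retrained on the enlarged corpus. A real pipeline would also need a held-out, manually labeled test set to check that the added snippets improve GEO/NO-GEO classification rather than reinforce early errors.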
Pages: 472-480
Page count: 9