Word Sense Disambiguation Studio: A Flexible System for WSD Feature Extraction

被引:0
|
作者
Agre, Gennady [1 ]
Petrov, Daniel [2 ]
Keskinova, Simona [2 ]
机构
[1] Bulgarian Acad Sci, Inst Informat & Commun Technol, Sofia 1113, Bulgaria
[2] Tech Univ Sofia, Lab Comp Graph & Geog Informat Syst, Sofia 2173, Bulgaria
关键词
word sense disambiguation; word embedding; classification; neural networks; random forest; deep forest; JRip; KNOWLEDGE;
D O I
10.3390/info10030097
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The paper presents a flexible system for extracting features and creating training and test examples for solving the all-words sense disambiguation (WSD) task. The system allows integrating word and sense embeddings as part of an example description. The system possesses two unique features distinguishing it from all similar WSD systems-the ability to construct a special compressed representation for word embeddings and the ability to construct training and test sets of examples with different data granularity. The first feature allows generation of data sets with quite small dimensionality, which can be used for training highly accurate classifiers of different types. The second feature allows generating sets of examples that can be used for training classifiers specialized in disambiguating a concrete word, words belonging to the same part-of-speech (POS) category or all open class words. Intensive experimentation has shown that classifiers trained on examples created by the system outperform the standard baselines for measuring the behaviour of all-words WSD classifiers.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Research on Feature Weights of Liheci Word Sense Disambiguation
    Zhang, Zhenjing
    Li, Xinfu
    Tian, Xuedong
    2015 8TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2015, : 7 - 10
  • [22] Word Sense Disambiguation for XML Structure Feature Generation
    Tagarelli, Andrea
    Longo, Mario
    Greco, Sergio
    SEMANTIC WEB: RESEARCH AND APPLICATIONS, 2009, 5554 : 143 - 157
  • [23] Word Sense Disambiguation Based on Feature Ranking Graph
    Li, Yeqing
    Qiu, Xiaoyu
    2015 IEEE 29TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS WAINA 2015, 2015, : 209 - 212
  • [24] Performance comparison of word sense disambiguation (WSD) algorithm on Hindi language supporting search engines
    Rastogi, Parul
    Dwivedi, S.K.
    International Journal of Computer Science Issues, 2011, 8 (02): : 375 - 379
  • [25] Mnogoznal: an Unsupervised System for Word Sense Disambiguation
    Ustalov, Dmitry
    Teslenko, Denis
    Panchenko, Alexander
    Chernoskutov, Mikhail
    2017 INTERNATIONAL MULTI-CONFERENCE ON ENGINEERING, COMPUTER AND INFORMATION SCIENCES (SIBIRCON), 2017, : 147 - 150
  • [26] A Word Sense Disambiguation Method for Feature Level Sentiment Analysis
    Farooq, Umar
    Dhamala, Tej Prasad
    Nongaillard, Antoine
    Ouzrout, Yacine
    Qadir, Muhammad Abdul
    2015 9TH INTERNATIONAL CONFERENCE ON SOFTWARE, KNOWLEDGE, INFORMATION MANAGEMENT AND APPLICATIONS (SKIMA), 2015,
  • [27] Sense Space for Word Sense Disambiguation
    Kang, Myung Yun
    Min, Tae Hong
    Lee, Jae Sung
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 669 - 672
  • [28] Word sense disambiguation based on word sense clustering
    Anaya-Sanchez, Henry
    Pons-Porrata, Aurora
    Berlanga-Llavori, Rafael
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA-SBIA 2006, PROCEEDINGS, 2006, 4140 : 472 - 481
  • [29] A Word Sense Disambiguation System Based on Bayesian Model
    Zhang, Chunxiang
    He, Shan
    Gao, Xueyao
    PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 124 - 127
  • [30] The University of Jaén Word sense disambiguation system
    García-Vega, Manuel
    García-Cumbreras, Miguel A.
    Martín-Valdivia, M. Teresa
    Ureña-López, L. Alfonso
    Proc. SENSEVAL@ACL : Int. Workshop Eval. Syst. Semant. Anal. Text - Held coop. ACL, 1600, (121-124):