Word Sense Disambiguation Studio: A Flexible System for WSD Feature Extraction

Authors
Agre, Gennady [1]
Petrov, Daniel [2]
Keskinova, Simona [2]
Affiliations
[1] Bulgarian Acad Sci, Inst Informat & Commun Technol, Sofia 1113, Bulgaria
[2] Tech Univ Sofia, Lab Comp Graph & Geog Informat Syst, Sofia 2173, Bulgaria
Keywords
word sense disambiguation; word embedding; classification; neural networks; random forest; deep forest; JRip; KNOWLEDGE
DOI
10.3390/info10030097
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The paper presents a flexible system for extracting features and creating training and test examples for the all-words word sense disambiguation (WSD) task. The system allows word and sense embeddings to be integrated into an example description. Two features distinguish it from similar WSD systems: the ability to construct a special compressed representation for word embeddings, and the ability to construct training and test sets of examples at different levels of data granularity. The first feature allows the generation of data sets with quite small dimensionality, which can be used for training highly accurate classifiers of different types. The second feature allows the generation of example sets for training classifiers specialized in disambiguating a concrete word, words belonging to the same part-of-speech (POS) category, or all open-class words. Intensive experimentation has shown that classifiers trained on examples created by the system outperform the standard baselines for measuring the behaviour of all-words WSD classifiers.
Pages: 14