A comparison of graph-based word sense induction clustering algorithms in a pseudoword evaluation framework

被引:0
|
作者
Flavio Massimiliano Cecchini
Martin Riedl
Elisabetta Fersini
Chris Biemann
机构
[1] Università degli Studi di Milano - Bicocca,DISCo
[2] Universität Hamburg,Informatikum
来源
Language Resources and Evaluation | 2018年 / 52卷
关键词
Word sense induction; Graph clustering; Pseudowords; Evaluation;
D O I
暂无
中图分类号
学科分类号
摘要
This article presents a comparison of different Word Sense Induction (wsi) clustering algorithms on two novel pseudoword data sets of semantic-similarity and co-occurrence-based word graphs, with a special focus on the detection of homonymic polysemy. We follow the original definition of a pseudoword as the combination of two monosemous terms and their contexts to simulate a polysemous word. The evaluation is performed comparing the algorithm’s output on a pseudoword’s ego word graph (i.e., a graph that represents the pseudoword’s context in the corpus) with the known subdivision given by the components corresponding to the monosemous source words forming the pseudoword. The main contribution of this article is to present a self-sufficient pseudoword-based evaluation framework for wsi graph-based clustering algorithms, thereby defining a new evaluation measure (top2) and a secondary clustering process (hyperclustering). To our knowledge, we are the first to conduct and discuss a large-scale systematic pseudoword evaluation targeting the induction of coarse-grained homonymous word senses across a large number of graph clustering algorithms.
引用
收藏
页码:733 / 770
页数:37
相关论文
共 50 条
  • [21] Performance evaluation of decision tree graph-based induction
    Geamsakul, Warodom
    Matsuda, Takashi
    Yoshida, Tetsuya
    Motoda, Hiroshi
    Washio, Takashi
    2003, Springer Verlag (2843):
  • [22] Performance evaluation of decision tree graph-based induction
    Geamsakul, W
    Matsuda, T
    Yoshida, T
    Motoda, H
    Washio, T
    DISCOVERY SCIENCE, PROCEEDINGS, 2003, 2843 : 128 - 140
  • [23] Graph-based evolutionary algorithms
    Bryden, Kenneth Mark
    Ashlock, Daniel A.
    Corns, Steven
    Willson, Stephen J.
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2006, 10 (05) : 550 - 567
  • [24] Word Sense Disambiguation: A Unified Evaluation Framework and Empirical Comparison
    Raganato, Alessandro
    Camacho-Collados, Jose
    Navigli, Roberto
    15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, 2017, : 99 - 110
  • [25] Towards a unified framework for graph-based multi-view clustering
    Dornaika, F.
    El Hajjar, S.
    NEURAL NETWORKS, 2024, 173
  • [26] Generalized comparison of graph-based ranking algorithms for publications and authors
    Sidiropoulos, Antonis
    Manolopoulos, Yannis
    JOURNAL OF SYSTEMS AND SOFTWARE, 2006, 79 (12) : 1679 - 1700
  • [27] CO-graph: A new graph-based technique for cross-lingual word sense disambiguation
    Duque, Andres
    Araujo, Lourdes
    Martinez-Romo, Juan
    NATURAL LANGUAGE ENGINEERING, 2015, 21 (05) : 743 - 772
  • [28] Tovel: Distributed Graph Clustering for Word Sense Disambiguation
    Guerrieri, Alessio
    Rahimian, Fatemeh
    Girdzijauskas, Sarunas
    Montresor, Alberto
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 623 - 630
  • [29] A Graph-based Approach to Word Sense Disambiguation. An Unsupervised Method Based on Semantic Relatedness
    Arab, Meysam
    Jahromi, Mansoor Zolghadri
    Fakhrahmad, Seyed Mostafa
    2016 24TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2016, : 250 - 255
  • [30] A Novel Unsupervısed Graph-Based Algorıthm for Hindi Word Sense Disambiguation
    Jha P.
    Agarwal S.
    Abbas A.
    Siddiqui T.J.
    SN Computer Science, 4 (5)