A comparison of graph-based word sense induction clustering algorithms in a pseudoword evaluation framework

被引:0
|
作者
Flavio Massimiliano Cecchini
Martin Riedl
Elisabetta Fersini
Chris Biemann
机构
[1] Università degli Studi di Milano - Bicocca,DISCo
[2] Universität Hamburg,Informatikum
来源
Language Resources and Evaluation | 2018年 / 52卷
关键词
Word sense induction; Graph clustering; Pseudowords; Evaluation;
D O I
暂无
中图分类号
学科分类号
摘要
This article presents a comparison of different Word Sense Induction (wsi) clustering algorithms on two novel pseudoword data sets of semantic-similarity and co-occurrence-based word graphs, with a special focus on the detection of homonymic polysemy. We follow the original definition of a pseudoword as the combination of two monosemous terms and their contexts to simulate a polysemous word. The evaluation is performed comparing the algorithm’s output on a pseudoword’s ego word graph (i.e., a graph that represents the pseudoword’s context in the corpus) with the known subdivision given by the components corresponding to the monosemous source words forming the pseudoword. The main contribution of this article is to present a self-sufficient pseudoword-based evaluation framework for wsi graph-based clustering algorithms, thereby defining a new evaluation measure (top2) and a secondary clustering process (hyperclustering). To our knowledge, we are the first to conduct and discuss a large-scale systematic pseudoword evaluation targeting the induction of coarse-grained homonymous word senses across a large number of graph clustering algorithms.
引用
收藏
页码:733 / 770
页数:37
相关论文
共 50 条
  • [31] A Graph-Based Recommendation Framework for Price-Comparison Services
    Lee, Sang-Chul
    Kim, Sang-Wook
    Park, Sunju
    WWW'15 COMPANION: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2015, : 59 - 60
  • [32] Graph Clustering: a graph-based clustering algorithm for the electromagnetic calorimeter in LHCb
    Canudas, Nuria Valls
    Gomez, Miriam Calvo
    Vilasis-Cardona, Xavier
    Ribe, Elisabet Golobardes
    EUROPEAN PHYSICAL JOURNAL C, 2023, 83 (02):
  • [33] Graph Clustering: a graph-based clustering algorithm for the electromagnetic calorimeter in LHCb
    Núria Valls Canudas
    Míriam Calvo Gómez
    Xavier Vilasís-Cardona
    Elisabet Golobardes Ribé
    The European Physical Journal C, 83
  • [34] Graph-based hierarchical conceptual clustering
    Jonyer, I
    Cook, DJ
    Holder, LB
    JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (01) : 19 - 43
  • [35] Graph-based Medical Image Clustering
    Li, Jian
    Pan, Haiwei
    Zhang, Minghui
    Han, Qilong
    Feng, Xiaoning
    2012 8TH INTERNATIONAL CONFERENCE ON COMPUTING AND NETWORKING TECHNOLOGY (ICCNT, INC, ICCIS AND ICMIC), 2012, : 153 - 158
  • [36] A GRAPH-BASED APPROACH FOR SEMISUPERVISED CLUSTERING
    Yoshida, Tetsuya
    COMPUTATIONAL INTELLIGENCE, 2014, 30 (02) : 263 - 284
  • [37] Graph-based data clustering with overlaps
    Fellows, Michael R.
    Guo, Jiong
    Komusiewicz, Christian
    Niedermeier, Rolf
    Uhlmann, Johannes
    DISCRETE OPTIMIZATION, 2011, 8 (01) : 2 - 17
  • [38] Graph-Based Clustering of Dolphin Whistles
    Kipnis, Dror
    Diamant, Roee
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2216 - 2227
  • [39] Graph-Based Data Clustering with Overlaps
    Fellows, Michael R.
    Guo, Jiong
    Komusiewicz, Christian
    Niedermeier, Rolf
    Uhlmann, Johannes
    COMPUTING AND COMBINATORICS, PROCEEDINGS, 2009, 5609 : 516 - +
  • [40] Self-Weighted Graph-Based Framework for Multi-View Clustering
    He, Yanfang
    Yusof, Umi Kalsom
    IEEE ACCESS, 2023, 11 : 30197 - 30207