Learning multi-prototype word embedding from single-prototype word embedding with integrated knowledge

被引:9
|
作者
Yang, Xuefeng [1 ]
Mao, Kezhi [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Nanyang Ave, Singapore 639798, Singapore
关键词
Multi-prototype word embedding; Distributional semantic model; Fine tuning; Semantic similarity; SEMANTIC SIMILARITY;
D O I
10.1016/j.eswa.2016.03.013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Distributional semantic models (DSM) or word embeddings are widely used in prediction of semantic similarity and relatedness. However, context aware similarity and relatedness prediction is still a challenging issue because most DSM models or word embeddings use one vector per word without considering polysemy and homonym. In this paper, we propose a supervised fine tuning framework to transform the existing single-prototype word embeddings into multi-prototype word embeddings based on lexical semantic resources. As a post-processing step, the proposed framework is compatible with any sense inventory and any word embedding. To test the proposed learning framework, both intrinsic and extrinsic evaluations are conducted. Experiments results of 3 tasks with 8 datasets show that the multi-prototype word representations learned by the proposed framework outperform single-prototype word representations. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:291 / 299
页数:9
相关论文
共 50 条
  • [41] Few-shot named entity recognition with hybrid multi-prototype learning
    Liao, Zenghua
    Fei, Junbo
    Zeng, Weixin
    Zhao, Xiang
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2023, 26 (05): : 2521 - 2544
  • [42] Application of word embedding and machine learning in detecting phishing websites
    Routhu Srinivasa Rao
    Amey Umarekar
    Alwyn Roshan Pais
    Telecommunication Systems, 2022, 79 : 33 - 45
  • [43] Application of word embedding and machine learning in detecting phishing websites
    Rao, Routhu Srinivasa
    Umarekar, Amey
    Pais, Alwyn Roshan
    TELECOMMUNICATION SYSTEMS, 2022, 79 (01) : 33 - 45
  • [44] Exploring Chinese word embedding with similar context and reinforcement learning
    Zhang, Yun
    Liu, Yongguo
    Li, Dongxiao
    Zhai, Shuangqing
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (24): : 22287 - 22302
  • [45] Cyberbullying Detection using Deep Learning and Word Embedding Analysis
    On, Elif Pinar
    Yeniterzi, Reyyan
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [46] Turkish Tweet Sentiment Analysis with Word Embedding and Machine Learning
    Ayata, Deger
    Saraclar, Murat
    Ozgur, Arzucan
    2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [47] Unsupervised Word Embedding Learning by Incorporating Local and Global Contexts
    Meng, Yu
    Huang, Jiaxin
    Wang, Guangyuan
    Wang, Zihan
    Zhang, Chao
    Han, Jiawei
    FRONTIERS IN BIG DATA, 2020, 3
  • [48] Few-shot named entity recognition with hybrid multi-prototype learning
    Zenghua Liao
    Junbo Fei
    Weixin Zeng
    Xiang Zhao
    World Wide Web, 2023, 26 : 2521 - 2544
  • [49] Exploring Chinese word embedding with similar context and reinforcement learning
    Yun Zhang
    Yongguo Liu
    Dongxiao Li
    Shuangqing Zhai
    Neural Computing and Applications, 2022, 34 : 22287 - 22302
  • [50] Lifelong Domain Word Embedding via Meta-Learning
    Xu, Hu
    Liu, Bing
    Shu, Lei
    Yu, Philip S.
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 4510 - 4516