Automatic Keyword Extraction Using Word Embedding and Clustering

被引:0
|
作者
Zeng, Ping [1 ]
Tan, Qingping [1 ]
Yan, Ying [1 ]
Xie, Qinzheng [1 ]
Xu, Jianjun [1 ]
Cao, Wei [2 ]
机构
[1] Natl Univ Def Technol, Coll Comp, Changsha, Hunan, Peoples R China
[2] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha, Hunan, Peoples R China
关键词
keyword extraction; automatic keyword; keyword embedding;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Existing word-frequency-based algorithms for keyword extraction do not consider the semantic relationships among words. Moreover, word-graph-based algorithms cannot distinguish multiple topics, and topic-model-based algorithms possess high time complexity. All of these keyword extraction algorithms exhibit limitations. This paper proposes a new word-embedding-based algorithm, namely, WEC, for keyword extraction. The algorithm incorporates word frequency, effects of word co-occurrence, and semantic relationship among contexts. The algorithm also estimates the final weights of words with cosine similarity and pointwise mutual information and extracts topics by clustering. Experimental results show that the WEC algorithm outperforms state-of-the-art keyword extraction methods on four datasets when tested under various evaluation metrics.
引用
收藏
页码:1392 / 1408
页数:17
相关论文
共 50 条
  • [31] Automatic keyphrase extraction using word embeddings
    Zhang, Yuxiang
    Liu, Huan
    Wang, Suge
    Ip, W. H.
    Fan, Wei
    Xiao, Chunjing
    SOFT COMPUTING, 2020, 24 (08) : 5593 - 5608
  • [32] Automatic keyword extraction for news finder
    Martínez-Fernández, JL
    García-Serrano, A
    Martínez, P
    Villena, J
    ADAPTIVE MULTIMEDIA RETRIEVAL, 2004, 3094 : 99 - 119
  • [33] Automatic Keyword Extraction: An Ensemble Method
    Pay, Tayfun
    Lucci, Stephen
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 4816 - 4818
  • [34] Automatic Keyword Extraction Algorithm and Implementation
    Zhao, Hong
    Bai, Chensheng
    Zhu, Song
    FRONTIERS OF MANUFACTURING AND DESIGN SCIENCE, PTS 1-4, 2011, 44-47 : 4041 - +
  • [35] Audio keyword extraction by unsupervised word discovery
    Muscariello, Armando
    Gravier, Guillaume
    Bimbot, Frederic
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2811 - +
  • [36] RETRACTED: TextRank Keyword Extraction Algorithm Using Word Vector Clustering Based on Rough Data-Deduction (Retracted Article)
    Zhou, Ning
    Shi, Wenqian
    Liang, Renyu
    Zhong, Na
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [37] Automatic selection of clustering algorithms using supervised graph embedding
    Cohen-Shapira, Noy
    Rokach, Lior
    INFORMATION SCIENCES, 2021, 577 : 824 - 851
  • [38] Chinese keyword extraction based on word platform
    Jiao, Hui
    Liu, Qian
    Jia, Hui-bo
    FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2007, : 360 - +
  • [39] Geoscience keyphrase extraction algorithm using enhanced word embedding
    Qiu, Qinjun
    Xie, Zhong
    Wu, Liang
    Li, Wenjia
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 125 : 157 - 169
  • [40] Improving Precision in Automatic Keyword Extraction Using Attention Attractive Strings
    Kian, H. H.
    Zahedi, M.
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2013, 38 (08) : 2063 - 2068