Automatic Keyword Extraction Using Word Embedding and Clustering

被引:0
|
作者
Zeng, Ping [1 ]
Tan, Qingping [1 ]
Yan, Ying [1 ]
Xie, Qinzheng [1 ]
Xu, Jianjun [1 ]
Cao, Wei [2 ]
机构
[1] Natl Univ Def Technol, Coll Comp, Changsha, Hunan, Peoples R China
[2] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha, Hunan, Peoples R China
关键词
keyword extraction; automatic keyword; keyword embedding;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Existing word-frequency-based algorithms for keyword extraction do not consider the semantic relationships among words. Moreover, word-graph-based algorithms cannot distinguish multiple topics, and topic-model-based algorithms possess high time complexity. All of these keyword extraction algorithms exhibit limitations. This paper proposes a new word-embedding-based algorithm, namely, WEC, for keyword extraction. The algorithm incorporates word frequency, effects of word co-occurrence, and semantic relationship among contexts. The algorithm also estimates the final weights of words with cosine similarity and pointwise mutual information and extracts topics by clustering. Experimental results show that the WEC algorithm outperforms state-of-the-art keyword extraction methods on four datasets when tested under various evaluation metrics.
引用
收藏
页码:1392 / 1408
页数:17
相关论文
共 50 条
  • [21] Clustering analysis of process alarms using word embedding
    Cai, Shuang
    Zhang, Laibin
    Palazoglu, Ahmet
    Hu, Jinqiu
    JOURNAL OF PROCESS CONTROL, 2019, 83 : 11 - 19
  • [22] KEYWORD EXTRACTION BASED ON WORD SYNONYMS USING WORD2VEC
    Ogul, Iskender Ulgen
    Ozcan, Caner
    Hakdagli, Ozlem
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [23] Automatic Keyword Extraction on Twitter
    Marujo, Luis
    Ling, Wang
    Trancoso, Isabel
    Dyer, Chris
    Black, Alan W.
    Gershman, Anatole
    de Matos, David Martins
    Neto, Joao P.
    Carbonell, Jaime
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 637 - 643
  • [24] Automatic tag recommendation approach with keyphrase extraction and word embedding techniques
    Konkaew, Taechawat
    Kitisin, Sukumal
    Journal of Computers (Taiwan), 2019, 30 (02) : 135 - 149
  • [25] Automatic Image Annotation using Word Embedding Learning
    Chen, Qi
    Yip, Andy M.
    Tan, Chew Lim
    2012 IEEE 24TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2012), VOL 1, 2012, : 269 - 276
  • [26] Novel Word Features for Keyword Extraction
    Chen, Yiqun
    Yin, Jian
    Zhu, Weiheng
    Qiu, Shiding
    WEB-AGE INFORMATION MANAGEMENT (WAIM 2015), 2015, 9098 : 148 - 160
  • [27] Keyphrase Extraction Using Enhanced Word and Document Embedding
    Alotaibi, Fahd Saleh
    Sharma, Saurabh
    Gupta, Vishal
    Gupta, Savita
    IETE JOURNAL OF RESEARCH, 2023, 69 (12) : 8876 - 8888
  • [28] Keyword extraction based peer clustering
    Liang, BY
    Tang, J
    Li, JZ
    Wang, KH
    GRID AND COOPERATIVE COMPUTING GCC 2004, PROCEEDINGS, 2004, 3251 : 827 - 830
  • [29] Comparison of two schemes for automatic keyword extraction from MEDLINE for functional gene clustering
    Liu, Y
    Ciliax, BJ
    Borges, K
    Dasigi, V
    Ram, A
    Navathe, SB
    Dingledine, R
    2004 IEEE COMPUTATIONAL SYSTEMS BIOINFORMATICS CONFERENCE, PROCEEDINGS, 2004, : 394 - 404
  • [30] Automatic keyphrase extraction using word embeddings
    Yuxiang Zhang
    Huan Liu
    Suge Wang
    W. H. Ip.
    Wei Fan
    Chunjing Xiao
    Soft Computing, 2020, 24 : 5593 - 5608