Automatic Keyword Extraction Using Word Embedding and Clustering

Cited by: 0
Authors
Zeng, Ping [1]
Tan, Qingping [1]
Yan, Ying [1]
Xie, Qinzheng [1]
Xu, Jianjun [1]
Cao, Wei [2]
Affiliations
[1] Natl Univ Def Technol, Coll Comp, Changsha, Hunan, Peoples R China
[2] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha, Hunan, Peoples R China
Keywords
keyword extraction; automatic keyword; keyword embedding;
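DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808; 0809;
Abstract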
Existing word-frequency-based algorithms for keyword extraction do not consider the semantic relationships among words. Moreover, word-graph-based algorithms cannot distinguish multiple topics, and topic-model-based algorithms have high time complexity. Each of these approaches therefore has limitations. This paper proposes a new word-embedding-based algorithm for keyword extraction, named WEC. The algorithm incorporates word frequency, the effects of word co-occurrence, and the semantic relationships among contexts. It estimates the final weights of words using cosine similarity and pointwise mutual information, and extracts topics by clustering. Experimental results show that the WEC algorithm outperforms state-of-the-art keyword extraction methods on four datasets under various evaluation metrics.
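The abstract names cosine similarity and pointwise mutual information (PMI) as the two measures behind the word weights. The following is a minimal Python sketch of each measure on its own. It is not the WEC implementation: the function names `cosine` and `pmi_table` are illustrative, PMI is assumed here to be estimated from sentence-level co-occurrence, and this record does not state how WEC combines the two measures, so no combination is shown.

```python
import math
from collections import Counter
from itertools import combinations


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def pmi_table(sentences):
    """PMI for every word pair that co-occurs in at least one sentence.

    Assumption: p(w) is the fraction of sentences containing w, and
    p(a, b) is the fraction of sentences containing both a and b.
    Keys are (a, b) tuples with a < b in alphabetical order.
    """
    n = len(sentences)
    word_count = Counter()
    pair_count = Counter()
    for sentence in sentences:
        vocab = sorted(set(sentence))  # count each word once per sentence
        word_count.update(vocab)
        pair_count.update(combinations(vocab, 2))
    table = {}
    for (a, b), joint in pair_count.items():
        p_joint = joint / n
        p_a = word_count[a] / n
        p_b = word_count[b] / n
        table[(a, b)] = math.log(p_joint / (p_a * p_b))
    return table


# Toy corpus, invented for illustration only.
toy_sentences = [
    ["keyword", "extraction"],
    ["keyword", "extraction"],
    ["word", "embedding"],
    ["keyword", "embedding"],
]
toy_pmi = pmi_table(toy_sentences)
```

In this toy corpus, "extraction" and "keyword" co-occur more often than independence would predict, so their PMI is positive, while "embedding" and "keyword" receive a negative PMI. With real embeddings, `cosine` would be applied to the learned word vectors rather than to hand-written lists.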
Pages: 1392-1408
Page count: 17
Related papers
50 in total
  • [11] Using topic keyword clusters for automatic document clustering
    Chang, HC
    Hsu, CC
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (08) : 1852 - 1860
  • [12] Using topic keyword clusters for automatic document clustering
    Chang, HC
    Hsu, CC
    THIRD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS, VOL 1, PROCEEDINGS, 2005, : 419 - 424
  • [13] Keyword Clustering for Automatic Categorization
    Zhao, Qinpei
    Rezaei, Mohammad
    Chen, Hao
    Franti, Pasi
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 2845 - 2848
  • [14] TextRank Keyword Extraction Algorithm Using Word Vector Clustering Based on Rough Data-Deduction
    Zhou, Ning
    Shi, Wenqian
    Liang, Renyu
    Zhong, Na
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [16] News Keyword Extraction Algorithm Based on Semantic Clustering and Word Graph Model
    Xiong, Ao
    Liu, Derong
    Tian, Hongkang
    Liu, Zhengyuan
    Yu, Peng
    Kadoch, Michel
    TSINGHUA SCIENCE AND TECHNOLOGY, 2021, 26 (06) : 886 - 893
  • [17] Research on Pattern Representation Based on Keyword and Word Embedding in Chinese Entity Relation Extraction
    Ye, Feiyue
    Qin, Zhentao
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2018, 22 (04) : 475 - 482
  • [18] Keyword Extraction for Document Clustering Using Submodular Optimization
    Zhang, Xi
    Mueller, Klaus
    Yoo, Shinjae
    2017 NEW YORK SCIENTIFIC DATA SUMMIT (NYSDS), 2017,
  • [19] Automatic Audio Sentiment Extraction Using Keyword Spotting
    Kaushik, Lakshmish
    Sangwan, Abhijeet
    Hansen, John H. L.
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2709 - 2713
  • [20] Automatic Keyword Extraction for Scientific Literatures Using References
    Lu, Yanchun
    Li, Ruixuan
    Wen, Kunmei
    Lu, Zhengding
    PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON INNOVATIVE DESIGN AND MANUFACTURING (ICIDM), 2014, : 78 - 81