Automatic Keyword Extraction Using Word Embedding and Clustering

被引:0
|
作者
Zeng, Ping [1 ]
Tan, Qingping [1 ]
Yan, Ying [1 ]
Xie, Qinzheng [1 ]
Xu, Jianjun [1 ]
Cao, Wei [2 ]
机构
[1] Natl Univ Def Technol, Coll Comp, Changsha, Hunan, Peoples R China
[2] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha, Hunan, Peoples R China
关键词
keyword extraction; automatic keyword; keyword embedding;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Existing word-frequency-based algorithms for keyword extraction do not consider the semantic relationships among words. Moreover, word-graph-based algorithms cannot distinguish multiple topics, and topic-model-based algorithms possess high time complexity. All of these keyword extraction algorithms exhibit limitations. This paper proposes a new word-embedding-based algorithm, namely, WEC, for keyword extraction. The algorithm incorporates word frequency, effects of word co-occurrence, and semantic relationship among contexts. The algorithm also estimates the final weights of words with cosine similarity and pointwise mutual information and extracts topics by clustering. Experimental results show that the WEC algorithm outperforms state-of-the-art keyword extraction methods on four datasets when tested under various evaluation metrics.
引用
收藏
页码:1392 / 1408
页数:17
相关论文
共 50 条
  • [1] Impact analysis of keyword extraction using contextual word embedding
    Khan, Muhammad Qasim
    Shahid, Abdul
    Uddin, M. Irfan
    Roman, Muhammad
    Alharbi, Abdullah
    Alosaimi, Wael
    Almalki, Jameel
    Alshahrani, Saeed M.
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [2] Semantic Unsupervised Automatic Keyphrases Extraction by Integrating Word Embedding with Clustering Methods
    Gagliardi, Isabella
    Artese, Maria Teresa
    MULTIMODAL TECHNOLOGIES AND INTERACTION, 2020, 4 (02) : 1 - 20
  • [3] A Study on the Optimal Search Keyword Extraction and Retrieval Technique Generation Using Word Embedding
    Lee, Jeong-In
    Ahn, Jin-Hee
    Koh, Kyung-Taek
    Kim, YoungSeok
    JOURNAL OF THE KOREAN GEOSYNTHETIC SOCIETY, 2023, 22 (02): : 47 - 54
  • [4] Automatic keyword extraction with relational clustering and Levenshtein distances
    Runkler, TA
    Bezdek, JC
    NINTH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2000), VOLS 1 AND 2, 2000, : 636 - 640
  • [5] Automatic Keyword Extraction Using TextRank
    Wongchaisuwat, Papis
    2019 IEEE 6TH INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND APPLICATIONS (ICIEA), 2019, : 377 - 381
  • [6] Rapid Automatic Keyword Extraction and Word Frequency in Scientific Article Keywords Extraction
    Rinartha, Komang
    Kartika, Luh Gede Surya
    3RD INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS (ICORIS 2021), 2021, : 216 - 219
  • [7] Using keyword extraction for Web site clustering
    Tonella, P
    Ricca, F
    Pianta, E
    Girardi, C
    FIFTH IEEE INTERNATIONAL WORKSHOP ON WEB SITE EVOLUTION THEME: ARCHITECTURE, PROCEEDINGS, 2003, : 41 - 48
  • [8] Automatic keyword extraction using domain knowledge
    Hulth, A
    Karlgren, J
    Jonsson, A
    Boström, H
    Asker, L
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2001, 2004 : 472 - 482
  • [9] Automatic keyword extraction using linguistic features
    Hu, Xinghua
    Wu, Bin
    ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 19 - +
  • [10] Automatic Synonym Extraction Using Word2Vec and Spectral Clustering
    Zhang, Li
    Li, Jun
    Wang, Chao
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 5629 - 5632