Research on Cross Language Text Keyword Extraction Based on Information Entropy and TextRank

被引:0
|
作者
Zhang, Xiaoyu [1 ]
Wang, Yongbin [1 ]
Wu, Lin [1 ]
机构
[1] Commun Univ China, Internet Informat Res Inst, Beijing 100024, Peoples R China
关键词
component; information entropy; TextRank; keyword extraction; Cross language keyword extraction;
D O I
10.1109/itnec.2019.8728993
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In order to extract keywords from cross-language documents as accurately as possible, especially for the language whose keyword extraction technology is not mature, a text keyword extraction method based on information entropy and TextRank is proposed to extract the accurate keywords from the translated Chinese documents. This method determines the basic importance of words according to the information entropy of words, and then uses the information entropy of words to vote iteratively through the TextRank algorithm. This method solves the problem that TextRank algorithm easily extracts frequent non key words as keywords. The experimental results show that the proposed method can extract keywords more accurately than TextRank in the processing of cross-lingual bilingual translated documents.
引用
收藏
页码:16 / 19
页数:4
相关论文
共 50 条
  • [31] Research on Keyword Information Retrieve Based on Semantic
    Li, Xin
    Dong, Wanxin
    ADVANCES IN MULTIMEDIA, SOFTWARE ENGINEERING AND COMPUTING, VOL 1, 2011, 128 : 305 - +
  • [32] Lithuanian text summarization based on keyword cross-occurrence
    Petrauskas, Saulius
    Miseviciene, Regina
    INFORMATION TECHNOLOGIES' 2008, PROCEEDINGS, 2008, : 49 - 52
  • [33] A Text Feature Based Automatic Keyword Extraction Method for Single Documents
    Campos, Ricardo
    Mangaravite, Vitor
    Pasquali, Arian
    Jorge, Alipio Mario
    Nunes, Celia
    Jatowt, Adam
    ADVANCES IN INFORMATION RETRIEVAL (ECIR 2018), 2018, 10772 : 684 - 691
  • [34] Keyword Combination Extraction in Text Categorization Based on Ant Colony Optimization
    Yu, Zi-jun
    Wu, Wei-gang
    Xiao, Jing
    Zhang, Jun
    Huang, Rui-Zhang
    Liu, Ou
    2009 INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION, 2009, : 430 - +
  • [35] Reviewing Metaphor Research Based on Keyword Extraction Algorithm
    Zhang D.
    Gu F.
    Cui Z.
    Hu S.
    Zhang W.
    Lin H.
    Data Analysis and Knowledge Discovery, 2022, 6 (04) : 130 - 138
  • [36] The Research and Implementation of Keyword Extraction Algorithm Based on LDA
    Liu, Chengxia
    THIRD INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION; NETWORK AND COMPUTER TECHNOLOGY (ECNCT 2021), 2022, 12167
  • [37] Cross Language Information Extraction Knowledge Adaptation
    Wong, Tak-Lam
    Chow, Kai-On
    Lam, Wai
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, PROCEEDINGS, 2009, 5589 : 520 - +
  • [38] Extraction of English Keyword Information Based on CAD Mesh Model
    Wu, Xiuying
    Yang, Liuhui
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [39] Keyword Extraction Based on Statistical Information for Cyrillic Mongolian script
    Nyandag, Bat-Erdene
    Li, Ru
    Demberel, Orgil
    2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 2250 - 2255
  • [40] Text Feature Extraction based on Joint Conditional Entropy
    Chen, Yanmin
    Wang, Xinwei
    PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 2055 - 2058