Research on Cross Language Text Keyword Extraction Based on Information Entropy and TextRank

被引:0
|
作者
Zhang, Xiaoyu [1 ]
Wang, Yongbin [1 ]
Wu, Lin [1 ]
机构
[1] Commun Univ China, Internet Informat Res Inst, Beijing 100024, Peoples R China
关键词
component; information entropy; TextRank; keyword extraction; Cross language keyword extraction;
D O I
10.1109/itnec.2019.8728993
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In order to extract keywords from cross-language documents as accurately as possible, especially for the language whose keyword extraction technology is not mature, a text keyword extraction method based on information entropy and TextRank is proposed to extract the accurate keywords from the translated Chinese documents. This method determines the basic importance of words according to the information entropy of words, and then uses the information entropy of words to vote iteratively through the TextRank algorithm. This method solves the problem that TextRank algorithm easily extracts frequent non key words as keywords. The experimental results show that the proposed method can extract keywords more accurately than TextRank in the processing of cross-lingual bilingual translated documents.
引用
收藏
页码:16 / 19
页数:4
相关论文
共 50 条
  • [21] TextRank Keyword Extraction Algorithm Using Word Vector Clustering Based on Rough Data-Deduction
    Zhou, Ning
    Shi, Wenqian
    Liang, Renyu
    Zhong, Na
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [22] An Automatic Text Summary Extraction Method Based on Improved TextRank and TF-IDF
    Guan, Xinxin
    Li, Yeli
    Zeng, Qingtao
    Zhou, Chufeng
    2019 INTERNATIONAL CONFERENCE ON ADVANCED ELECTRONIC MATERIALS, COMPUTERS AND MATERIALS ENGINEERING (AEMCME 2019), 2019, 563
  • [23] Text Keyword Extraction Based on Meta-Learning Strategy
    Yuan, Man
    Zou, Chenhong
    2018 INTERNATIONAL CONFERENCE ON BIG DATA AND ARTIFICIAL INTELLIGENCE (BDAI 2018), 2018, : 78 - 81
  • [24] Research on method of text classification rule extraction based on genetic algorithm and entropy
    Computer Engineering Department of Nanhai Campus, South China Normal University, Foshan 528225, China
    Zhongshan Daxue Xuebao, 2007, 5 (18-21+24):
  • [25] Keyword Sequence Extraction Based on Byte Entropy Iterative Segmentation
    Ding, Siyuan
    Zhang, Xia
    Li, Ou
    Li, Shengxiang
    PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1530 - 1535
  • [26] An Unsupervised Keyword Extraction Method based on Text Semantic Graph
    Zhao, Liujun
    Miao, Zhongquan
    Wang, Chunming
    Kong, Weizheng
    2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022, : 1431 - 1436
  • [27] Research on Decision Tree Method of Medical Text Based on Information Extraction
    Wu, Zihong
    HEALTH INFORMATION PROCESSING. EVALUATION TRACK PAPERS, 2023, 1773 : 127 - 133
  • [28] Research on a Combined Ontology-based Text Information Extraction Technology
    Gong Yiguang
    Mei Ping
    2011 INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND MULTIMEDIA COMMUNICATION, 2011, : 115 - 118
  • [29] UNL as a text content representation language for information extraction
    Cardenosa, Jesus
    Gallardo, Carolina
    Iraola, Luis
    FLEXIBLE QUERY ANSWERING SYSTEMS, PROCEEDINGS, 2006, 4027 : 507 - 518
  • [30] Automatic In-Text Keyword Tagging based on Information Retrieval
    Kim, Jinsuk
    Jin, Du-Seok
    Kim, KwangYoung
    Choe, Ho-Seop
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2009, 5 (03): : 159 - 166