Keyword Acquisition for Language Composition Based on TextRank Automatic Summarization Approach

被引:0
|
作者
Jiang, Yan [1 ]
Xiang, Chunlin [1 ]
Li, Lingtong [1 ]
机构
[1] Presch Educ Coll, Dept Primary Educ, Chongqing 404047, Peoples R China
关键词
Language composition; keywords; best match 25; textrank; digests;
D O I
10.14569/IJACSA.2024.01504101
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
It is important to extract keywords from text quickly and accurately for composition analysis, but the accuracy of traditional keyword acquisition models is not high. Therefore, in this study, the Best Match 25 algorithm was first used to preprocess the compositions and evaluate the similarity between sentences. Then, TextRank was used to extract the abstract, construct segmentation and named entity model, and finally verify the research content. The results show that in the performance test, the Best Match 25 similarity algorithm has higher accuracy, recall rate and F1 value, the average running time is only 2182ms, and has the largest receiver working characteristic curve area, which is significantly higher than other models, reaching 0.954. The accuracy of TextRank algorithm is above 90%, the average accuracy of 100 text analysis is 94.23%, the average recall rate and F1 value are 96.67% and 95.85%, respectively. In comparison of the application of the four methods, the research model shows obvious advantages, the average keyword coverage rate is 94.54%, the average processing time of 16 texts is 11.29 seconds, and the average 24-hour memory usage is only 15.67%, which is lower than the other three methods. The experimental results confirm the superiority of the model in terms of keyword extraction accuracy. This research not only provides a new technical tool for language composition teaching and evaluation, but also provides a new idea and method for keyword extraction research in the field of natural language processing.
引用
收藏
页码:994 / 1005
页数:12
相关论文
共 50 条
  • [11] Fine-Tuning Textrank for Legal Document Summarization: A Bayesian Optimization Based Approach
    Jain, Deepali
    Borah, Malaya Dutta
    Biswas, Anupam
    PROCEEDINGS OF THE 12TH ANNUAL MEETING OF THE FORUM FOR INFORMATION RETRIEVAL EVALUATION (FIRE 2020), 2020, : 41 - 48
  • [12] Multifeature Fusion Keyword Extraction Algorithm Based on TextRank
    Guo, Wenming
    Wang, Zihao
    Han, Fang
    IEEE ACCESS, 2022, 10 : 71805 - 71813
  • [13] Automatic Keyword and Sentence-Based Text Summarization for Software Bug Reports
    Jindal, Shubhra Goyal
    Kaur, Arvinder
    IEEE ACCESS, 2020, 8 : 65352 - 65370
  • [14] Automatic Thai Text Summarization Using Keyword-Based Abstractive Method
    Ngamcharoen, Parun
    Sanglerdsinlapachai, Nuttapong
    Vejjanugraha, Pikul
    2022 17TH INTERNATIONAL JOINT SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE PROCESSING (ISAI-NLP 2022) / 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INTERNET OF THINGS (AIOT 2022), 2022,
  • [15] The Mixture of TextRank and LexRank Techniques of Single Document Automatic Summarization Research in Tibetan
    Li, Ailin
    Jiang, Tao
    Wang, Qingshuai
    Yu, Hongzhi
    2016 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL. 1, 2016, : 514 - 519
  • [16] Graph-Based Text Summarization Using Modified TextRank
    Mallick, Chirantana
    Das, Ajit Kumar
    Dutta, Madhurima
    Das, Asit Kumar
    Sarkar, Apurba
    SOFT COMPUTING IN DATA ANALYTICS, SCDA 2018, 2019, 758 : 137 - 146
  • [17] Automatic Keyword Extraction for Text Summarization in e-Newspapers
    Thomas, Justine Raju
    Bharti, Santosh Kumar
    Babu, Korra Sathya
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATICS AND ANALYTICS (ICIA' 16), 2016,
  • [18] An Improved TextRank-Based Method for Chinese Text Summarization
    Zheng, Xin
    Zhou, Tiantian
    Wang, Yintong
    Li, Shuo
    ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT II, 2022, 13339 : 140 - 149
  • [19] Language Agnostic Automatic Summarization Evaluation
    Tauchmann, Christopher
    Mieskes, Margot
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6656 - 6662
  • [20] A Semantic Based Approach for Automatic Patent Document Summarization
    Trappey, Amy J. C.
    Trappey, Charles V.
    Wu, Chun-Yi
    COLLABORATIVE PRODUCTIVE AND SERVICE LIFE CYCLE MANAGEMENT FOR A SUSTAINABLE WORLD, 2008, : 485 - +