Keyword Acquisition for Language Composition Based on TextRank Automatic Summarization Approach

被引:0
|
作者
Jiang, Yan [1 ]
Xiang, Chunlin [1 ]
Li, Lingtong [1 ]
机构
[1] Presch Educ Coll, Dept Primary Educ, Chongqing 404047, Peoples R China
关键词
Language composition; keywords; best match 25; textrank; digests;
D O I
10.14569/IJACSA.2024.01504101
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
It is important to extract keywords from text quickly and accurately for composition analysis, but the accuracy of traditional keyword acquisition models is not high. Therefore, in this study, the Best Match 25 algorithm was first used to preprocess the compositions and evaluate the similarity between sentences. Then, TextRank was used to extract the abstract, construct segmentation and named entity model, and finally verify the research content. The results show that in the performance test, the Best Match 25 similarity algorithm has higher accuracy, recall rate and F1 value, the average running time is only 2182ms, and has the largest receiver working characteristic curve area, which is significantly higher than other models, reaching 0.954. The accuracy of TextRank algorithm is above 90%, the average accuracy of 100 text analysis is 94.23%, the average recall rate and F1 value are 96.67% and 95.85%, respectively. In comparison of the application of the four methods, the research model shows obvious advantages, the average keyword coverage rate is 94.54%, the average processing time of 16 texts is 11.29 seconds, and the average 24-hour memory usage is only 15.67%, which is lower than the other three methods. The experimental results confirm the superiority of the model in terms of keyword extraction accuracy. This research not only provides a new technical tool for language composition teaching and evaluation, but also provides a new idea and method for keyword extraction research in the field of natural language processing.
引用
收藏
页码:994 / 1005
页数:12
相关论文
共 50 条
  • [31] Comparison of Document Index Graph Using TextRank and HITS Weighting Method in Automatic Text Summarization
    Hadyan, Fadhlil
    Shaufiah
    Bijaksana, Moch. Arif
    1ST INTERNATIONAL CONFERENCE ON COMPUTING AND APPLIED INFORMATICS 2016 : APPLIED INFORMATICS TOWARD SMART ENVIRONMENT, PEOPLE, AND SOCIETY, 2017, 801
  • [32] Automatic paper writing based on a RNN and the TextRank algorithm
    Wang, Hei-Chia
    Hsiao, Wei-Ching
    Chang, Sheng-Han
    APPLIED SOFT COMPUTING, 2020, 97 (97)
  • [33] A Modification to Graph Based Approach for Extraction Based Automatic Text Summarization
    Sehgal, Sunchit
    Kumar, Badal
    Maheshwar
    Rampal, Lakshay
    Chaliya, Ankit
    PROGRESS IN ADVANCED COMPUTING AND INTELLIGENT ENGINEERING, VOL 2, 2018, 564 : 373 - 378
  • [34] Automatic Spoken Language Acquisition Based on Observation and Dialogue
    Komatsu, Ryota
    Gao, Shengzhou
    Hou, Wenxin
    Zhang, Mingxin
    Tanaka, Tomohiro
    Toyoda, Keisuke
    Kimura, Yusuke
    Hino, Kent
    Iwamoto, Yu
    Mori, Kosuke
    Okamoto, Takuma
    Shinozaki, Takahiro
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (06) : 1480 - 1492
  • [35] A Knowledge Graph Based Approach for Automatic Speech and Essay Summarization
    Khadilkar, Kunal
    Kulkarni, Siddhivinayak
    Venkatraman, Sitalakshmi
    2019 IEEE 5TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2019,
  • [36] A Review on Optimization-Based Automatic Text Summarization Approach
    Wahab, Muhammad Hafizul H.
    Ali, Nor Hafiza
    Abdul Hamid, Nor Asilah Wati
    Subramaniam, Shamala K.
    Latip, Rohaya
    Othman, Mohamed
    IEEE ACCESS, 2024, 12 : 4892 - 4909
  • [37] A topic modeling based approach to novel document automatic summarization
    Wu, Zongda
    Lei, Li
    Li, Guiling
    Huang, Hui
    Zheng, Chengren
    Chen, Enhong
    Xu, Guandong
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 84 : 12 - 23
  • [38] Chinese Text Keyword Extraction Based on Doc2vec And TextRank
    Wang, Wei
    Li, Xiangshun
    Yu, Sheng
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 369 - 373
  • [39] A Review on Swarm Intelligence Based Approach for Automatic Document Summarization
    Widiartha, I. Made
    Hartati, Rukmi Sari
    Sastra, Nyoman Putra
    Wiharta, Dewa Made
    2021 INTERNATIONAL CONFERENCE ON SMART-GREEN TECHNOLOGY IN ELECTRICAL AND INFORMATION SYSTEMS (ICSGTEIS), 2021, : 155 - 160
  • [40] Automatic Text Summarization in Natural Language Processing
    Desai, M. R.
    Gachhinakatti, Bhagyashree
    Balaganur, Pooja
    Rajeshwari, Y.
    Rathod, Laxmi
    2021 IEEE INTERNATIONAL CONFERENCE ON MOBILE NETWORKS AND WIRELESS COMMUNICATIONS (ICMNWC), 2021,