Keyword Acquisition for Language Composition Based on TextRank Automatic Summarization Approach

被引：0

作者：

Jiang, Yan ^{[1
]}

Xiang, Chunlin ^{[1
]}

Li, Lingtong ^{[1
]}

机构：

[1] Presch Educ Coll, Dept Primary Educ, Chongqing 404047, Peoples R China

来源：

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS | 2024年 / 15卷 / 04期

关键词：

Language composition; keywords; best match 25; textrank; digests;

D O I：

10.14569/IJACSA.2024.01504101

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

It is important to extract keywords from text quickly and accurately for composition analysis, but the accuracy of traditional keyword acquisition models is not high. Therefore, in this study, the Best Match 25 algorithm was first used to preprocess the compositions and evaluate the similarity between sentences. Then, TextRank was used to extract the abstract, construct segmentation and named entity model, and finally verify the research content. The results show that in the performance test, the Best Match 25 similarity algorithm has higher accuracy, recall rate and F1 value, the average running time is only 2182ms, and has the largest receiver working characteristic curve area, which is significantly higher than other models, reaching 0.954. The accuracy of TextRank algorithm is above 90%, the average accuracy of 100 text analysis is 94.23%, the average recall rate and F1 value are 96.67% and 95.85%, respectively. In comparison of the application of the four methods, the research model shows obvious advantages, the average keyword coverage rate is 94.54%, the average processing time of 16 texts is 11.29 seconds, and the average 24-hour memory usage is only 15.67%, which is lower than the other three methods. The experimental results confirm the superiority of the model in terms of keyword extraction accuracy. This research not only provides a new technical tool for language composition teaching and evaluation, but also provides a new idea and method for keyword extraction research in the field of natural language processing.

引用

页码：994 / 1005

页数：12

共 50 条

[31] Comparison of Document Index Graph Using TextRank and HITS Weighting Method in Automatic Text Summarization
Hadyan, Fadhlil
Shaufiah
Bijaksana, Moch. Arif
1ST INTERNATIONAL CONFERENCE ON COMPUTING AND APPLIED INFORMATICS 2016 : APPLIED INFORMATICS TOWARD SMART ENVIRONMENT, PEOPLE, AND SOCIETY, 2017, 801
[32] Automatic paper writing based on a RNN and the TextRank algorithm
Wang, Hei-Chia
Hsiao, Wei-Ching
Chang, Sheng-Han
APPLIED SOFT COMPUTING, 2020, 97 (97)
[33] A Modification to Graph Based Approach for Extraction Based Automatic Text Summarization
Sehgal, Sunchit
Kumar, Badal
Maheshwar
Rampal, Lakshay
Chaliya, Ankit
PROGRESS IN ADVANCED COMPUTING AND INTELLIGENT ENGINEERING, VOL 2, 2018, 564 : 373 - 378
[34] Automatic Spoken Language Acquisition Based on Observation and Dialogue
Komatsu, Ryota
Gao, Shengzhou
Hou, Wenxin
Zhang, Mingxin
Tanaka, Tomohiro
Toyoda, Keisuke
Kimura, Yusuke
Hino, Kent
Iwamoto, Yu
Mori, Kosuke
Okamoto, Takuma
Shinozaki, Takahiro
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (06) : 1480 - 1492
[35] A Knowledge Graph Based Approach for Automatic Speech and Essay Summarization
Khadilkar, Kunal
Kulkarni, Siddhivinayak
Venkatraman, Sitalakshmi
2019 IEEE 5TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2019,
[36] A Review on Optimization-Based Automatic Text Summarization Approach
Wahab, Muhammad Hafizul H.
Ali, Nor Hafiza
Abdul Hamid, Nor Asilah Wati
Subramaniam, Shamala K.
Latip, Rohaya
Othman, Mohamed
IEEE ACCESS, 2024, 12 : 4892 - 4909
[37] A topic modeling based approach to novel document automatic summarization
Wu, Zongda
Lei, Li
Li, Guiling
Huang, Hui
Zheng, Chengren
Chen, Enhong
Xu, Guandong
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 84 : 12 - 23
[38] Chinese Text Keyword Extraction Based on Doc2vec And TextRank
Wang, Wei
Li, Xiangshun
Yu, Sheng
PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 369 - 373
[39] A Review on Swarm Intelligence Based Approach for Automatic Document Summarization
Widiartha, I. Made
Hartati, Rukmi Sari
Sastra, Nyoman Putra
Wiharta, Dewa Made
2021 INTERNATIONAL CONFERENCE ON SMART-GREEN TECHNOLOGY IN ELECTRICAL AND INFORMATION SYSTEMS (ICSGTEIS), 2021, : 155 - 160
[40] Automatic Text Summarization in Natural Language Processing
Desai, M. R.
Gachhinakatti, Bhagyashree
Balaganur, Pooja
Rajeshwari, Y.
Rathod, Laxmi
2021 IEEE INTERNATIONAL CONFERENCE ON MOBILE NETWORKS AND WIRELESS COMMUNICATIONS (ICMNWC), 2021,

← 1 2 3 4 5 →