Keyword Acquisition for Language Composition Based on TextRank Automatic Summarization Approach

被引：0

作者：

Jiang, Yan ^{[1
]}

Xiang, Chunlin ^{[1
]}

Li, Lingtong ^{[1
]}

机构：

[1] Presch Educ Coll, Dept Primary Educ, Chongqing 404047, Peoples R China

来源：

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS | 2024年 / 15卷 / 04期

关键词：

Language composition; keywords; best match 25; textrank; digests;

D O I：

10.14569/IJACSA.2024.01504101

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

It is important to extract keywords from text quickly and accurately for composition analysis, but the accuracy of traditional keyword acquisition models is not high. Therefore, in this study, the Best Match 25 algorithm was first used to preprocess the compositions and evaluate the similarity between sentences. Then, TextRank was used to extract the abstract, construct segmentation and named entity model, and finally verify the research content. The results show that in the performance test, the Best Match 25 similarity algorithm has higher accuracy, recall rate and F1 value, the average running time is only 2182ms, and has the largest receiver working characteristic curve area, which is significantly higher than other models, reaching 0.954. The accuracy of TextRank algorithm is above 90%, the average accuracy of 100 text analysis is 94.23%, the average recall rate and F1 value are 96.67% and 95.85%, respectively. In comparison of the application of the four methods, the research model shows obvious advantages, the average keyword coverage rate is 94.54%, the average processing time of 16 texts is 11.29 seconds, and the average 24-hour memory usage is only 15.67%, which is lower than the other three methods. The experimental results confirm the superiority of the model in terms of keyword extraction accuracy. This research not only provides a new technical tool for language composition teaching and evaluation, but also provides a new idea and method for keyword extraction research in the field of natural language processing.

引用

页码：994 / 1005

页数：12

共 50 条

[41] Research on Keyword Extraction Based on Word2Vec Weighted TextRank
Wen, Yujun
Yuan, Hui
Zhang, Pengzhou
2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 2109 - 2113
[42] Multi-documents Summarization Based on the TextRank and Its Application in Argumentation System
Xiong, Caiquan
Li, Yuan
Lv, Ke
ADVANCES IN INTERNETWORKING, DATA & WEB TECHNOLOGIES, EIDWT-2017, 2018, 6 : 457 - 466
[43] A new approach to automatic speech summarization
Hori, C
Furui, S
IEEE TRANSACTIONS ON MULTIMEDIA, 2003, 5 (03) : 368 - 378
[44] Practical approach to automatic text summarization
Hynek, J
Jezek, K
FROM INFORMATION TO KNOWLEDGE, 2003, : 378 - 388
[45] A hybrid approach to automatic text summarization
Chang, Te-Min
Hsiao, Wen-Feng
2008 IEEE 8TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2008, : 65 - +
[46] A statistical approach to automatic speech summarization
Hori, C
Furui, S
Malkin, R
Yu, H
Waibel, A
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (02) : 128 - 139
[47] An Approach to Automatic Summarization of Television Programs
Canora, Marco
Garcia-Granada, Fernando
Sanchis, Emilio
Segarra, Encarna
SPEECH AND COMPUTER (SPECOM 2018), 2018, 11096 : 86 - 93
[48] A Statistical Approach to Automatic Speech Summarization
Chiori Hori
Sadaoki Furui
Rob Malkin
Hua Yu
Alex Waibel
EURASIP Journal on Advances in Signal Processing, 2003
[49] A statistical approach to automatic speech summarization
Hori, Chiori
Furul, Sadaoki
Malkin, Rob
Yu, Hua
Waibel, Alex
Eurasip Journal on Applied Signal Processing, 1600, 2003 (02): : 128 - 139
[50] A Hybrid Approach For Automatic Document Summarization
Rani, Siji S.
Sreejith, K.
Sanker, Arjun
2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 663 - 669

← 1 2 3 4 5 →