Keyword Acquisition for Language Composition Based on TextRank Automatic Summarization Approach

被引:0
|
作者
Jiang, Yan [1 ]
Xiang, Chunlin [1 ]
Li, Lingtong [1 ]
机构
[1] Presch Educ Coll, Dept Primary Educ, Chongqing 404047, Peoples R China
关键词
Language composition; keywords; best match 25; textrank; digests;
D O I
10.14569/IJACSA.2024.01504101
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
It is important to extract keywords from text quickly and accurately for composition analysis, but the accuracy of traditional keyword acquisition models is not high. Therefore, in this study, the Best Match 25 algorithm was first used to preprocess the compositions and evaluate the similarity between sentences. Then, TextRank was used to extract the abstract, construct segmentation and named entity model, and finally verify the research content. The results show that in the performance test, the Best Match 25 similarity algorithm has higher accuracy, recall rate and F1 value, the average running time is only 2182ms, and has the largest receiver working characteristic curve area, which is significantly higher than other models, reaching 0.954. The accuracy of TextRank algorithm is above 90%, the average accuracy of 100 text analysis is 94.23%, the average recall rate and F1 value are 96.67% and 95.85%, respectively. In comparison of the application of the four methods, the research model shows obvious advantages, the average keyword coverage rate is 94.54%, the average processing time of 16 texts is 11.29 seconds, and the average 24-hour memory usage is only 15.67%, which is lower than the other three methods. The experimental results confirm the superiority of the model in terms of keyword extraction accuracy. This research not only provides a new technical tool for language composition teaching and evaluation, but also provides a new idea and method for keyword extraction research in the field of natural language processing.
引用
收藏
页码:994 / 1005
页数:12
相关论文
共 50 条
  • [41] Research on Keyword Extraction Based on Word2Vec Weighted TextRank
    Wen, Yujun
    Yuan, Hui
    Zhang, Pengzhou
    2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 2109 - 2113
  • [42] Multi-documents Summarization Based on the TextRank and Its Application in Argumentation System
    Xiong, Caiquan
    Li, Yuan
    Lv, Ke
    ADVANCES IN INTERNETWORKING, DATA & WEB TECHNOLOGIES, EIDWT-2017, 2018, 6 : 457 - 466
  • [43] A new approach to automatic speech summarization
    Hori, C
    Furui, S
    IEEE TRANSACTIONS ON MULTIMEDIA, 2003, 5 (03) : 368 - 378
  • [44] Practical approach to automatic text summarization
    Hynek, J
    Jezek, K
    FROM INFORMATION TO KNOWLEDGE, 2003, : 378 - 388
  • [45] A hybrid approach to automatic text summarization
    Chang, Te-Min
    Hsiao, Wen-Feng
    2008 IEEE 8TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2008, : 65 - +
  • [46] A statistical approach to automatic speech summarization
    Hori, C
    Furui, S
    Malkin, R
    Yu, H
    Waibel, A
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (02) : 128 - 139
  • [47] An Approach to Automatic Summarization of Television Programs
    Canora, Marco
    Garcia-Granada, Fernando
    Sanchis, Emilio
    Segarra, Encarna
    SPEECH AND COMPUTER (SPECOM 2018), 2018, 11096 : 86 - 93
  • [48] A Statistical Approach to Automatic Speech Summarization
    Chiori Hori
    Sadaoki Furui
    Rob Malkin
    Hua Yu
    Alex Waibel
    EURASIP Journal on Advances in Signal Processing, 2003
  • [49] A statistical approach to automatic speech summarization
    Hori, Chiori
    Furul, Sadaoki
    Malkin, Rob
    Yu, Hua
    Waibel, Alex
    Eurasip Journal on Applied Signal Processing, 1600, 2003 (02): : 128 - 139
  • [50] A Hybrid Approach For Automatic Document Summarization
    Rani, Siji S.
    Sreejith, K.
    Sanker, Arjun
    2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 663 - 669