Research of news text classification method based on hierarchical semantics and prior correction

被引:0
|
作者
Sun, Ping [1 ]
Song, LinLin [2 ]
Yuan, Ling [2 ]
Yu, Haiping [1 ]
Wei, Yinzhen [1 ]
机构
[1] Wuhan Vocational College of Software and Engineering, Hubei, Wuhan, China
[2] School of Computer Science and Technology, Huazhong University of Science and Technology, Hubei, Wuhan, China
来源
基金
中国国家自然科学基金;
关键词
Classification (of information) - Deep learning - Learning algorithms - Learning systems - Natural language processing systems - Text processing;
D O I
10.3233/JIFS-238433
中图分类号
学科分类号
摘要
News text is an important branch of natural language processing. Compared to ordinary texts, news text has significant economic and scientific value. The characteristics of news text include structural hierarchy, diverse label categories, and limited high-quality annotation samples. Many machine learning and deep learning methods exist to analyze various forms of news text. However, due to label imbalance, hierarchical semantics, and confusing labels, current methods have limitations. Therefore, this paper proposes a news text classification framework based on hierarchical semantics and prior correction (HSPC). Firstly, data augmentation is used to enhance the diversity of the training set and adversarial learning is employed to improve the resistance of the model with its robustness. Then, a hierarchical feature extraction approach is employed to extract semantic features from different levels of news texts. Consequentially, a feature fusion method is designed to allow the model to focus on relevant hierarchical semantics for label classification. Finally, highly confusing label predictions are corrected to optimize the label prediction of the model and improve confidence. Multiple experiments are performed on four widely used public datasets. The experimental results indicate that HSPC achieves higher classification accuracy compared to other models. On the FCT, AGNews, THUCNews, and Ohsumed datasets, HSPC improves the accuracy by 1.03%, 1.38%, 2.55%, and 1.15%, respectively, compared to state-of-the-art methods. This validates the rationality and effectiveness of the designed mechanisms. © 2024 - The authors. Published by IOS Press.
引用
收藏
页码:8185 / 8203
相关论文
共 50 条
  • [41] RETRACTED: News Text Classification Method Based on the GRU_CNN Model (Retracted Article)
    Deng, Lujuan
    Ge, Qingxia
    Zhang, Jiaxue
    Li, Zuhe
    Yu, Zeqi
    Yin, Tiantian
    Zhu, Hanxue
    INTERNATIONAL TRANSACTIONS ON ELECTRICAL ENERGY SYSTEMS, 2022, 2022
  • [42] Research on Design of News Video Retrieval System Based on Semantics
    Zhang Xuhua
    2022 THE 6TH INTERNATIONAL CONFERENCE ON VIRTUAL AND AUGMENTED REALITY SIMULATIONS, ICVARS 2022, 2022, : 71 - 75
  • [43] Semantics-based event-driven web news classification
    Hu, Wei
    Sheng, Huan-Ye
    FRONTIERS OF HIGH PERFORMANCE COMPUTING AND NETWORKING - ISPA 2007 WORKSHOPS, 2007, 4743 : 136 - +
  • [44] A Text Classification Method Based on Cascade
    Li, Hui
    Zhang, Qi
    Lu, Huchuan
    Yang, Deli
    ADVANCES IN COGNITIVE NEURODYNAMICS, PROCEEDINGS, 2008, : 927 - +
  • [45] News Text Classification Based on an Improved Convolutional Neural Network
    Tao, Wenjing
    Chang, Dan
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2019, 26 (05): : 1400 - 1409
  • [46] Deep Learning-Based Algorithm for Classification of News Text
    Yu Li, Xiao
    Han, Ling Bo
    Feng Jiang, Zheng
    IEEE ACCESS, 2024, 12 : 159086 - 159098
  • [47] Latent semantic text classification method research based on support vector machine
    Lu Q.
    Wang Y.
    International Journal of Information and Communication Technology, 2019, 15 (03) : 243 - 255
  • [48] Tibetan News Text Classification Based on Graph Convolutional Networks
    Xu G.
    Zhang Z.
    Yu S.
    Dong Y.
    Tian Y.
    Data Analysis and Knowledge Discovery, 2023, 7 (06) : 73 - 85
  • [49] Research on feature classification method of network text data based on association rules
    Huang H.
    International Journal of Computers and Applications, 2020, 42 (02) : 157 - 163
  • [50] Research on method of text classification rule extraction based on genetic algorithm and entropy
    Computer Engineering Department of Nanhai Campus, South China Normal University, Foshan 528225, China
    Zhongshan Daxue Xuebao, 2007, 5 (18-21+24):