Research of news text classification method based on hierarchical semantics and prior correction

被引:0
|
作者
Sun, Ping [1 ]
Song, LinLin [2 ]
Yuan, Ling [2 ]
Yu, Haiping [1 ]
Wei, Yinzhen [1 ]
机构
[1] Wuhan Vocational College of Software and Engineering, Hubei, Wuhan, China
[2] School of Computer Science and Technology, Huazhong University of Science and Technology, Hubei, Wuhan, China
来源
基金
中国国家自然科学基金;
关键词
Classification (of information) - Deep learning - Learning algorithms - Learning systems - Natural language processing systems - Text processing;
D O I
10.3233/JIFS-238433
中图分类号
学科分类号
摘要
News text is an important branch of natural language processing. Compared to ordinary texts, news text has significant economic and scientific value. The characteristics of news text include structural hierarchy, diverse label categories, and limited high-quality annotation samples. Many machine learning and deep learning methods exist to analyze various forms of news text. However, due to label imbalance, hierarchical semantics, and confusing labels, current methods have limitations. Therefore, this paper proposes a news text classification framework based on hierarchical semantics and prior correction (HSPC). Firstly, data augmentation is used to enhance the diversity of the training set and adversarial learning is employed to improve the resistance of the model with its robustness. Then, a hierarchical feature extraction approach is employed to extract semantic features from different levels of news texts. Consequentially, a feature fusion method is designed to allow the model to focus on relevant hierarchical semantics for label classification. Finally, highly confusing label predictions are corrected to optimize the label prediction of the model and improve confidence. Multiple experiments are performed on four widely used public datasets. The experimental results indicate that HSPC achieves higher classification accuracy compared to other models. On the FCT, AGNews, THUCNews, and Ohsumed datasets, HSPC improves the accuracy by 1.03%, 1.38%, 2.55%, and 1.15%, respectively, compared to state-of-the-art methods. This validates the rationality and effectiveness of the designed mechanisms. © 2024 - The authors. Published by IOS Press.
引用
收藏
页码:8185 / 8203
相关论文
共 50 条
  • [31] Hierarchical Text Classification based on LDA and Domain Ontology
    An, Wei
    Liu, Qihua
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY II, PTS 1-4, 2013, 411-414 : 1112 - +
  • [32] A Text Classification Algorithm Based on Rocchio and Hierarchical Clustering
    Zeng, Anping
    Huang, Yongping
    ADVANCED INTELLIGENT COMPUTING, 2011, 6838 : 432 - +
  • [33] Hierarchical text classification based on support vector machines
    Jin, Ting
    Lei, Jingsheng
    Journal of Information and Computational Science, 2009, 6 (01): : 543 - 551
  • [34] Hierarchical Label Text Classification Method with Deep Label Assisted Classification Task
    Yukun, Cao
    Ziyue, Wei
    Yijia, Tang
    Chengkun, Jin
    Yunfeng, Li
    Computer Engineering and Applications, 2024, 60 (10) : 105 - 112
  • [35] Hierarchical approaches to Text-based Offense Classification
    Choi, Jay
    Kilmer, David
    Mueller-Smith, Michael
    Taheri, Sema A.
    SCIENCE ADVANCES, 2023, 9 (09)
  • [36] Research on Text Classification Based on TextRank
    Lu, Guangming
    Xia, Yule
    Wang, Jiamei
    Yang, Zhenling
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, INFORMATION MANAGEMENT AND NETWORK SECURITY, 2016, 47 : 319 - 322
  • [37] Two-level hierarchical combination method for text classification
    Li, Wen
    Miao, Duoqian
    Wang, Weili
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (03) : 2030 - 2039
  • [38] BCC: BIDIRECTIONAL CONSISTENCY CONSTRAINT METHOD FOR HIERARCHICAL TEXT CLASSIFICATION
    Shen, Yinghan
    Yan, Yu
    Yin, Dechun
    Shen, Huawei
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024), 2024, : 11271 - 11275
  • [39] Research article classification with text mining method
    Gurbuz, Tugba
    Uluyol, Celebi
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (01):
  • [40] Meta-Information Fusion of Hierarchical Semantics Dependency and Graph Structure for Structured Text Classification
    Wang, Shaokang
    Pan, Li
    Wu, Yu
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2023, 17 (02)