The Study on the Text Classification for Financial News Based on Partial Information

Cited by: 8
Authors
Zhao, Wenjie [1]
Zhang, Gaoyu [2]
Yuan, George [3, 4]
Liu, Jun [5]
Shan, Hongtao [1]
Zhang, Shuyi [6]
Affiliations
[1] Shanghai Univ Engn Sci, Sch Elect & Elect Engn, Shanghai 201620, Peoples R China
[2] Shanghai Lixin Univ Accounting & Finance, Sch Informat Management, Shanghai 201209, Peoples R China
[3] Chengdu Univ, Business Sch, Chengdu 610106, Peoples R China
[4] Shanghai Lixin Univ Accounting & Finance, Sch Fintech, Shanghai 201209, Peoples R China
[5] Shanghai Lixin Univ Accounting & Finance, Shanghai 201209, Peoples R China
[6] Shanghai Lixin Univ Accounting & Finance, Lixin Res Inst, Shanghai 201209, Peoples R China
Keywords
Text categorization; Logic gates; Feature extraction; Machine learning; Neural networks; Finance; Classification algorithms; Financial news; natural language processing (NLP); text processing; EXTRACTION; ENTITY;
DOI
10.1109/ACCESS.2020.2997969
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper studies text classification for financial news based on partial information. Because text classification is an indispensable step for the efficient use of topic information embedded in financial news, we propose a new neural network called “All Dataset based on CharCNN (Character Convolutional Neural Networks) and GRU (Gated Recurrent Unit)” (AD-CharCGNN for short), which extracts a part of each financial article and incorporates both the time domain and the spatial domain to classify financial texts. We first build a character-level vocabulary by reading all characters of the financial dataset; the part of each financial text to be classified is then mapped to a high-dimensional spatial vector based on this vocabulary. Next, the vectors are convolved in the spatial domain to obtain local text features, and these features are processed by gated recurrent units to obtain features that contain time information. Finally, the features containing spatial and time information are classified through a softmax function to produce the text classification results. Our experiments confirm that the proposed network works effectively, with an accuracy of 96.45%, and suggest that a text classification algorithm that uses features from only part of the text is more suitable for practical application. Moreover, because the input is a character-level vector, the network is suitable not only for Chinese but also for other languages.
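The first two steps described in the abstract (building a character-level vocabulary, then encoding only part of each text) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the special tokens `<pad>` and `<unk>`, the function names `build_char_vocab` and `encode_partial`, the choice of the leading `max_len` characters as the "partial" text, and the toy corpus are all assumptions made for illustration. The convolution, GRU, and softmax stages are omitted.

```python
from typing import Dict, List

# Hypothetical special tokens; the abstract does not say how padding or
# unseen characters are handled.
PAD, UNK = "<pad>", "<unk>"


def build_char_vocab(texts: List[str]) -> Dict[str, int]:
    """Assign an integer id to every distinct character in the corpus."""
    vocab = {PAD: 0, UNK: 1}
    for text in texts:
        for ch in text:
            if ch not in vocab:
                vocab[ch] = len(vocab)
    return vocab


def encode_partial(text: str, vocab: Dict[str, int], max_len: int) -> List[int]:
    """Keep only the first max_len characters (an assumed choice of
    'partial' text), map them to ids, and right-pad to max_len."""
    ids = [vocab.get(ch, vocab[UNK]) for ch in text[:max_len]]
    return ids + [vocab[PAD]] * (max_len - len(ids))


# Toy corpus mixing Chinese and English, since the input is character level.
corpus = ["股市上涨", "Bond yields fall"]
vocab = build_char_vocab(corpus)
encoded = [encode_partial(t, vocab, max_len=8) for t in corpus]
```

Each entry of `encoded` is a fixed-length list of character ids; in a full model such id sequences would typically be passed to an embedding layer before the convolutional and recurrent stages.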
Pages: 100426-100437
Number of pages: 12