The Study on the Text Classification for Financial News Based on Partial Information

Cited by: 8
Authors
Zhao, Wenjie [1]
Zhang, Gaoyu [2]
Yuan, George [3, 4]
Liu, Jun [5]
Shan, Hongtao [1]
Zhang, Shuyi [6]
Affiliations
[1] Shanghai Univ Engn Sci, Sch Elect & Elect Engn, Shanghai 201620, Peoples R China
[2] Shanghai Lixin Univ Accounting & Finance, Sch Informat Management, Shanghai 201209, Peoples R China
[3] Chengdu Univ, Business Sch, Chengdu 610106, Peoples R China
[4] Shanghai Lixin Univ Accounting & Finance, Sch Fintech, Shanghai 201209, Peoples R China
[5] Shanghai Lixin Univ Accounting & Finance, Shanghai 201209, Peoples R China
[6] Shanghai Lixin Univ Accounting & Finance, Lixin Res Inst, Shanghai 201209, Peoples R China
Keywords
Text categorization; Logic gates; Feature extraction; Machine learning; Neural networks; Finance; Classification algorithms; Financial news; natural language processing (NLP); text processing; EXTRACTION; ENTITY
DOI
10.1109/ACCESS.2020.2997969
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper studies text classification for financial news based on partial information. Because text classification is an indispensable step in making efficient use of the topic information embedded in financial news, we propose a new neural network called “All Dataset based on CharCNN (Character Convolutional Neural Networks) and GRU (Gated Recurrent Unit)” (AD-CharCGNN for short), which extracts part of a financial article and incorporates both the time domain and the spatial domain to classify financial texts. We first build a character-level vocabulary by reading all characters of the financial dataset; the part of each financial text to be classified is then mapped to a high-dimensional spatial vector based on this vocabulary. Next, the vectors are convolved in the spatial domain to obtain local text features, and these features are processed by gated recurrent units to obtain features that contain time information. Finally, the features containing both spatial and time information are classified through a softmax function to produce the text classification results. Our experiments confirm that the proposed network works effectively, with an accuracy of 96.45%, and suggest that a text classification algorithm that takes features from only part of the text is better suited to practical application. Moreover, because the input is a character-level vector, the network is suitable not only for Chinese but also for other languages.
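The first two steps of the abstract (building a character-level vocabulary from the whole dataset, then encoding only part of each text) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `build_char_vocab` and `encode_partial`, the reserved `PAD`/`UNK` indices, the choice of the first `max_len` characters as the "part" of the text, and the toy corpus are all assumptions made for illustration. The resulting integer sequence is what an embedding or one-hot layer would turn into vectors before the convolution and GRU stages.

```python
from typing import Dict, Iterable, List

# Reserved indices: an assumption of this sketch, not taken from the paper.
PAD, UNK = 0, 1


def build_char_vocab(texts: Iterable[str]) -> Dict[str, int]:
    """Map every distinct character in the corpus to an integer index."""
    vocab: Dict[str, int] = {}
    for text in texts:
        for ch in text:
            if ch not in vocab:
                vocab[ch] = len(vocab) + 2  # indices 0 and 1 are reserved
    return vocab


def encode_partial(text: str, vocab: Dict[str, int], max_len: int) -> List[int]:
    """Encode only the first max_len characters; pad shorter texts with PAD.

    Taking the leading characters is an assumed reading of "partial
    information"; the abstract does not say which part is extracted.
    """
    ids = [vocab.get(ch, UNK) for ch in text[:max_len]]
    return ids + [PAD] * (max_len - len(ids))


# Toy corpus (hypothetical headlines, not from the paper's dataset).
corpus = ["股市上涨", "央行降息"]
vocab = build_char_vocab(corpus)

encoded = encode_partial("股市下跌了", vocab, max_len=4)  # truncated, with unknowns
short = encode_partial("降息", vocab, max_len=4)          # padded to max_len
print(encoded, short)
```

Because the vocabulary is built over characters rather than words, the same two functions work unchanged on text in other languages, which is consistent with the abstract's closing remark.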
Pages: 100426-100437
Page count: 12