The Study on the Text Classification for Financial News Based on Partial Information

被引:8
|
作者
Zhao, Wenjie [1 ]
Zhang, Gaoyu [2 ]
Yuan, George [3 ,4 ]
Liu, Jun [5 ]
Shan, Hongtao [1 ]
Zhang, Shuyi [6 ]
机构
[1] Shanghai Univ Engn Sci, Sch Elect & Elect Engn, Shanghai 201620, Peoples R China
[2] Shanghai Lixin Univ Accounting & Finance, Sch Informat Management, Shanghai 201209, Peoples R China
[3] Chengdu Univ, Business Sch, Chengdu 610106, Peoples R China
[4] Shanghai Lixin Univ Accounting & Finance, Sch Finbtech, Shanghai 201209, Peoples R China
[5] Shanghai Lixin Univ Accounting & Finance, Shanghai 201209, Peoples R China
[6] Shanghai Lixin Univ Accounting & Finance, Lixin Res Inst, Shanghai 201209, Peoples R China
关键词
Text categorization; Logic gates; Feature extraction; Machine learning; Neural networks; Finance; Classification algorithms; Financial news; natural language processing (NLP); text processing; EXTRACTION; ENTITY;
D O I
10.1109/ACCESS.2020.2997969
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The goal of this paper is to conduct the study on the text classification for financial news based on partial information. By a fact that an indispensable step for the efficient use of topic information embedded in financial news is the text classification, a new neural network called & x201C;All Dataset based on CharCNN (Character Convolutional Neural Networks) and GRU (Gated Recurrent Unit)& x201D; (in short, AD-CharCGNN) which extracts a part of the financial article and incorporates both time domain and spatial domain to classify financial texts is proposed. In the study of this paper, we first build a character level vocabulary by reading all characters of the financial dataset, part of each financial text which will be classified is mapped to a high-dimensional spatial vector based on the vocabulary. Then, the vectors are convoluted in the spatial domain to get the text local features, and next, the features are processed by the gated recurrent units to get the features contained time information. Finally, the features which contain spatial and time information will be classified through softmax function to get the text classification results. Our results on the experiments confirm that the network proposed in this paper works effectively with the accuracy of 96.45 & x0025;, and it seems that the text classification algorithm with the feature by taking only partial text part is more suitable for the application of the practice. Meanwhile, for the input with character level vector, the network is not only suitable for Chinese but also for other languages.
引用
收藏
页码:100426 / 100437
页数:12
相关论文
共 50 条
  • [21] A news classification applied with new text representation based on the improved LDA
    Dangguo Shao
    Chengyao Li
    Chusheng Huang
    Yan Xiang
    Zhengtao Yu
    Multimedia Tools and Applications, 2022, 81 : 21521 - 21545
  • [22] Research on Financial Fraud Text Classification Based on PET-BiLSTM
    Zou, Feifei
    Hu, Su
    Yu, Wei
    Yan, Zejun
    Chan, Sijun
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CYBER SECURITY, ARTIFICIAL INTELLIGENCE AND DIGITAL ECONOMY, CSAIDE 2024, 2024, : 341 - 345
  • [23] Concept Pattern Based Text Classification System Development for Georgian Text Based Information Retrieval
    Khachidze, Manana
    Tsintsadze, Magda
    Archuadze, Maia
    Besiashvili, Gela
    BALTIC JOURNAL OF MODERN COMPUTING, 2015, 3 (04): : 307 - 317
  • [24] Chinese News Text Classification based on Attention-based CNN-BiLSTM
    Wang, Meng
    Cai, Qiong
    Wang, Liya
    Li, Jun
    Wang, Xiaoke
    MIPPR 2019: PATTERN RECOGNITION AND COMPUTER VISION, 2020, 11430
  • [25] News Article Text Classification in Indonesian Language
    Wongso, Rini
    Luwinda, Ferdinand Ariandy
    Trisnajaya, Brandon Christian
    Rusli, Olivia
    Rudy
    DISCOVERY AND INNOVATION OF COMPUTER SCIENCE TECHNOLOGY IN ARTIFICIAL INTELLIGENCE ERA, 2017, 116 : 137 - 143
  • [26] Online Mining in Unstructured Financial Information: An Empirical Study in Bulletin News
    Ma, Chao
    Liang, Xun
    2015 12TH INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT (ICSSSM), 2015,
  • [27] Applications of Deep Learning in News Text Classification
    Zhang, Menghan
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [28] Financial news mining:: Monitoring continuous streams of text
    Ingvaldsen, Jon Espen
    Gulla, Jon Atle
    Laegreid, Tarjei
    Sandal, Paul Christian
    2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, (WI 2006 MAIN CONFERENCE PROCEEDINGS), 2006, : 321 - +
  • [29] Text Mining: Sentiment Analysis on news classification
    Gomes, Helder
    Neto, Miguel de Castro
    Henriques, Roberto
    PROCEEDINGS OF THE 2013 8TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI 2013), 2013,
  • [30] Financial Cycle With Text Information Embedding Based on LDA Measurement and Nowcasting
    Li, Peijin
    Peng, Xinyi
    Zhang, Chonghui
    Balezentis, Tomas
    JOURNAL OF ORGANIZATIONAL AND END USER COMPUTING, 2024, 36 (01)