Chinese Financial Event Extraction Base on Syntactic and Semantic Dependency Parsing

被引:0
|
作者
Wan Q.-Z. [1 ,3 ]
Wan C.-X. [1 ,3 ]
Hu R. [2 ,3 ]
Liu D.-X. [1 ,3 ]
机构
[1] School of Information Technology, Jiangxi University of Finance and Economics, Nanchang
[2] School of Software and Internet of Things Engineering, Jiangxi University of Finance and Economics, Nanchang
[3] Jiangxi Key Laboratory of Data and Knowledge Engineering, Jiangxi University of Finance and Economics, Nanchang
来源
基金
中国国家自然科学基金;
关键词
Chinese event extraction; Core verb chain; Default complement; Event semantics relevance; Syntactic semantic dependency parsing graph;
D O I
10.11897/SP.J.1016.2021.00508
中图分类号
学科分类号
摘要
As a sub-task of information extraction, event extraction plays an important role in nature language process applications, such as stock market trend forecast, which can provide strong clues for events users, e.g. investors, managers and government, to analyze the market and make decisions. At present, most of the studies about event extraction pay more attention to the type correctness of triggers and arguments, and not consider the effect and value of event extraction based on application requirements. We call this type of event extraction traditional event extraction. The event types and standards in traditional event extraction are derived from ACE2005 containing 8 categories and 33 sub-categories, KBP2015 and ERE, et al. However, there are some limitations in application of them to event extraction in specific financial domain. For example, there is not the overweight event type in ACE2005, which is a special behavior in the financial domain. In this paper, we focus on the financial news and extract open events without types. In the field of finance and economics, most event users are more concerned with the objects and actions that events affect. Therefore, combined with the application requirement, we propose to extract the financial event ET(Sub, Pred, Obj), where Sub, Pred and Obj represent subject, predicate and object respectively. However, Chinese financial news generally suffers from the event nesting and component default problem, which result in event omission and key element missing of events. To tackle this issue, with the expression habits and characteristics of Chinese linguistics, we build a Chinese event extraction framework based on syntactic and semantic dependency parsing. Then summarize four common default structures and design corresponding completion rules. In particular, at the beginning of this paper, we summarize four prominent phenomena in the extraction of events from the headlines of financial news, and explore the cause of these problems, no in-depth analyzing the relevance of syntactic and semantic structure or lack of it. After that, we employ the syntactic dependency parsing tree and lexical structure, and propose the core verb chains, which make sure that each core verb corresponds to an event solving event leakage problem. Thirdly, we add semantic dependency relation between events on the basis of syntactic dependency tree, which is called Syntactic Semantic Dependency Parsing (SSDP) tree. In order to better separate the detected events and their properties, we adjust and optimize SSDP tree to form the SSDP graph, where the word nodes of the same syntactic structure are at the same level, providing a way for subsequent event extraction. Fourthly, with the division of default structure in linguistic, we summarize four common default structures and propose ten corresponding completion rules to solve the problem of component default. Meanwhile, the whole Chinese event extraction algorithm based SSDP graph is shown at the end of the section. Finally, this paper depicts a detailed experimental situation. The experimental dataset, labeling standard and evaluation index are given. Subsequently, the method in this paper is verified on two datasets, financial news titles and common field news titles. At the end, we conduct comprehensive benchmarks on Chinese financial news titles and CoNLL2009 Chinese Corpus. The experimental results show that the proposed methods are effective. © 2021, Science Press. All right reserved.
引用
收藏
页码:508 / 530
页数:22
相关论文
共 40 条
  • [1] Ding X, Zhang Y, Liu T, Et al., Using structured events to predict stock price movement: An empirical investigation, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing(EMNLP), pp. 1415-1425, (2014)
  • [2] Ding X, Zhang Y, Liu T, Et al., Deep learning for event-driven stock prediction, Proceedings of the 24th International Joint Conference on Artificial Intelligence(IJCAI), pp. 2327-2333, (2015)
  • [3] Ding X, Zhang Y, Liu T, Et al., Knowledge-driven event embedding for stock prediction, Proceedings of the 26th International Conference on Computational Linguistics: Technical Papers(COLING), pp. 2133-2142, (2016)
  • [4] Xie B, Passonneau R, Wu L, Et al., Semantic frames to predict stock price movement, Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics(ACL), pp. 873-883, (2013)
  • [5] Aguilar J, Beller C, McNamee P, Et al., A comparison of the events and relations across ACE, ERE, TAC-KBP, and FrameNet annotation standards, Proceedings of the 2nd Workshop on EVENTS: Definition, Detection, Coreference, and Representation, pp. 45-53, (2014)
  • [6] Jacobs G, Lefever E, Hoste V., Economic event detection in company-specific news text, Proceedings of the 1st Workshop on Economics and Natural Language Processing (ACL), pp. 1-10, (2018)
  • [7] Yang H, Chen Y, Liu K, Et al., DCFEE: A document-level Chinese financial event extraction system based on automatically labeled training data, Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics-System Demonstrations(ACL), pp. 1-6, (2018)
  • [8] Li Peng-Feng, Zhou Guo-Dong, Zhu Qiao-Ming, Semantics-based joint model of Chinese event trigger extraction, Journal of Software, 27, 2, pp. 280-294, (2016)
  • [9] Yeh C L, Chen Y C., Zero anaphora resolution in Chinese with shallow parsing, Journal of Chinese Language and Computing, 17, 1, pp. 41-56, (2007)
  • [10] Li P, Zhu Q, Zhou G., Argument inference from relevant event mentions in Chinese argument extraction, Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics(ACL), pp. 1477-1487, (2013)