Annotation of Financial Entities Using A Comprehensive Scheme in Turkish

Authors
Adali, Kubra [1 ]
Tantug, A. Cuneyd [1 ]
Affiliations
[1] Istanbul Tech Univ, Bilgisayar Muhendisligi Bolumu, Istanbul, Turkey
Keywords
Annotation; annotation scheme; language resource; named entity recognition; financial information extraction;
DOI
10.1109/SIU55565.2022.9864782
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Information extraction (IE), the task of turning text into structured form, is also employed in the finance domain to extract information that is important for financial concepts such as markets, stocks, and indices. As in many other Natural Language Processing (NLP) applications, annotated corpora containing entities that represent the characteristics of the domain are essential resources for training and evaluating IE models. Unfortunately, creating these resources is difficult, so the scarcity of annotated language resources is one of the most prominent problems for lesser-studied languages, as is the case for Turkish. In this paper, we present an ontology of financial concepts and an effort to produce a high-quality corpus of 500 Turkish news documents annotated with these concepts. We use the dataset to train a baseline entity recognition model, which achieves an F-score of 64.5% on the dataset.
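For readers unfamiliar with how an entity-level F-score is usually computed, the following is a minimal sketch. It is not the authors' evaluation code: the abstract does not state the tagging format or matching criterion, so BIO tags, exact-span matching, and the labels `STOCK` and `INDEX` are all assumptions made purely for illustration.

```python
from typing import List, Set, Tuple

# (start_token, end_token_exclusive, label); a hypothetical span representation
Span = Tuple[int, int, str]


def bio_to_spans(tags: List[str]) -> Set[Span]:
    """Collect entity spans from a BIO tag sequence (assumed format)."""
    spans: Set[Span] = set()
    start, label = None, None
    # A trailing "O" sentinel closes any span still open at the end.
    for i, tag in enumerate(tags + ["O"]):
        continues = tag.startswith("I-") and label == tag[2:]
        if continues:
            continue
        if label is not None:
            spans.add((start, i, label))
        if tag.startswith("B-") or tag.startswith("I-"):
            start, label = i, tag[2:]
        else:
            start, label = None, None
    return spans


def entity_f1(gold: List[str], pred: List[str]) -> float:
    """Exact-match entity-level F1: a prediction counts only if
    its boundaries and label both match a gold entity."""
    gold_spans, pred_spans = bio_to_spans(gold), bio_to_spans(pred)
    true_positives = len(gold_spans & pred_spans)
    precision = true_positives / len(pred_spans) if pred_spans else 0.0
    recall = true_positives / len(gold_spans) if gold_spans else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical labels; the paper's actual entity types are not listed here.
gold = ["B-STOCK", "I-STOCK", "O", "B-INDEX"]
pred = ["B-STOCK", "I-STOCK", "O", "O"]
score = entity_f1(gold, pred)
```

In this toy case the model finds one of the two gold entities and predicts nothing spurious, so precision is 1.0, recall is 0.5, and the F-score is their harmonic mean, 2/3.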
Pages: 4