Annotation of Financial Entities Using A Comprehensive Scheme in Turkish

被引:0
|
作者
Adali, Kubra [1 ]
Tantug, A. Cuneyd [1 ]
机构
[1] Istanbul Tech Univ, Bilgisayar Muhendisligi Bolumu, Istanbul, Turkey
关键词
Annotation; annotation scheme; language resource; named entity recognition; financial information extraction;
D O I
10.1109/SIU55565.2022.9864782
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Information extraction (IE) which refers to the task of turning texts into structured form is also employed in finance domain for extraction of information which have a big importance for different financial concepts such as market, stock, and indices etc. As many other applications in Natural Language Processing(NLP), annotated corpora which involves entities, that represent characteristics of the related domain, is also essential resources for training and evaluation of IE models. Unfortunately, the creation of these resources is rather thorny, thus the scarcity of annotated language resources is one of the most prominent problems for lesser-studied language; as in the case for Turkish. In this paper, we present an ontology of financial concepts, and an effort to produce a high-quality corpus which includes 500 news documents annotated with these concepts in Turkish. We employ the dataset in the training of a baseline entity recognition model, and performance achieved over the dataset is 64.5% F-scores.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Annotation of specialized corpora using a comprehensive entity and relation scheme
    Deleger, Louise
    Ligozat, Anne-Laure
    Grouin, Cyril
    Zweigenbaum, Pierre
    Neveol, Aurelie
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 1267 - 1274
  • [2] Annotation Scheme and Specification for Named Entities and Relations on Chinese Medical Knowledge Graph
    Yue, Donghui
    Zhang, Kunli
    Zhuang, Lei
    Zhao, Xu
    Byambasuren, Odmaa
    Zan, Hongying
    CHINESE LEXICAL SEMANTICS (CLSW 2019), 2020, 11831 : 563 - 574
  • [3] Using comprehensive entities for product planning
    Ito, S
    Kusumoto, R
    Numata, J
    2005 IEEE International Engineering Management Conference, Vols 1 and 2, 2005, : 637 - 639
  • [4] Constructing a WordNet for Turkish Using Manual and Automatic Annotation
    Ehsani, Razieh
    Solak, Ercan
    Yildiz, Olcay Taner
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2018, 17 (03)
  • [5] TEI-friendly annotation scheme for medieval named entities: a case on a Spanish medieval corpus
    Alvarez-Mellado, Elena
    Diez-Platas, Maria Luisa
    Ruiz-Fabo, Pablo
    Bermudez, Helena
    Ros, Salvador
    Gonzalez-Blanco, Elena
    LANGUAGE RESOURCES AND EVALUATION, 2021, 55 (02) : 525 - 549
  • [6] TEI-friendly annotation scheme for medieval named entities: a case on a Spanish medieval corpus
    Elena Álvarez-Mellado
    María Luisa Díez-Platas
    Pablo Ruiz-Fabo
    Helena Bermúdez
    Salvador Ros
    Elena González-Blanco
    Language Resources and Evaluation, 2021, 55 : 525 - 549
  • [7] Towards a double annotation of Named Entities
    Ehrmann, Maud
    Jacquet, Guillaume
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2006, 47 (03): : 63 - 88
  • [8] Temporal Role Annotation for Named Entities
    Koutraki, Maria
    Bakhshandegan-Moghaddam, Farshad
    Sack, Harald
    PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON SEMANTIC SYSTEMS, 2018, 137 : 223 - 234
  • [9] Relational Turkish Text Classification Using Distant Supervised Entities and Relations
    Okur, Halil Ibrahim
    Tohma, Kadir
    Sertbas, Ahmet
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (02): : 2209 - 2228
  • [10] Fast Linking of Mathematical Wikidata Entities in Wikipedia Articles Using Annotation Recommendation
    Scharpf, Philipp
    Schubotz, Moritz
    Gipp, Bela
    WEB CONFERENCE 2021: COMPANION OF THE WORLD WIDE WEB CONFERENCE (WWW 2021), 2021, : 602 - 609