Late Latin Charter Treebank: contents and annotation

Authors
Korkiakangas, Timo [1]
Affiliation
[1] Univ Helsinki, POB A215, Unioninkatu 40, Helsinki 00014, Finland
Keywords
charter; Early Middle Ages; Italy; Latin; philology; treebank
DOI
10.3366/cor.2021.0217
Chinese Library Classification number
H0 [Linguistics]
Discipline classification codes
030303; 0501; 050102
Abstract
This paper describes the construction and annotation of the Late Latin Charter Treebank, a set of three dependency treebanks (LLCT1, LLCT2 and LLCT3) which together contain 1,261 Early Medieval Latin documentary texts (i.e., original charters) written in Italy between AD 714 and 1000 (about 594,000 tokens). The paper focusses on matters which a linguistically or philologically inclined user of LLCT needs to know: the criteria on which the charters were selected, the special characteristics of the annotation types utilised, and the geographical and chronological distribution of the data. In addition to normal queries on forms, lemmas, morphology and syntax, complex philological research settings are enabled by the textual annotation layer of LLCT, which indicates abbreviated and damaged words, as well as the formulaic and non-formulaic passages of each charter.
Pages: 191-203
Page count: 13