Late Latin Charter Treebank: contents and annotation

被引:2
|
作者
Korkiakangas, Timo [1 ]
机构
[1] Univ Helsinki, POB A215,Unioninkatu 40, Helsinki 00014, Finland
关键词
charter; Early Middle Ages; Italy; Latin; philology; treebank;
D O I
10.3366/cor.2021.0217
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
This paper describes the construction and annotation of the Late Latin Charter Treebank, a set of three dependency treebanks (LLCT1, LLCT2 and LLCT3) which together contain 1,261 Early Medieval Latin documentary texts (i.e., original charters) written in Italy between AD 714 and 1000 (about 594,000 tokens). The paper focusses on matters which a linguistically or philologically inclined user of LLCT needs to know: the criteria on which the charters were selected, the special characteristics of the annotation types utilised, and the geographical and chronological distribution of the data. In addition to normal queries on forms, lemmas, morphology and syntax, complex philological research settings are enabled by the textual annotation layer of LLCT, which indicates abbreviated and damaged words, as well as the formulaic and non-formulaic passages of each charter.
引用
收藏
页码:191 / 203
页数:13
相关论文
共 50 条
  • [1] The annotation guidelines of the Latin Dependency Treebank and Index Thomisticus Treebank The treatment of some specific syntactic constructions in Latin
    Bamman, David
    Passarotti, Marco
    Busa, Roberto
    Crane, Gregory
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 71 - 76
  • [2] The Annotation Scheme for Uyghur Dependency Treebank
    Mamitimin, Samat
    Ibrahim, Turgun
    Eli, Marhaba
    2013 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2013), 2013, : 185 - 188
  • [3] Annotation of grammatical function in the Persian treebank
    Pouramini, Ahmad
    Moridi, Elham
    4TH INTERNATIONAL CONFERENCE OF COGNITIVE SCIENCE, 2012, 32 : 302 - 307
  • [4] Sense annotation in the penn discourse treebank
    Miltsakaki, Eleni
    Robaldo, Livio
    Lee, Alan
    Joshi, Aravind
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2008, 4919 : 275 - +
  • [5] A dependency annotation scheme for Bangla treebank
    Sanjay Chatterji
    Tanaya Mukherjee Sarkar
    Pragati Dhang
    Samhita Deb
    Sudeshna Sarkar
    Jayshree Chakraborty
    Anupam Basu
    Language Resources and Evaluation, 2014, 48 : 443 - 477
  • [6] A dependency annotation scheme for Bangla treebank
    Chatterji, Sanjay
    Sarkar, Tanaya Mukherjee
    Dhang, Pragati
    Deb, Samhita
    Sarkar, Sudeshna
    Chakraborty, Jayshree
    Basu, Anupam
    LANGUAGE RESOURCES AND EVALUATION, 2014, 48 (03) : 443 - 477
  • [7] Challenges and Solutions for Consistent Annotation of Vietnamese Treebank
    Nguyen, Quy T.
    Miyao, Yusuke
    Le, Ha T. T.
    Nguyen, Ngan L. T.
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 1532 - 1539
  • [8] Ensuring annotation consistency and accuracy for Vietnamese treebank
    Nguyen, Quy T.
    Miyao, Yusuke
    Le, Ha T. T.
    Nguyen, Nhung T. H.
    LANGUAGE RESOURCES AND EVALUATION, 2018, 52 (01) : 269 - 315
  • [9] Automatic clause boundary annotation in the Hindi Treebank
    Sharma, Rahul
    Paul, Soma
    Bhat, Riyaz Ahmad
    Jain, Sambhav
    27th Pacific Asia Conference on Language, Information, and Computation, PACLIC 27, 2013, : 499 - 504
  • [10] Annotation of multiword expressions in the Prague dependency treebank
    Eduard Bejček
    Pavel Straňák
    Language Resources and Evaluation, 2010, 44 : 7 - 21