Late Latin Charter Treebank: contents and annotation

Cited by: 2
Author
Korkiakangas, Timo [1]
Affiliation
[1] Univ Helsinki, POB A215,Unioninkatu 40, Helsinki 00014, Finland
Keywords
charter; Early Middle Ages; Italy; Latin; philology; treebank
DOI
10.3366/cor.2021.0217
Chinese Library Classification (CLC) number
H0 [Linguistics];
Discipline classification code
030303 ; 0501 ; 050102 ;
Abstract
This paper describes the construction and annotation of the Late Latin Charter Treebank, a set of three dependency treebanks (LLCT1, LLCT2 and LLCT3) which together contain 1,261 Early Medieval Latin documentary texts (i.e., original charters) written in Italy between AD 714 and 1000 (about 594,000 tokens). The paper focusses on matters which a linguistically or philologically inclined user of LLCT needs to know: the criteria on which the charters were selected, the special characteristics of the annotation types utilised, and the geographical and chronological distribution of the data. In addition to normal queries on forms, lemmas, morphology and syntax, complex philological research settings are enabled by the textual annotation layer of LLCT, which indicates abbreviated and damaged words, as well as the formulaic and non-formulaic passages of each charter.
Pages: 191-203
Number of pages: 13