Contemplata, a Free Platform for Constituency Treebank Annotation

被引:0
|
作者
Waszczuk, Jakub [1 ]
Wang, Ilaine [2 ,3 ]
Antoine, Jean-Yves [2 ]
Halftermeyer, Anais [3 ]
机构
[1] Heinrich Heine Univ, Dusseldorf, Germany
[2] Univ Tours, LIFAT, Tours, France
[3] Univ Orleans, LIFO, Orleans, France
关键词
treebank; syntactic annotation; spontaneous speech; French language; constituent trees;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper describes Contemplata, an annotation platform that offers a generic solution for treebank building as well as treebank enrichment with relations between syntactic nodes. Contemplata is dedicated to the annotation of constituency trees. The framework includes support for syntactic parsers, which provide automatic annotations to be manually revised. The balanced strategy of annotation between automatic parsing and manual revision allows to reduce the annotator workload, which favours data reliability. The paper presents the software architecture of Contemplata, describes its practical use and eventually gives two examples of annotation projects that were conducted on the platform.
引用
收藏
页码:7222 / 7229
页数:8
相关论文
共 50 条
  • [21] Annotation of multiword expressions in the Prague dependency treebank
    Bejcek, Eduard
    Stranak, Pavel
    LANGUAGE RESOURCES AND EVALUATION, 2010, 44 (1-2) : 7 - 21
  • [22] Projection-based Annotation of a Polish Dependency Treebank
    Wroblewska, Alina
    Przepiorkowski, Adam
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 2306 - 2312
  • [23] Syntactic Annotation Guidelines for the Quranic Arabic Dependency Treebank
    Dukes, Kais
    Atwell, Eric
    Sharaf, Abdul-Baquee M.
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 1822 - 1827
  • [24] Prague Dependency Treebank Annotation Errors A Preliminary Analysis
    Kovar, Vojtech
    Jakubicek, Milos
    RASLAN 2009: RECENT ADVANCES IN SLAVONIC NATURAL LANGUAGE PROCESSING, 2009, : 101 - 108
  • [25] Consistent and Flexible Integration of Morphological Annotation in the Arabic Treebank
    Kulick, Seth
    Bies, Ann
    Maamouri, Mohamed
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 1499 - 1506
  • [26] A dependency-based analysis of treebank annotation errors
    Haverinen, Katri
    Ginter, Filip
    Laippala, Veronika
    Kohonen, Samuel
    Viljanen, Timo
    Nyblom, Jenna
    Salakoski, Tapio
    1600, IOS Press BV (258): : 47 - 61
  • [27] The Procedure of Lexico-Semantic Annotation of Skladnica Treebank
    Hajnicz, Elzbieta
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 2290 - 2297
  • [28] Dependency structure annotation in the IULA Spanish LSP Treebank
    Montserrat Marimon
    Núria Bel
    Language Resources and Evaluation, 2015, 49 : 433 - 454
  • [29] Dependency structure annotation in the IULA Spanish LSP Treebank
    Marimon, Montserrat
    Bel, Nuria
    LANGUAGE RESOURCES AND EVALUATION, 2015, 49 (02) : 433 - 454
  • [30] Deriving Enhanced Universal Dependencies from a Hybrid Dependency-Constituency Treebank
    Pretkalnina, Lauma
    Rituma, Laura
    Saulite, Baiba
    TEXT, SPEECH, AND DIALOGUE (TSD 2018), 2018, 11107 : 95 - 105