Contemplata, a Free Platform for Constituency Treebank Annotation

被引:0
|
作者
Waszczuk, Jakub [1 ]
Wang, Ilaine [2 ,3 ]
Antoine, Jean-Yves [2 ]
Halftermeyer, Anais [3 ]
机构
[1] Heinrich Heine Univ, Dusseldorf, Germany
[2] Univ Tours, LIFAT, Tours, France
[3] Univ Orleans, LIFO, Orleans, France
关键词
treebank; syntactic annotation; spontaneous speech; French language; constituent trees;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper describes Contemplata, an annotation platform that offers a generic solution for treebank building as well as treebank enrichment with relations between syntactic nodes. Contemplata is dedicated to the annotation of constituency trees. The framework includes support for syntactic parsers, which provide automatic annotations to be manually revised. The balanced strategy of annotation between automatic parsing and manual revision allows to reduce the annotator workload, which favours data reliability. The paper presents the software architecture of Contemplata, describes its practical use and eventually gives two examples of annotation projects that were conducted on the platform.
引用
收藏
页码:7222 / 7229
页数:8
相关论文
共 50 条
  • [41] The Construction of Interactive Environment for Sentence Pattern Structure Based Treebank Annotation
    Guan, Shiyu
    Peng, Weiming
    Song, Jihua
    Xu, Zhiping
    CHINESE LEXICAL SEMANTICS (CLSW 2019), 2020, 11831 : 753 - 763
  • [42] Semi-automatic Korean FrameNet Annotation over KAIST Treebank
    Hahm, Younggyun
    Kwon, Sunggoo
    Kim, Jiseong
    Choi, Key-Sun
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 83 - 87
  • [43] From Speech to Trees: Applying Treebank Annotation to Arabic Broadcast News
    Maamouri, Mohamed
    Bies, Ann
    Kulick, Seth
    Zaghouani, Wajdi
    Graff, David
    Ciul, Michael
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 2117 - 2122
  • [44] Extending the TuBa-D/Z Treebank with GermaNet Sense Annotation
    Henrich, Verena
    Hinrichs, Erhard
    LANGUAGE PROCESSING AND KNOWLEDGE IN THE WEB, 2013, 8105 : 89 - 96
  • [45] Analysis of Typical Annotation Problems in Bilingual Case Grammar Treebank Construction
    Zan, Hongying
    Chen, Wanli
    Zhang, Kunli
    Jia, Yuxiang
    CHINESE LEXICAL SEMANTICS (CLSW 2015), 2015, 9332 : 524 - 534
  • [46] Post-annotation checking of Prague Dependency Treebank 2.0 data
    Stepanek, Jan
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2006, 4188 : 277 - 284
  • [47] Analyzing Text Coherence via Multiple Annotation in the Prague Dependency Treebank
    Rysova, Katerina
    Rysova, Magdalena
    TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 71 - 79
  • [48] Constructions in Latvian Treebank: the Impact of Annotation Decisions on the Dependency Parsing Performance
    Pretkalnina, Lauma
    Rituma, Laura
    HUMAN LANGUAGE TECHNOLOGIES - THE BALTIC PERSPECTIVE, BALTIC HLT 2014, 2014, 268 : 219 - 226
  • [49] Enhancing the Arabic Treebank: A Collaborative Effort toward New Annotation Guidelines
    Maamouri, Mohamed
    Bies, Ann
    Kulick, Seth
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 3192 - 3196
  • [50] Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing
    Agic, Zeljko
    Berovic, Dasa
    Merkler, Danijela
    Tadic, Marko
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 2313 - 2319