Contemplata, a Free Platform for Constituency Treebank Annotation

Cited by: 0
Authors
Waszczuk, Jakub [1]
Wang, Ilaine [2,3]
Antoine, Jean-Yves [2]
Halftermeyer, Anais [3]
Affiliations
[1] Heinrich Heine Univ, Dusseldorf, Germany
[2] Univ Tours, LIFAT, Tours, France
[3] Univ Orleans, LIFO, Orleans, France
Keywords
treebank; syntactic annotation; spontaneous speech; French language; constituent trees
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
This paper describes Contemplata, an annotation platform that offers a generic solution for building treebanks and for enriching them with relations between syntactic nodes. Contemplata is dedicated to the annotation of constituency trees. The framework includes support for syntactic parsers, which provide automatic annotations to be manually revised. This balance between automatic parsing and manual revision reduces the annotator workload, which favours data reliability. The paper presents the software architecture of Contemplata, describes its practical use, and finally gives two examples of annotation projects that were conducted on the platform.
Pages: 7222 - 7229
Number of pages: 8
Related papers
50 in total
  • [11] Ensuring annotation consistency and accuracy for Vietnamese treebank
    Nguyen, Quy T.
    Miyao, Yusuke
    Le, Ha T. T.
    Nguyen, Nhung T. H.
    LANGUAGE RESOURCES AND EVALUATION, 2018, 52 (01) : 269 - 315
  • [12] Automatic clause boundary annotation in the Hindi Treebank
    Sharma, Rahul
    Paul, Soma
    Bhat, Riyaz Ahmad
    Jain, Sambhav
    27th Pacific Asia Conference on Language, Information, and Computation, PACLIC 27, 2013 : 499 - 504
  • [13] Annotation of multiword expressions in the Prague dependency treebank
    Eduard Bejček
    Pavel Straňák
    Language Resources and Evaluation, 2010, 44 : 7 - 21
  • [14] A relation-based schema for treebank annotation
    Bosco, C
    Lombardo, V
    AI*IA 2003: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, 2829 : 462 - 473
  • [15] Late Latin Charter Treebank: contents and annotation
    Korkiakangas, Timo
    CORPORA, 2021, 16 (02) : 191 - 203
  • [16] Attribution and its annotation in the Penn Discourse TreeBank
    Prasad, Rashmi
    Dinesh, Nikhil
    Lee, Alan
    Joshi, Aravind
    Webber, Bonnie
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2006, 47 (02) : 43 - 63
  • [17] Deep Syntax Annotation of the Sequoia French Treebank
    Candito, Marie
    Perrier, Guy
    Guillaume, Bruno
    Ribeyre, Corentin
    Fort, Karen
    Seddah, Djame
    de la Clergerie, Eric
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014 : 2298 - 2305
  • [18] Składnica: a constituency treebank of Polish harmonised with the Walenty valency dictionary
    Marcin Woliński
    Elżbieta Hajnicz
    Language Resources and Evaluation, 2021, 55 : 209 - 239
  • [19] Annotation of discourse phenomena in the Prague Dependency Treebank
    Zikanova, Sarka
    Polakova, Lucie
    Jinova, Pavlina
    Nedoluzhko, Anna
    Rysova, Magdalena
    Mirovsky, Jiri
    Hajicova, Eva
    SLOVO A SLOVESNOST, 2015, 76 (03) : 163 - 197
  • [20] Ensuring annotation consistency and accuracy for Vietnamese treebank
    Quy T. Nguyen
    Yusuke Miyao
    Ha T. T. Le
    Nhung T. H. Nguyen
    Language Resources and Evaluation, 2018, 52 : 269 - 315