Contemplata, a Free Platform for Constituency Treebank Annotation

被引:0
|
作者
Waszczuk, Jakub [1 ]
Wang, Ilaine [2 ,3 ]
Antoine, Jean-Yves [2 ]
Halftermeyer, Anais [3 ]
机构
[1] Heinrich Heine Univ, Dusseldorf, Germany
[2] Univ Tours, LIFAT, Tours, France
[3] Univ Orleans, LIFO, Orleans, France
关键词
treebank; syntactic annotation; spontaneous speech; French language; constituent trees;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper describes Contemplata, an annotation platform that offers a generic solution for treebank building as well as treebank enrichment with relations between syntactic nodes. Contemplata is dedicated to the annotation of constituency trees. The framework includes support for syntactic parsers, which provide automatic annotations to be manually revised. The balanced strategy of annotation between automatic parsing and manual revision allows to reduce the annotator workload, which favours data reliability. The paper presents the software architecture of Contemplata, describes its practical use and eventually gives two examples of annotation projects that were conducted on the platform.
引用
收藏
页码:7222 / 7229
页数:8
相关论文
共 50 条
  • [1] Transforming a Constituency Treebank into a Dependency Treebank
    Gelbukh, Alexander
    Torres, Sulema
    Calvo, Hiram
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2005, (35): : 145 - 152
  • [2] Converting an Indonesian Constituency Treebank to the Penn Treebank Format
    Arwidarasti, Jessica Naraiswari
    Alfina, Ika
    Krisnadhi, Adila Alfa
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2019, : 331 - 336
  • [3] Constructing a Turkish Constituency Parse TreeBank
    Yildiz, Olcay Taner
    Solak, Ercan
    Candir, Semsinur
    Ehsani, Razieh
    Gorgun, Onur
    INFORMATION SCIENCES AND SYSTEMS 2015, 2016, 363 : 339 - 347
  • [4] The Annotation Scheme for Uyghur Dependency Treebank
    Mamitimin, Samat
    Ibrahim, Turgun
    Eli, Marhaba
    2013 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2013), 2013, : 185 - 188
  • [5] Annotation of grammatical function in the Persian treebank
    Pouramini, Ahmad
    Moridi, Elham
    4TH INTERNATIONAL CONFERENCE OF COGNITIVE SCIENCE, 2012, 32 : 302 - 307
  • [6] Sense annotation in the penn discourse treebank
    Miltsakaki, Eleni
    Robaldo, Livio
    Lee, Alan
    Joshi, Aravind
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2008, 4919 : 275 - +
  • [7] A dependency annotation scheme for Bangla treebank
    Sanjay Chatterji
    Tanaya Mukherjee Sarkar
    Pragati Dhang
    Samhita Deb
    Sudeshna Sarkar
    Jayshree Chakraborty
    Anupam Basu
    Language Resources and Evaluation, 2014, 48 : 443 - 477
  • [8] A dependency annotation scheme for Bangla treebank
    Chatterji, Sanjay
    Sarkar, Tanaya Mukherjee
    Dhang, Pragati
    Deb, Samhita
    Sarkar, Sudeshna
    Chakraborty, Jayshree
    Basu, Anupam
    LANGUAGE RESOURCES AND EVALUATION, 2014, 48 (03) : 443 - 477
  • [9] Skladnica: a constituency treebank of Polish harmonised with the Walenty valency dictionary
    Wolinski, Marcin
    Hajnicz, Elzbieta
    LANGUAGE RESOURCES AND EVALUATION, 2021, 55 (01) : 209 - 239
  • [10] Challenges and Solutions for Consistent Annotation of Vietnamese Treebank
    Nguyen, Quy T.
    Miyao, Yusuke
    Le, Ha T. T.
    Nguyen, Ngan L. T.
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 1532 - 1539