BKTreebank: Building a Vietnamese Dependency Treebank

Authors
Kiem-Hieu Nguyen [1]
Affiliations
[1] Hanoi University of Science and Technology, School of Information and Communication Technology, 1 Dai Co Viet, Hanoi, Vietnam
Keywords
treebank; dependency parsing; POS tagging; word segmentation; Vietnamese; less-resourced language
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
A dependency treebank is an important resource for any language. In this paper, we present our work on building BKTreebank, a dependency treebank for Vietnamese. We discuss key points in designing the POS tagset, the dependency relations, and the annotation guidelines. We describe experiments on POS tagging and dependency parsing on the treebank. Experimental results show that the treebank is a useful resource for Vietnamese language processing.
Pages: 2164-2168
Page count: 5