BKTreebank: Building a Vietnamese Dependency Treebank

Cited by: 0
Author
Kiem-Hieu Nguyen [1]
Affiliation
[1] Hanoi Univ Sci & Technol, Sch Informat & Commun Technol, 1 Dai Co Viet, Hanoi, Vietnam
Keywords
treebank; dependency parsing; POS tagging; word segmentation; Vietnamese; less-resourced language
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
A dependency treebank is an important resource for any language. In this paper, we present our work on building BKTreebank, a dependency treebank for Vietnamese. We discuss key points in designing the POS tagset, the dependency relations, and the annotation guidelines. We also describe experiments on POS tagging and dependency parsing on the treebank. Experimental results show that the treebank is a useful resource for Vietnamese language processing.
Pages: 2164-2168
Page count: 5