BKTreebank: Building a Vietnamese Dependency Treebank

被引:0
|
作者
Kiem-Hieu Nguyen [1 ]
机构
[1] Hanoi Univ Sci & Technol, Sch Informat & Commun Technol, 1 Dai Co Viet, Hanoi, Vietnam
关键词
treebank; dependency parsing; POS tagging; word segmentation; Vietnamese; less-resourced language;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Dependency treebank is an important resource in any language. In this paper, we present our work on building BKTreebank, a dependency treebank for Vietnamese. Important points on designing POS tagset, dependency relations, and annotation guidelines are discussed. We describe experiments on POS tagging and dependency parsing on the treebank. Experimental results show that the treebank is a useful resource for Vietnamese language processing.
引用
收藏
页码:2164 / 2168
页数:5
相关论文
共 50 条
  • [1] Building a Treebank for Vietnamese Dependency Parsing
    Luong Nguyen Thi
    Linh Ha My
    Hung Nguyen Viet
    Huyen Nguyen Thi Minh
    Phuong Le Hong
    PROCEEDINGS OF 2013 IEEE RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES: RESEARCH, INNOVATION, AND VISION FOR THE FUTURE (RIVF), 2013, : 147 - 151
  • [2] Building Vietnamese Dependency Treebank Based on Chinese-Vietnamese Bilingual Word Alignment
    Li, Ying
    Guo, Jianyi
    Yu, Zhengtao
    Wang, Hongbin
    Wen, Yonghua
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1330 - 1335
  • [3] Building the Croatian Dependency Treebank: the initial stages
    Tadic, Marko
    SUVREMENA LINGVISTIKA, 2007, 63 (01): : 85 - 92
  • [4] Building the essential resources for Finnish: the Turku Dependency Treebank
    Katri Haverinen
    Jenna Nyblom
    Timo Viljanen
    Veronika Laippala
    Samuel Kohonen
    Anna Missilä
    Stina Ojala
    Tapio Salakoski
    Filip Ginter
    Language Resources and Evaluation, 2014, 48 : 493 - 531
  • [5] Building the essential resources for Finnish: the Turku Dependency Treebank
    Haverinen, Katri
    Nyblom, Jenna
    Viljanen, Timo
    Laippala, Veronika
    Kohonen, Samuel
    Missila, Anna
    Ojala, Stina
    Salakoski, Tapio
    Ginter, Filip
    LANGUAGE RESOURCES AND EVALUATION, 2014, 48 (03) : 493 - 531
  • [6] Transforming a Constituency Treebank into a Dependency Treebank
    Gelbukh, Alexander
    Torres, Sulema
    Calvo, Hiram
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2005, (35): : 145 - 152
  • [7] The Alpino Dependency Treebank
    van der Beek, L
    Bouma, G
    Malouf, R
    van Noord, G
    COMPUTATIONAL LINGUISTICS IN THE NETHERLANDS 2001, 2002, (45): : 8 - 22
  • [8] The Norwegian Dependency Treebank
    Solberg, Per Erik
    Skjaerholt, Arne
    Ovrelid, Lilja
    Hagen, Kristin
    Johannessen, Janne Bondi
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 789 - 795
  • [9] Hungarian Dependency Treebank
    Vincze, Veronika
    Szauter, Dora
    Almasi, Attila
    Mora, Gyoergy
    Alexin, Zoltan
    Csirik, Janos
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 1855 - 1862
  • [10] Building an Ellipsis-aware Chinese Dependency Treebank for Web Text
    Ren, Xuancheng
    Sun, Xu
    Wen, Ji
    Wei, Bingzhen
    Zhan, Weidong
    Zhang, Zhiyuan
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 1749 - 1754