BKTreebank: Building a Vietnamese Dependency Treebank

被引:0
|
作者
Kiem-Hieu Nguyen [1 ]
机构
[1] Hanoi Univ Sci & Technol, Sch Informat & Commun Technol, 1 Dai Co Viet, Hanoi, Vietnam
关键词
treebank; dependency parsing; POS tagging; word segmentation; Vietnamese; less-resourced language;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Dependency treebank is an important resource in any language. In this paper, we present our work on building BKTreebank, a dependency treebank for Vietnamese. Important points on designing POS tagset, dependency relations, and annotation guidelines are discussed. We describe experiments on POS tagging and dependency parsing on the treebank. Experimental results show that the treebank is a useful resource for Vietnamese language processing.
引用
收藏
页码:2164 / 2168
页数:5
相关论文
共 50 条
  • [31] A Dependency Treebank of the Chinese Buddhist Canon
    Wong, Tak-sum
    Lee, John
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 1679 - 1683
  • [32] The Persian Dependency Treebank Made Universal
    Rasooli, Mohammad Sadegh
    Safari, Pegah
    Moloodi, Amirsaeid
    Nourian, Alireza
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 7078 - 7087
  • [33] Camel Treebank: An Open Multi-genre Arabic Dependency Treebank
    Habash, Nizar
    AbuOdeh, Muhammed
    Taji, Dima
    Faraj, Reem
    El Gizuli, Jamila
    Kallas, Omar
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 2672 - 2681
  • [34] Automatic Detection of Problematic Rules in Vietnamese Treebank
    Hong-Quan Nguyen
    Phuong-Thai Nguyen
    Thanh-Quyen Dang
    Van-Hiep Nguyen
    2015 IEEE RIVF INTERNATIONAL CONFERENCE ON COMPUTING & COMMUNICATION TECHNOLOGIES - RESEARCH, INNOVATION, AND VISION FOR THE FUTURE (RIVF), 2015, : 13 - 18
  • [35] Development of Traditional Mongolian Dependency Treebank
    Su, Xiangdong
    Gao, Guanglai
    Yan, Xueliang
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA, 2013, 8208 : 247 - 256
  • [36] A dependency treebank of Chinese Buddhist texts
    Lee, John
    Kong, Yin Hei
    DIGITAL SCHOLARSHIP IN THE HUMANITIES, 2016, 31 (01) : 140 - 151
  • [37] Chinese dependency parsing based on Treebank
    Liu, Hai-Tao
    Zhao, Yi-Yi
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2009, 22 (01): : 17 - 21
  • [38] A dependency annotation scheme for Bangla treebank
    Chatterji, Sanjay
    Sarkar, Tanaya Mukherjee
    Dhang, Pragati
    Deb, Samhita
    Sarkar, Sudeshna
    Chakraborty, Jayshree
    Basu, Anupam
    LANGUAGE RESOURCES AND EVALUATION, 2014, 48 (03) : 443 - 477
  • [39] Ensuring annotation consistency and accuracy for Vietnamese treebank
    Quy T. Nguyen
    Yusuke Miyao
    Ha T. T. Le
    Nhung T. H. Nguyen
    Language Resources and Evaluation, 2018, 52 : 269 - 315
  • [40] Automatic functor assignment in the Prague Dependency Treebank
    Zabokrtsky, Z
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 45 - 50