A Neural Joint Model with BERT for Burmese Syllable Segmentation, Word Segmentation, and POS Tagging

被引:4
|
作者
Mao, Cunli [1 ]
Man, Zhibo [1 ]
Yu, Zhengtao [1 ]
Gao, Shengxiang [1 ]
Wang, Zhenhan [1 ]
Wang, Hongbin [1 ]
机构
[1] Kunming Univ Sci & Technol, Key Lab Artificial Intelligence Informat Engn & A, Kunming, Yunnan, Peoples R China
基金
中国国家自然科学基金;
关键词
Burmese; word segmentation; POS tagging; joint training; BiLSTM-CRF; BERT;
D O I
10.1145/3436818
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The smallest semantic unit of the Burmese language is called the syllable. In the present study, it is intended to propose the first neural joint learning model for Burmese syllable segmentation, word segmentation, and part-of-speech (POS) tagging with the BERT. The proposed model alleviates the error propagation problem of the syllable segmentation. More specifically, it extends the neural joint model for Vietnamese word segmentation, POS tagging, and dependency parsing [28] with the pre-training method of the Burmese character, syllable, and word embedding with BiLSTM-CRF-based neural layers. In order to evaluate the performance of the proposed model, experiments are carried out on Burmese benchmark datasets, and we fine-tune the model of multilingual BERT. Obtained results show that the proposed joint model can result in an excellent performance.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] A Simple and Effective Neural Model for Joint Word Segmentation and POS Tagging
    Zhang, Meishan
    Yu, Nan
    Fu, Guohong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1528 - 1538
  • [2] An Effective Joint Model for Chinese Word Segmentation and POS Tagging
    Wang, Heng-Jun
    Si, Nian-Wen
    Chen, Cheng
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION PROCESSING (ICIIP'16), 2016,
  • [3] Joint Word Segmentation, POS-Tagging and Syntactic Chunking
    Lyu, Chen
    Zhang, Yue
    Ji, Donghong
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 3007 - 3014
  • [4] A Fine-Grained Domain Adaption Model for Joint Word Segmentation and POS Tagging
    Jiang, Peijie
    Long, Dingkun
    Sun, Yueheng
    Zhang, Meishan
    Xu, Guangwei
    Xie, Pengjun
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 3587 - 3598
  • [5] A Unified Model for Joint Chinese Word Segmentation and POS Tagging with Heterogeneous Annotation Corpora
    Zhao, Jiayi
    Qiu, Xipeng
    Huang, Xuanjing
    2013 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2013), 2013, : 227 - 230
  • [6] Study on Tibetan Word Segmentation as Syllable Tagging
    Li, Yachao
    Yu, Hongzhi
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2013, 2013, 400 : 363 - 369
  • [7] A hybrid approach to word segmentation and POS tagging
    Oki Electric Industry Co., Ltd., 2−5−7 Honmachi, Chuo-ku, Osaka
    541−0053, Japan
    不详
    619−0289, Japan
    Proc. Annu. Meet. Assoc. Comput Linguist., 1600, (217-220):
  • [8] Joint Chinese word segmentation and POS tagging system with undirected graphical models
    Zhu C.-H.
    Zhao T.-J.
    Zheng D.-Q.
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2010, 32 (03): : 700 - 704
  • [9] Bidirectional Deep Learning of Context Representation for Joint Word Segmentation and POS Tagging
    Boonkwan, Prachya
    Supnithi, Thepchai
    ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING, ICCSAMA 2017, 2018, 629 : 184 - 196
  • [10] Word segmentation and POS tagging for Chinese keyphrase extraction
    Huang, XC
    Chen, J
    Yan, PL
    Luo, X
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 364 - 369