Study on Tibetan Word Segmentation as Syllable Tagging

Authors
Li, Yachao [1 ]
Yu, Hongzhi [1 ]
Affiliations
[1] Northwest Univ Nationalities, Key Lab Chinese Natl Linguist Informat Technol, Lanzhou 730030, Peoples R China
Keywords
Tibetan; word segmentation; sequence label;
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202 ;
Abstract
Tibetan word segmentation (TWS) is a basic problem in Tibetan natural language processing. This paper reformulates segmentation as a syllable tagging problem and studies the performance of TWS with different sequence labeling models. Experimental results show that the TWS system based on a conditional random field achieves the best performance with the current 4-tag set, while the other models also achieve good results. Together, these results show that treating segmentation as a syllable tagging problem is an efficient approach to TWS.
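The syllable tagging formulation can be illustrated with a minimal sketch. The abstract mentions a 4-tag set but does not name the tags; the sketch assumes the common B/M/E/S scheme (begin, middle, end, single-syllable word), which may differ from the one the paper uses. The function names and the placeholder syllables "s1" to "s6" are hypothetical and stand in for real Tibetan syllables. No tagging model is included: the code only shows how a segmented sentence maps to syllable tags and how predicted tags map back to words.

```python
from typing import List

# Assumed 4-tag set (not confirmed by the abstract):
# B = first syllable of a multi-syllable word, M = middle syllable,
# E = last syllable, S = single-syllable word.

def words_to_tags(words: List[List[str]]) -> List[str]:
    """Convert a segmented sentence (list of words, each a list of syllables)
    into one tag per syllable."""
    tags: List[str] = []
    for syllables in words:
        n = len(syllables)
        if n == 0:
            continue  # skip empty words defensively
        if n == 1:
            tags.append("S")
        else:
            tags.extend(["B"] + ["M"] * (n - 2) + ["E"])
    return tags

def tags_to_words(syllables: List[str], tags: List[str]) -> List[List[str]]:
    """Rebuild words from a syllable sequence and its tags.
    A word is closed whenever an E or S tag is seen."""
    words: List[List[str]] = []
    current: List[str] = []
    for syllable, tag in zip(syllables, tags):
        current.append(syllable)
        if tag in ("E", "S"):
            words.append(current)
            current = []
    if current:
        # Tag sequence ended mid-word; keep the leftover syllables as a word.
        words.append(current)
    return words

# Placeholder sentence: a 2-syllable word, a 1-syllable word, a 3-syllable word.
segmented = [["s1", "s2"], ["s3"], ["s4", "s5", "s6"]]
flat_syllables = [s for word in segmented for s in word]
gold_tags = words_to_tags(segmented)
restored = tags_to_words(flat_syllables, gold_tags)
```

In this framing, a sequence labeling model such as a conditional random field would be trained to predict the tag sequence from the syllable sequence, and `tags_to_words` would turn its predictions into a segmentation.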
Pages: 363-369
Page count: 7