Prompt-based for Low-Resource Tibetan Text Classification

Cited by: 4
Authors
An, Bo [1]
Affiliations
[1] Chinese Acad Social Sci, Inst Ethnol & Anthropol, South Tweenty 7 St,Bldg 6,Zhongguancun Nandajie 2, Beijing, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Tibetan text classification; prompt learning; deep learning; pre-trained language model
DOI
10.1145/3603168
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Text classification is a critical and foundational task in Tibetan natural language processing; it plays a crucial role in various applications, such as sentiment analysis and information extraction. However, the limited availability of annotated data poses a significant challenge to Tibetan natural language processing. This paper proposes a prompt learning-based method for low-resource Tibetan text classification to overcome this challenge. This method utilizes pre-trained language models to learn text representation and generation capabilities on a large-scale unsupervised Tibetan corpus, enabling few-shot Tibetan text classification. Experimental results demonstrate that the proposed method significantly improves the performance of Tibetan text classification in low-resource scenarios. This work provides a new research idea and method for processing low-resource languages such as Tibetan. Hopefully, it will inspire subsequent work on low-resource language processing.
Pages: 13
Related papers
50 in total
  • [1] PromDA: Prompt-based Data Augmentation for Low-Resource NLU Tasks
    Wang, Yufei
    Xu, Can
    Sun, Qingfeng
    Hu, Huang
    Tao, Chongyang
    Geng, Xiubo
    Jiang, Daxin
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 4242 - 4255
  • [2] Prompt Tuning on Graph-Augmented Low-Resource Text Classification
    Wen, Zhihao
    Fang, Yuan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (12) : 9080 - 9095
  • [3] A Prompt-Based Topic-Modeling Method for Depression Detection on Low-Resource Data
    Guo, Yanrong
    Liu, Jilong
    Wang, Lei
    Qin, Wei
    Hao, Shijie
    Hong, Richang
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (01) : 1430 - 1439
  • [4] A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models
    Jin, Woojeong
    Cheng, Yu
    Shen, Yelong
    Chen, Weizhu
    Ren, Xiang
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 2763 - 2775
  • [5] Evolutionary Verbalizer Search for Prompt-Based Few Shot Text Classification
    Ling, Tongtao
    Chen, Lei
    Lai, Yutao
    Liu, Hai-Lin
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT IV, KSEM 2023, 2023, 14120 : 279 - 290
  • [6] Prompt-based Zero-shot Text Classification with Conceptual Knowledge
    Wang, Yuqi
    Wang, Wei
    Chen, Qi
    Huang, Kaizhu
    Nguyen, Anh
    De, Suparna
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-SRW 2023, VOL 4, 2023, : 30 - 38
  • [7] Comparing Prompt-Based and Standard Fine-Tuning for Urdu Text Classification
    Ullah, Faizad
    Azam, Ubaid
    Faheem, Ali
    Kamiran, Faisal
    Karim, Asim
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 6747 - 6754
  • [8] Tibetan Text Classification based on Prompt Learning and Ensemble Learning
    Tang, Chao
    Tan, Zelin
    Zhao, Xiaobing
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2025, 24 (02)
  • [9] Prompt-based Learning for Text Readability Assessment
    Lee, Bruce W.
    Lee, Jason Hyung-Jong
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1819 - 1824
  • [10] Prompt-Based Editing for Text Style Transfer
    Luo, Guoqing
    Han, Yu Tong
    Mou, Lili
    Firdaus, Mauajama
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 5740 - 5750