Zero-Shot Text Classification via Self-Supervised Tuning

被引:0
|
作者
Liu, Chaoqun [1 ,2 ]
Zhang, Wenxuan [2 ]
Chen, Guizhen [1 ,2 ]
Wu, Xiaobao [1 ]
Luu, Anh Tuan [1 ]
Chang, Chip Hong [1 ]
Bing, Lidong [2 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
[2] Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing solutions to zero-shot text classification either conduct prompting with pre-trained language models, which is sensitive to the choices of templates, or rely on large-scale annotated data of relevant tasks for meta-tuning. In this work, we propose a new paradigm based on self-supervised learning to solve zero-shot text classification tasks by tuning the language models with unlabeled data, called self-supervised tuning. By exploring the inherent structure of free texts, we propose a new learning objective called first sentence prediction to bridge the gap between unlabeled data and text classification tasks. After tuning the model to learn to predict the first sentence in a paragraph based on the rest, the model is able to conduct zero-shot inference on unseen tasks such as topic classification and sentiment analysis. Experimental results show that our model outperforms the state-of-the-art baselines on 7 out of 10 tasks. Moreover, the analysis reveals that our model is less sensitive to the prompt design. Our code and pretrained models are publicly available at https://github.com/DAMO-NLP-SG/SSTuning.
引用
收藏
页码:1743 / 1761
页数:19
相关论文
共 50 条
  • [1] Transductive zero-shot image classification based on self-supervised enhancement feature
    Wang H.-Y.
    Zhang X.-R.
    Wang X.-S.
    Cheng Y.-H.
    Kongzhi yu Juece/Control and Decision, 2024, 39 (05): : 1707 - 1717
  • [2] MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale
    Rueckle, Andreas
    Pfeiffer, Jonas
    Gurevych, Iryna
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 2471 - 2486
  • [3] Zero-shot Voice Conversion via Self-supervised Prosody Representation Learning
    Wang, Shijun
    Borth, Damian
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [4] Zero-shot Text Classification via Reinforced Self-training
    Ye, Zhiquan
    Geng, Yuxia
    Chen, Jiaoyan
    Xu, Xiaoxiao
    Zheng, Suhang
    Wang, Feng
    Chen, Jingmin
    Zhang, Jun
    Chen, Huajun
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 3014 - 3024
  • [5] Self-supervised embedding for generalized zero-shot learning in remote sensing scene classification
    Damalla, Rambabu
    Datla, Rajeshreddy
    Vishnu, Chalavadi
    Mohan, Chalavadi Krishna
    JOURNAL OF APPLIED REMOTE SENSING, 2023, 17 (03)
  • [6] ENHANCING CLASS UNDERSTANDING VIA PROMPT-TUNING FOR ZERO-SHOT TEXT CLASSIFICATION
    Dan, Yuhao
    Zhou, Jie
    Chen, Qin
    Bai, Qingchun
    He, Liang
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4303 - 4307
  • [7] A weakly supervised textual entailment approach to zero-shot text classification
    Pamies, Marc
    Llop, Joan
    Multari, Francesco
    Duran-Silva, Nicolau
    Parra-Rojas, Cesar
    Gonzalez-Agirre, Aitor
    Massucci, Francesco Alessandro
    Villegas, Marta
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 286 - 296
  • [8] ZERO-SHOT TEXT-TO-SPEECH SYNTHESIS CONDITIONED USING SELF-SUPERVISED SPEECH REPRESENTATION MODEL
    Fujita, Kenichi
    Ashihara, Takanori
    Kanagawa, Hiroki
    Moriya, Takafumi
    Ijima, Yusuke
    2023 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW, 2023,
  • [9] Zero-Shot Dialogue Disentanglement by Self-Supervised Entangled Response Selection
    Chi, Ta-Chung
    Rudnicky, Alexander, I
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 4897 - 4902
  • [10] Self-Supervised Knowledge Triplet Learning for Zero-Shot Question Answering
    Banerjee, Pratyay
    Baral, Chitta
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 151 - 162