Zero-Shot Text Classification via Self-Supervised Tuning

被引:0
|
作者
Liu, Chaoqun [1 ,2 ]
Zhang, Wenxuan [2 ]
Chen, Guizhen [1 ,2 ]
Wu, Xiaobao [1 ]
Luu, Anh Tuan [1 ]
Chang, Chip Hong [1 ]
Bing, Lidong [2 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
[2] Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing solutions to zero-shot text classification either conduct prompting with pre-trained language models, which is sensitive to the choices of templates, or rely on large-scale annotated data of relevant tasks for meta-tuning. In this work, we propose a new paradigm based on self-supervised learning to solve zero-shot text classification tasks by tuning the language models with unlabeled data, called self-supervised tuning. By exploring the inherent structure of free texts, we propose a new learning objective called first sentence prediction to bridge the gap between unlabeled data and text classification tasks. After tuning the model to learn to predict the first sentence in a paragraph based on the rest, the model is able to conduct zero-shot inference on unseen tasks such as topic classification and sentiment analysis. Experimental results show that our model outperforms the state-of-the-art baselines on 7 out of 10 tasks. Moreover, the analysis reveals that our model is less sensitive to the prompt design. Our code and pretrained models are publicly available at https://github.com/DAMO-NLP-SG/SSTuning.
引用
收藏
页码:1743 / 1761
页数:19
相关论文
共 50 条
  • [21] Information Retrieval from Alternative Data using Zero-Shot Self-Supervised Learning
    Assareh, Amin
    2022 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE FOR FINANCIAL ENGINEERING AND ECONOMICS (CIFER), 2022,
  • [22] Zero-Shot Self-Supervised Joint Temporal Image and Sensitivity Map Reconstruction via Linear Latent Space
    Zhang, Molin
    Xu, Junshen
    Arefeen, Yamin
    Adalsteinsson, Elfar
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 227, 2023, 227 : 1713 - 1725
  • [23] Prototype-Augmented Self-Supervised Generative Network for Generalized Zero-Shot Learning
    Wu, Jiamin
    Zhang, Tianzhu
    Zha, Zheng-Jun
    Luo, Jiebo
    Zhang, Yongdong
    Wu, Feng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1938 - 1951
  • [24] BRING YOUR OWN KG: Self-Supervised Program Synthesis for Zero-Shot KGQA
    Agarwal, Dhruv
    Das, Rajarshi
    Khosla, Sopan
    Gangadharaiah, Rashmi
    Findings of the Association for Computational Linguistics: NAACL 2024 - Findings, 2024, : 896 - 919
  • [25] Zero-shot Topic Classification via Automatic Tagging on Chinese Text Datasets
    Cai, Xinyi
    Tian, Jiao
    Yu, Ke
    Xiao, Hongwang
    Zhang, Kai
    Tsai, Pei -Wei
    2022 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING, ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM, 2022, : 482 - 488
  • [26] Self-supervised regularization for text classification
    Zhou M.
    Li Z.
    Xie P.
    Transactions of the Association for Computational Linguistics, 2021, 9 : 1147 - 1162
  • [27] Self-supervised Regularization for Text Classification
    Zhou, Meng
    Li, Zechen
    Xie, Pengtao
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2021, 9 : 641 - 656
  • [28] Improved Multi-shot Diffusion-Weighted MRI with Zero-Shot Self-supervised Learning Reconstruction
    Cho, Jaejin
    Jun, Yohan
    Wang, Xiaoqing
    Kobayashi, Caique
    Bilgic, Berkin
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT I, 2023, 14220 : 457 - 466
  • [29] Label Augmentation for Zero-Shot Hierarchical Text Classification
    Paletto, Lorenzo
    Basile, Valerio
    Esposito, Roberto
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 7697 - 7706
  • [30] Unified benchmark for zero-shot Turkish text classification
    celik, Emrecan
    Dalyan, Tugba
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (03)