Zero-Shot Text Classification via Self-Supervised Tuning

被引:0
|
作者
Liu, Chaoqun [1 ,2 ]
Zhang, Wenxuan [2 ]
Chen, Guizhen [1 ,2 ]
Wu, Xiaobao [1 ]
Luu, Anh Tuan [1 ]
Chang, Chip Hong [1 ]
Bing, Lidong [2 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
[2] Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing solutions to zero-shot text classification either conduct prompting with pre-trained language models, which is sensitive to the choices of templates, or rely on large-scale annotated data of relevant tasks for meta-tuning. In this work, we propose a new paradigm based on self-supervised learning to solve zero-shot text classification tasks by tuning the language models with unlabeled data, called self-supervised tuning. By exploring the inherent structure of free texts, we propose a new learning objective called first sentence prediction to bridge the gap between unlabeled data and text classification tasks. After tuning the model to learn to predict the first sentence in a paragraph based on the rest, the model is able to conduct zero-shot inference on unseen tasks such as topic classification and sentiment analysis. Experimental results show that our model outperforms the state-of-the-art baselines on 7 out of 10 tasks. Moreover, the analysis reveals that our model is less sensitive to the prompt design. Our code and pretrained models are publicly available at https://github.com/DAMO-NLP-SG/SSTuning.
引用
收藏
页码:1743 / 1761
页数:19
相关论文
共 50 条
  • [31] Extreme Zero-Shot Learning for Extreme Text Classification
    Xiong, Yuanhao
    Chang, Wei-Cheng
    Hsieh, Cho-Jui
    Yu, Hsiang-Fu
    Dhillon, Inderjit
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 5455 - 5468
  • [32] CLIPTEXT: A New Paradigm for Zero-shot Text Classification
    Qin, Libo
    Wang, Weiyun
    Chen, Qiguang
    Che, Wanxiang
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 1077 - 1088
  • [33] Learn to Adapt for Generalized Zero-Shot Text Classification
    Zhang, Yiwen
    Yuan, Caixia
    Wang, Xiaojie
    Bai, Ziwei
    Liu, Yongbin
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 517 - 527
  • [34] Generalized Zero-Shot Text Classification for ICD Coding
    Song, Congzheng
    Zhang, Shanghang
    Sadoughi, Najmeh
    Xie, Pengtao
    Xing, Eric
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 4018 - 4024
  • [35] Zero-Shot Hashing via Transferring Supervised Knowledge
    Yang, Yang
    Luo, Yadan
    Chen, Weilun
    Shen, Fumin
    Shao, Jie
    Shen, Heng Tao
    MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE, 2016, : 1286 - 1295
  • [36] Two-stage and Self-supervised Voice Conversion for Zero-Shot Dysarthric Speech Reconstruction
    Liu, Dong
    Lin, Yueqian
    Bu, Hui
    Li, Ming
    2024 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, IALP 2024, 2024, : 423 - 427
  • [37] UNCERTAINTY AS A PREDICTOR: LEVERAGING SELF-SUPERVISED LEARNING FOR ZERO-SHOT MOS PREDICTION<bold> </bold>
    Ravuri, Aditya
    Cooper, Erica
    Yamagishi, Junichi
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 580 - 584
  • [38] Self-supervised learning of pseudo classes for generalized zero-shot fine-grained recognition
    Chen Y.-H.
    Yeh M.-C.
    Multimedia Tools and Applications, 2025, 84 (10) : 7915 - 7930
  • [39] Pushing the limits of zero-shot self-supervised super-resolution of anisotropic MR images
    Remedios, Samuel W.
    Wei, Shuwen
    Dewey, Blake E.
    Carass, Aaron
    Pham, Dzung L.
    Prince, Jerry L.
    MEDICAL IMAGING 2024: IMAGE PROCESSING, 2024, 12926
  • [40] Self-Supervised Tuning for Few-Shot Segmentation
    Zhu, Kai
    Zhai, Wei
    Cao, Yang
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 1019 - 1025