Zero-Shot Text Classification via Self-Supervised Tuning

被引：0

作者：

Liu, Chaoqun ^{[1
,2
]}

Zhang, Wenxuan ^{[2
]}

Chen, Guizhen ^{[1
,2
]}

Wu, Xiaobao ^{[1
]}

Luu, Anh Tuan ^{[1
]}

Chang, Chip Hong ^{[1
]}

Bing, Lidong ^{[2
]}

机构：

[1] Nanyang Technol Univ, Singapore, Singapore

[2] Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China

来源：

FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023 | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Existing solutions to zero-shot text classification either conduct prompting with pre-trained language models, which is sensitive to the choices of templates, or rely on large-scale annotated data of relevant tasks for meta-tuning. In this work, we propose a new paradigm based on self-supervised learning to solve zero-shot text classification tasks by tuning the language models with unlabeled data, called self-supervised tuning. By exploring the inherent structure of free texts, we propose a new learning objective called first sentence prediction to bridge the gap between unlabeled data and text classification tasks. After tuning the model to learn to predict the first sentence in a paragraph based on the rest, the model is able to conduct zero-shot inference on unseen tasks such as topic classification and sentiment analysis. Experimental results show that our model outperforms the state-of-the-art baselines on 7 out of 10 tasks. Moreover, the analysis reveals that our model is less sensitive to the prompt design. Our code and pretrained models are publicly available at https://github.com/DAMO-NLP-SG/SSTuning.

引用

页码：1743 / 1761

页数：19

共 50 条

[31] Extreme Zero-Shot Learning for Extreme Text Classification
Xiong, Yuanhao
Chang, Wei-Cheng
Hsieh, Cho-Jui
Yu, Hsiang-Fu
Dhillon, Inderjit
NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 5455 - 5468
[32] CLIPTEXT: A New Paradigm for Zero-shot Text Classification
Qin, Libo
Wang, Weiyun
Chen, Qiguang
Che, Wanxiang
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 1077 - 1088
[33] Learn to Adapt for Generalized Zero-Shot Text Classification
Zhang, Yiwen
Yuan, Caixia
Wang, Xiaojie
Bai, Ziwei
Liu, Yongbin
PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 517 - 527
[34] Generalized Zero-Shot Text Classification for ICD Coding
Song, Congzheng
Zhang, Shanghang
Sadoughi, Najmeh
Xie, Pengtao
Xing, Eric
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 4018 - 4024
[35] Zero-Shot Hashing via Transferring Supervised Knowledge
Yang, Yang
Luo, Yadan
Chen, Weilun
Shen, Fumin
Shao, Jie
Shen, Heng Tao
MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE, 2016, : 1286 - 1295
[36] Two-stage and Self-supervised Voice Conversion for Zero-Shot Dysarthric Speech Reconstruction
Liu, Dong
Lin, Yueqian
Bu, Hui
Li, Ming
2024 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, IALP 2024, 2024, : 423 - 427
[37] UNCERTAINTY AS A PREDICTOR: LEVERAGING SELF-SUPERVISED LEARNING FOR ZERO-SHOT MOS PREDICTION<bold> </bold>
Ravuri, Aditya
Cooper, Erica
Yamagishi, Junichi
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 580 - 584
[38] Self-supervised learning of pseudo classes for generalized zero-shot fine-grained recognition
Chen Y.-H.
Yeh M.-C.
Multimedia Tools and Applications, 2025, 84 (10) : 7915 - 7930
[39] Pushing the limits of zero-shot self-supervised super-resolution of anisotropic MR images
Remedios, Samuel W.
Wei, Shuwen
Dewey, Blake E.
Carass, Aaron
Pham, Dzung L.
Prince, Jerry L.
MEDICAL IMAGING 2024: IMAGE PROCESSING, 2024, 12926
[40] Self-Supervised Tuning for Few-Shot Segmentation
Zhu, Kai
Zhai, Wei
Cao, Yang
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 1019 - 1025

← 1 2 3 4 5 →