Joint Representations of Texts and Labels with Compositional Loss for Short Text Classification

被引：3

作者：

Hao, Ming ^{[1
]}

Wang, Weijing ^{[2
]}

Zhou, Fang ^{[1
]}

机构：

[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China

[2] Univ Illinois, Dept Bioengn, Urbana, IL 61801 USA

来源：

JOURNAL OF WEB ENGINEERING | 2021年 / 20卷 / 03期

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Ambiguous text; deep language models; label embedding; text classification; triplet loss;

D O I：

10.13052/jwe1540-9589.2035

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Short text classification is an important foundation for natural language processing (NLP) tasks. Though, the text classification based on deep language models (DLMs) has made a significant headway, in practical applications however, some texts are ambiguous and hard to classify in multi-class classification especially, for short texts whose context length is limited. The mainstream method improves the distinction of ambiguous text by adding context information. However, these methods rely only the text representation, and ignore that the categories overlap and are not completely independent of each other. In this paper, we establish a new general method to solve the problem of ambiguous text classification by introducing label embedding to represent each category, which makes measurable difference between the categories. Further, a new compositional loss function is proposed to train the model, which makes the text representation closer to the ground-truth label and farther away from others. Finally, a constraint is obtained by calculating the similarity between the text representation and label embedding. Errors caused by ambiguous text can be corrected by adding constraints to the output layer of the model. We apply the method to three classical models and conduct experiments on six public datasets. Experiments show that our method can effectively improve the classification accuracy of the ambiguous texts. In addition, combining our method with BERT, we obtain the state-of-the-art results on the CNT dataset.

引用

页码：669 / 687

页数：19

共 50 条

[1] Joint Embedding of Words and Labels for Text Classification
Wang, Guoyin
Li, Chunyuan
Wang, Wenlin
Zhang, Yizhe
Shen, Dinghan
Zhang, Xinyuan
Henao, Ricardo
Carin, Lawrence
PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 2321 - 2331
[2] Short Texts Representations for Legal Domain Classification
Zymkowski, Tomasz
Szymanski, Julian
Sobecki, Andrzej
Drozda, Pawel
Szalapak, Konrad
Komar-Komarowski, Kajetan
Scherer, Rafal
ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2022, PT I, 2023, 13588 : 105 - 114
[3] Short Text Classification Based on Distributional Representations of Words
Ma, Chenglong
Zhao, Qingwei
Pan, Jielin
Yan, Yonghong
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (10): : 2562 - 2565
[4] Compositional Recurrent Neural Networks for Chinese Short Text Classification
Zhou, Yujun
Xu, Bo
Xu, Jiaming
Yang, Lei
Li, Changliang
Xu, Bo
2016 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2016), 2016, : 137 - 144
[5] Bidirectional Multi-channel Semantic Interaction Model of Labels and Texts for Text Classification
Wang, Yuan
Zhou, Yubo
Hu, Peng
Xu, Maoling
Zhao, Tingting
Chen, Yarui
NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT II, 2022, 13552 : 73 - 84
[6] Compositional Mixture Representations for Vision and Text
Alaniz, Stephan
Federici, Marco
Akata, Zeynep
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4201 - 4210
[7] Generating Compositional Color Representations from Text
Maheshwari, Paridhi
Jain, Nihal
Vaddamanu, Praneetha
Raut, Dhananjay
Vaishay, Shraiysh
Vinay, Vishwa
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 1222 - 1231
[8] Text Classification: The Case of Multiple Labels
Bobicev, Victoria
2016 INTERNATIONAL CONFERENCE ON COMMUNICATIONS (COMM 2016), 2016, : 39 - 42
[9] Text Representations for Patent Classification
D'hondt, Eva
Verberne, Suzan
Koster, Cornelis
Boves, Lou
COMPUTATIONAL LINGUISTICS, 2013, 39 (03) : 755 - 775
[10] Local representations using very short labels
Scheinerman, ER
DISCRETE MATHEMATICS, 1999, 203 (1-3) : 287 - 290

← 1 2 3 4 5 →