Joint Representations of Texts and Labels with Compositional Loss for Short Text Classification

被引:3
|
作者
Hao, Ming [1 ]
Wang, Weijing [2 ]
Zhou, Fang [1 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
[2] Univ Illinois, Dept Bioengn, Urbana, IL 61801 USA
来源
JOURNAL OF WEB ENGINEERING | 2021年 / 20卷 / 03期
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Ambiguous text; deep language models; label embedding; text classification; triplet loss;
D O I
10.13052/jwe1540-9589.2035
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Short text classification is an important foundation for natural language processing (NLP) tasks. Though, the text classification based on deep language models (DLMs) has made a significant headway, in practical applications however, some texts are ambiguous and hard to classify in multi-class classification especially, for short texts whose context length is limited. The mainstream method improves the distinction of ambiguous text by adding context information. However, these methods rely only the text representation, and ignore that the categories overlap and are not completely independent of each other. In this paper, we establish a new general method to solve the problem of ambiguous text classification by introducing label embedding to represent each category, which makes measurable difference between the categories. Further, a new compositional loss function is proposed to train the model, which makes the text representation closer to the ground-truth label and farther away from others. Finally, a constraint is obtained by calculating the similarity between the text representation and label embedding. Errors caused by ambiguous text can be corrected by adding constraints to the output layer of the model. We apply the method to three classical models and conduct experiments on six public datasets. Experiments show that our method can effectively improve the classification accuracy of the ambiguous texts. In addition, combining our method with BERT, we obtain the state-of-the-art results on the CNT dataset.
引用
收藏
页码:669 / 687
页数:19
相关论文
共 50 条
  • [1] Joint Embedding of Words and Labels for Text Classification
    Wang, Guoyin
    Li, Chunyuan
    Wang, Wenlin
    Zhang, Yizhe
    Shen, Dinghan
    Zhang, Xinyuan
    Henao, Ricardo
    Carin, Lawrence
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 2321 - 2331
  • [2] Short Texts Representations for Legal Domain Classification
    Zymkowski, Tomasz
    Szymanski, Julian
    Sobecki, Andrzej
    Drozda, Pawel
    Szalapak, Konrad
    Komar-Komarowski, Kajetan
    Scherer, Rafal
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2022, PT I, 2023, 13588 : 105 - 114
  • [3] Short Text Classification Based on Distributional Representations of Words
    Ma, Chenglong
    Zhao, Qingwei
    Pan, Jielin
    Yan, Yonghong
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (10): : 2562 - 2565
  • [4] Compositional Recurrent Neural Networks for Chinese Short Text Classification
    Zhou, Yujun
    Xu, Bo
    Xu, Jiaming
    Yang, Lei
    Li, Changliang
    Xu, Bo
    2016 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2016), 2016, : 137 - 144
  • [5] Bidirectional Multi-channel Semantic Interaction Model of Labels and Texts for Text Classification
    Wang, Yuan
    Zhou, Yubo
    Hu, Peng
    Xu, Maoling
    Zhao, Tingting
    Chen, Yarui
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT II, 2022, 13552 : 73 - 84
  • [6] Compositional Mixture Representations for Vision and Text
    Alaniz, Stephan
    Federici, Marco
    Akata, Zeynep
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4201 - 4210
  • [7] Generating Compositional Color Representations from Text
    Maheshwari, Paridhi
    Jain, Nihal
    Vaddamanu, Praneetha
    Raut, Dhananjay
    Vaishay, Shraiysh
    Vinay, Vishwa
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 1222 - 1231
  • [8] Text Classification: The Case of Multiple Labels
    Bobicev, Victoria
    2016 INTERNATIONAL CONFERENCE ON COMMUNICATIONS (COMM 2016), 2016, : 39 - 42
  • [9] Text Representations for Patent Classification
    D'hondt, Eva
    Verberne, Suzan
    Koster, Cornelis
    Boves, Lou
    COMPUTATIONAL LINGUISTICS, 2013, 39 (03) : 755 - 775
  • [10] Local representations using very short labels
    Scheinerman, ER
    DISCRETE MATHEMATICS, 1999, 203 (1-3) : 287 - 290