Joint Representations of Texts and Labels with Compositional Loss for Short Text Classification

被引:3
|
作者
Hao, Ming [1 ]
Wang, Weijing [2 ]
Zhou, Fang [1 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
[2] Univ Illinois, Dept Bioengn, Urbana, IL 61801 USA
来源
JOURNAL OF WEB ENGINEERING | 2021年 / 20卷 / 03期
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Ambiguous text; deep language models; label embedding; text classification; triplet loss;
D O I
10.13052/jwe1540-9589.2035
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Short text classification is an important foundation for natural language processing (NLP) tasks. Though, the text classification based on deep language models (DLMs) has made a significant headway, in practical applications however, some texts are ambiguous and hard to classify in multi-class classification especially, for short texts whose context length is limited. The mainstream method improves the distinction of ambiguous text by adding context information. However, these methods rely only the text representation, and ignore that the categories overlap and are not completely independent of each other. In this paper, we establish a new general method to solve the problem of ambiguous text classification by introducing label embedding to represent each category, which makes measurable difference between the categories. Further, a new compositional loss function is proposed to train the model, which makes the text representation closer to the ground-truth label and farther away from others. Finally, a constraint is obtained by calculating the similarity between the text representation and label embedding. Errors caused by ambiguous text can be corrected by adding constraints to the output layer of the model. We apply the method to three classical models and conduct experiments on six public datasets. Experiments show that our method can effectively improve the classification accuracy of the ambiguous texts. In addition, combining our method with BERT, we obtain the state-of-the-art results on the CNT dataset.
引用
收藏
页码:669 / 687
页数:19
相关论文
共 50 条
  • [31] A Hybrid Distributed Model for Learning Representation of Short Texts with Attribute Labels
    Kumar, Shashi
    Roy, Suman
    Pathak, Vishal
    PROCEEDINGS OF THE 7TH ACM IKDD CODS AND 25TH COMAD (CODS-COMAD 2020), 2020, : 244 - 248
  • [32] Instances and Labels: Hierarchy-aware Joint Supervised Contrastive Learning for Hierarchical Multi-Label Text Classification
    Lok, Simon Chi U.
    He, Jie
    Gutierrez-Basulto, Victor
    Pan, Jeff Z.
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 8858 - 8875
  • [33] Joint Representations of Text and Knowledge Graphs for Retrieval and Evaluation
    Le Scao, Teven
    Gardent, Claire
    13TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING AND THE 3RD CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, IJCNLP-AACL 2023, 2023, : 110 - 122
  • [34] Keyword Extraction from Short Texts with a Text-to-Text Transfer Transformer
    Pezik, Piotr
    Mikolajczyk, Agnieszka
    Wawrzynski, Adam
    Niton, Bartlomiej
    Ogrodniczuk, Maciej
    RECENT CHALLENGES IN INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2022, 2022, 1716 : 530 - 542
  • [35] Text Classification for Data Loss Preventionwa
    Hart, Michael
    Manadhata, Pratyusa
    Johnson, Rob
    PRIVACY ENHANCING TECHNOLOGIES, 2011, 6794 : 18 - +
  • [36] A New Vector Representation of Short Texts for Classification
    Li, Yangyang
    Liu, Bo
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2020, 17 (02) : 241 - 249
  • [37] Review of short-text classification
    Alsmadi, Issa
    Gan, Keng Hoon
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2019, 15 (02) : 155 - 182
  • [38] Introducing Semantics in Short Text Classification
    Bouaziz, Ameni
    Pereira, Celia da Costa
    Dartigues-Pallez, Christel
    Precioso, Frederic
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, (CICLING 2016), PT II, 2018, 9624 : 433 - 445
  • [39] A Probabilistic Framework for Short Text Classification
    Ali, Mubashir
    Khalid, Shehzad
    Rana, Mazhar Iqbal
    Azhar, Fizza
    2018 IEEE 8TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2018, : 742 - 747
  • [40] Short Text Classification Based on Semantics
    Ma, Chenglong
    Wan, Xin
    Zhang, Zhen
    Li, Taisong
    Zhang, Yan
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, ICIC 2015, PT III, 2015, 9227 : 463 - 470