A multi-label social short text classification method based on contrastive learning and improved ml-KNN

被引:3
|
作者
Tian, Gang [1 ]
Wang, Jiachang [1 ]
Wang, Rui [2 ]
Zhao, Guangxin [1 ]
He, Cheng [1 ]
机构
[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Energy & Min Engn, Qingdao, Peoples R China
关键词
contrastive learning; deep learning; improved ml-KNN; multi-label text classification;
D O I
10.1111/exsy.13547
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Short texts on social platforms often have the problems of diverse categories and semantic sparsity, making it challenging to identify the diverse intentions of users. To address this issue, this article proposes a multi-label social short text classification method (IML-CL) based on contrastive learning and improved ml-KNN. First, a contrastive learning approach is employed to train a multi-label text classification model. This approach improves semantic sparsity by leveraging the knowledge from the existing samples to enrich the feature representation of short texts. Simultaneously, an improved ml-KNN algorithm is developed to enhance the accuracy of label prediction. This algorithm utilizes a two-layer nearest neighbor rule and introduces a penalty function and weight optimization. Next, the model generates the feature representation for the test sample and predicts its label. Additionally, the improved ml-KNN algorithm retrieves neighbors of the test sample and uses their label information for prediction. Finally, the two predictions are combined to obtain the final prediction, which accurately identifies the user's intention. The experimental results demonstrate that, on the dataset constructed in this article, the IML-CL method effectively boosts the performance of the baseline model.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Multi-label Text Classification Method Based on Label Semantic Information
    Xiao L.
    Chen B.-L.
    Huang X.
    Liu H.-F.
    Jing L.-P.
    Yu J.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (04): : 1079 - 1089
  • [22] Contrastive Learning-Enhanced Nearest Neighbor Mechanism for Multi-Label Text Classification
    Su, Xi'ao
    Wang, Ran
    Dai, Xinyu
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): (SHORT PAPERS), VOL 2, 2022, : 672 - 679
  • [23] An Improved Multi-label Classifier Chain Method for Automated Text Classification
    Abdullahi, Adeleke
    Samsudin, Noor Azah
    Khalid, Shamsul Kamal Ahmad
    Othman, Zuhaila Ali
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (03) : 442 - 449
  • [24] A Survey of Multi-label Text Classification Based on Deep Learning
    Chen, Xiaolong
    Cheng, Jieren
    Liu, Jingxin
    Xu, Wenghang
    Hua, Shuai
    Tang, Zhu
    Sheng, Victor S.
    ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT I, 2022, 13338 : 443 - 456
  • [25] Multi-Label Arabic Text Classification Based On Deep Learning
    Alsukhni, Batool
    2021 12TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2021, : 475 - 477
  • [26] ML-FOREST: A Multi-Label Tree Ensemble Method for Multi-Label Classification
    Wu, Qingyao
    Tan, Mingkui
    Song, Hengjie
    Chen, Jian
    Ng, Michael K.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (10) : 2665 - 2680
  • [27] Deep Learning Method with Attention for Extreme Multi-label Text Classification
    Chen, Si
    Wang, Liangguo
    Li, Wan
    Zhang, Kun
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 179 - 190
  • [28] Metadata-Induced Contrastive Learning for Zero-Shot Multi-Label Text Classification
    Zhang, Yu
    Shen, Zhihong
    Wu, Chieh-Han
    Xie, Boya
    Hao, Junheng
    Wang, Ye-Yi
    Wang, Kuansan
    Han, Jiawei
    PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 3162 - 3173
  • [29] Improved Graph Contrastive Learning for Short Text Classification
    Liu, Yonghao
    Huang, Lan
    Giunchiglia, Fausto
    Feng, Xiaoyue
    Guan, Renchu
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 18716 - 18724
  • [30] Hierarchical Multi-Label Classification of Social Text Streams
    Ren, Zhaochun
    Peetz, Maria-Hendrike
    Liang, Shangsong
    van Dolen, Willemijn
    de Rijke, Maarten
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 213 - 222