A multi-label social short text classification method based on contrastive learning and improved ml-KNN

Cited by: 3
Authors
Tian, Gang [1]
Wang, Jiachang [1]
Wang, Rui [2]
Zhao, Guangxin [1]
He, Cheng [1]
Affiliations
[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Energy & Min Engn, Qingdao, Peoples R China
Keywords
contrastive learning; deep learning; improved ml-KNN; multi-label text classification
DOI
10.1111/exsy.13547
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Short texts on social platforms often suffer from diverse categories and semantic sparsity, which makes it challenging to identify the varied intentions of users. To address this issue, this article proposes a multi-label social short text classification method (IML-CL) based on contrastive learning and an improved ml-KNN. First, a contrastive learning approach is employed to train a multi-label text classification model. This approach mitigates semantic sparsity by leveraging knowledge from existing samples to enrich the feature representation of short texts. Simultaneously, an improved ml-KNN algorithm is developed to enhance the accuracy of label prediction. This algorithm uses a two-layer nearest neighbor rule and introduces a penalty function and weight optimization. Next, the model generates the feature representation of the test sample and predicts its labels. Additionally, the improved ml-KNN algorithm retrieves the neighbors of the test sample and uses their label information for prediction. Finally, the two predictions are combined to obtain the final prediction, which identifies the user's intention. The experimental results show that, on the dataset constructed in this article, the IML-CL method improves the performance of the baseline model.
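The final combination step described in the abstract can be illustrated with a minimal sketch. This is not the authors' IML-CL implementation: the abstract does not specify the two-layer nearest neighbor rule, the penalty function, or the weight optimization, so the sketch substitutes a plain similarity-weighted k-nearest-neighbor label vote and a fixed interpolation weight. The function names (`cosine`, `knn_label_scores`, `combine`) and the parameters `k`, `lam`, and `threshold` are hypothetical, chosen only for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def knn_label_scores(query, train_vecs, train_labels, k=3):
    """Similarity-weighted label vote over the k nearest training samples.

    Assumption: a plain weighted vote stands in for the paper's improved
    ml-KNN, whose details are not given in the abstract.
    """
    neighbors = sorted(
        ((cosine(query, vec), labels)
         for vec, labels in zip(train_vecs, train_labels)),
        key=lambda pair: pair[0],
        reverse=True,
    )[:k]
    n_labels = len(train_labels[0])
    total = sum(max(sim, 0.0) for sim, _ in neighbors)
    if total == 0:
        return [0.0] * n_labels
    return [
        sum(max(sim, 0.0) * labels[j] for sim, labels in neighbors) / total
        for j in range(n_labels)
    ]


def combine(model_probs, knn_scores, lam=0.5, threshold=0.5):
    """Interpolate classifier and neighbor scores, then threshold per label.

    Assumption: linear interpolation with a fixed weight `lam`; the paper's
    actual combination rule may differ.
    """
    mixed = [lam * p + (1 - lam) * q for p, q in zip(model_probs, knn_scores)]
    return mixed, [int(m >= threshold) for m in mixed]


# Toy data: three training samples with 2-d features and two labels each.
train_vecs = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
train_labels = [[1, 0], [1, 1], [0, 1]]

scores = knn_label_scores([1.0, 0.0], train_vecs, train_labels, k=2)
mixed, predicted = combine([0.6, 0.2], scores)
print(scores, mixed, predicted)
```

In this toy setup the feature vectors would come from the contrastively trained encoder, and `model_probs` from its classification head; both are hard-coded here to keep the sketch self-contained.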
Pages: 19
Related papers
50 items in total
  • [41] Multi-Label Classification Method Based on Extreme Learning Machines
    Venkatesan, Rajasekar
    Er, Meng Joo
    2014 13TH INTERNATIONAL CONFERENCE ON CONTROL AUTOMATION ROBOTICS & VISION (ICARCV), 2014, : 619 - 624
  • [42] Multi-Label Emotion Classification of Online Learners' Reviews Using Machine Learning Text-Based Multi-Label Classification Approach
    Makhoukhi, Hajar
    Roubi, Sarra
    2024 5TH INTERNATIONAL CONFERENCE ON EDUCATION DEVELOPMENT AND STUDIES, ICEDS 2024, 2024, : 59 - 64
  • [43] PROBMCL: SIMPLE PROBABILISTIC CONTRASTIVE LEARNING FOR MULTI-LABEL VISUAL CLASSIFICATION
    Sajedi, Ahmad
    Khaki, Samir
    Lawryshyn, Yuri A.
    Plataniotis, Konstantinos N.
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 5115 - 5119
  • [44] Gaussian Mixture Variational Autoencoder with Contrastive Learning for Multi-Label Classification
    Bai, Junwen
    Kong, Shufeng
    Gomes, Carla
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [45] An Improved Multi-label Classification Ensemble Learning Algorithm
    Fu, Zhongliang
    Wang, Lili
    Zhang, Danpu
    PATTERN RECOGNITION (CCPR 2014), PT I, 2014, 483 : 243 - 252
  • [46] Multi-label Arabic text classification in Online Social Networks
    Omar, Ahmed
    Mahmoud, Tarek M.
    Abd-El-Hafeez, Tarek
    Mahfouz, Ahmed
    INFORMATION SYSTEMS, 2021, 100
  • [47] Multi-label learning method based on ML-RBF and laplacian ELM
    Xu, Xinzheng
    Shan, Dong
    Li, Shan
    Sun, Tongfeng
    Xiao, Pengcheng
    Fan, Jianping
    NEUROCOMPUTING, 2019, 331 : 213 - 219
  • [48] Design of educational method classification model based on improved multi-label transfer learning model
    Zeng, Chanjuan
    Zhao, Chunhui
    SOFT COMPUTING, 2023,
  • [49] Multi-Label Classification of Text Documents Using Deep Learning
    Mohammed, Hamza Haruna
    Dogdu, Erdogan
    Gorur, Abdul Kadir
    Choupani, Roya
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 4681 - 4689
  • [50] EnML: Multi-label Ensemble Learning for Urdu Text Classification
    Mehmood, Faiza
    Shahzadi, Rehab
    Ghafoor, Hina
    Asim, Muhammad Nabeel
    Ghani, Muhammad Usman
    Mahmood, Waqar
    Dengel, Andreas
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (09)