MLAWSMOTE: Oversampling in Imbalanced Multi-label Classification with Missing Labels by Learning Label Correlation Matrix

被引:0
|
作者
Mao, Jian [1 ]
Huang, Kai [1 ]
Liu, Jinming [1 ]
机构
[1] Jimei Univ, Coll Comp Engn, Yinjiang Rd, Xiamen 361021, Fujian, Peoples R China
关键词
Multi-label learning; Imbalanced learning; Missing labels; Oversampling methods; LINEAR INVERSE PROBLEMS; FEATURE-SELECTION; RENEWABLE ENERGY; ALGORITHM; GENERATION;
D O I
10.1007/s44196-024-00607-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Missing labels in multi-label datasets are a common problem, especially for minority classes, which are more likely to occur. This limitation hinders the performance of classifiers in identifying and extracting information from minority classes. Oversampling is an effective method for addressing imbalanced multi-label problems by generating synthetic instances to create a class-balanced dataset. However, the existing oversampling algorithms mainly focus on the location of the generated data, and there is a lack of design on how to complete the labels of the synthetic data. To address this issue, we propose MLAWSMOTE, a synthetic data generation algorithm based on matrix factorization weights. We introduce a weak supervised learning method in the oversampling method, optimize the weights of features and labels by using label correlation, and iteratively learn the ideal label weights. The mapping relationship between features and labels is learned from the dataset and the label correlation matrix. The oversampling ratio is defined based on the discrepancy between observed labels and the ideal label of synthetic instances. It mitigates the impact of missing minority labels on the model's predictions. The labeling of synthetic instances is performed based on label prediction, and the potential labeling distribution is complemented. Experimental results on multiple multi-label datasets under different label missing ratios demonstrate the effectiveness of the proposed method in terms of ACC, Hamming loss, MacroF1 and MicroF1. In the validation of the four classifiers, MacroF1 decreased by 24.78%, 17.81%, 3.8% and 19.56%, respectively, with the increase of label loss rate. After applying MLAWSMOTE only decreased by 15.79%, 13.63%, 3.78% and 15.21%.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Multi-label classification with Missing Labels using Label Correlation and Robust Structural Learning
    Rastogi, Reshma
    Mortaza, Sayed
    KNOWLEDGE-BASED SYSTEMS, 2021, 229
  • [2] Label correlation guided borderline oversampling for imbalanced multi-label data learning
    Zhang, Kai
    Mao, Zhaoyang
    Cao, Peng
    Liang, Wei
    Yang, Jinzhu
    Li, Weiping
    Zaiane, Osmar R.
    KNOWLEDGE-BASED SYSTEMS, 2023, 279
  • [3] Multi-label classification with weak labels by learning label correlation and label regularization
    Ji, Xiaowan
    Tan, Anhui
    Wu, Wei-Zhi
    Gu, Shenming
    APPLIED INTELLIGENCE, 2023, 53 (17) : 20110 - 20133
  • [4] Multi-label classification with weak labels by learning label correlation and label regularization
    Xiaowan Ji
    Anhui Tan
    Wei-Zhi Wu
    Shenming Gu
    Applied Intelligence, 2023, 53 : 20110 - 20133
  • [5] Multi-Label Learning with Missing Labels
    Wu, Baoyuan
    Liu, Zhilei
    Wang, Shangfei
    Hu, Bao-Gang
    Ji, Qiang
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 1964 - 1968
  • [6] Global and Adaptive Local Label Correlation for Multi-label Learning with Missing Labels
    Jiang, Qingxia
    Li, Peipei
    Zhang, Yuhong
    Hu, Xuegang
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [7] Learning Label-Specific Features for Multi-Label Classification with Missing Labels
    Huang, Jun
    Qin, Feng
    Zheng, Xiao
    Cheng, Zekai
    Yuan, Zhixiang
    Zhang, Weigang
    2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2018,
  • [8] Pseudo Labels for Imbalanced Multi-Label Learning
    Zeng, Wenrong
    Chen, Xuewen
    Cheng, Hong
    2014 INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2014, : 25 - 31
  • [9] Multi-Label Classification for images with Missing Labels
    Ma, Jianghong
    Fan, Jicong
    Wang, Wei
    2017 IEEE 15TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2017, : 1050 - 1055
  • [10] Learning Common and Label-Specific Features for Multi-Label Classification With Missing Labels
    Li, Runxin
    Ouyang, Zexian
    Shang, Zhenhong
    Jia, Lianyin
    Li, Xiaowu
    IEEE ACCESS, 2024, 12 : 81170 - 81195