Multi-label borderline oversampling technique

被引:10
|
作者
Teng, Zeyu [1 ]
Cao, Peng [2 ,3 ]
Huang, Min [1 ]
Gao, Zheming [1 ]
Wang, Xingwei [2 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Liaoning, Peoples R China
[2] Northeastern Univ, Coll Comp Sci & Engn, Shenyang 110169, Liaoning, Peoples R China
[3] Northeastern Univ, Key Lab Intelligent Comp Med Image, Minist Educ, Shenyang 110169, Liaoning, Peoples R China
关键词
Multi-label learning; Class imbalance; Borderline sample; Oversampling; CLASSIFICATION; IMBALANCE; RANKING; MACHINE; SMOTE;
D O I
10.1016/j.patcog.2023.109953
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Class imbalance problem commonly exists in multi-label classification (MLC) tasks. It has non-negligible im-pacts on the classifier performance and has drawn extensive attention in recent years. Borderline oversampling has been widely used in single-label learning as a competitive technique in dealing with class imbalance. Nevertheless, the borderline samples in multi-label data sets (MLDs) have not been studied. Hence, this paper deeply discussed the borderline samples in MLDs and found they have different neighboring relationships with class borders, which makes their roles different in the classifier training. For that, they are divided into two types named the self-borderline samples and the cross-borderline samples. Further, a novel MLDs resampling approach called Multi-Label Borderline Oversampling Technique (MLBOTE) is proposed for multi -label imbalanced learning. MLBOTE identifies three types of seed samples, including interior, self-borderline, and cross-borderline samples, and different oversampling mechanisms are designed for them, respectively. Meanwhile, it regards not only the minority classes but also the classes suffering from one-vs-rest imbalance as those in need of oversampling. Experiments on eight data sets with nine MLC algorithms and three base classifiers are carried out to compare MLBOTE with some state-of-art MLDs resampling techniques. The results show MLBOTE outperforms other methods in various scenarios.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Label correlation guided borderline oversampling for imbalanced multi-label data learning
    Zhang, Kai
    Mao, Zhaoyang
    Cao, Peng
    Liang, Wei
    Yang, Jinzhu
    Li, Weiping
    Zaiane, Osmar R.
    KNOWLEDGE-BASED SYSTEMS, 2023, 279
  • [2] A diversity and reliability-enhanced synthetic minority oversampling technique for multi-label learning
    Gong, Yanlu
    Wu, Quanwang
    Zhou, Mengchu
    Chen, Chao
    INFORMATION SCIENCES, 2025, 690
  • [3] Synthetic Oversampling of Multi-label Data Based on Local Label Distribution
    Liu, Bin
    Tsoumakas, Grigorios
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II, 2020, 11907 : 180 - 193
  • [4] AEMLO: AutoEncoder-Guided Multi-label Oversampling
    Zhou, Ao
    Liu, Bin
    Wang, Jin
    Sun, Kaiwei
    Liu, Kelin
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT I, ECML PKDD 2024, 2024, 14941 : 107 - 124
  • [5] Oversampling multi-label data based on natural neighbor and label correlation
    Liu, Bin
    Zhou, Ao
    Wei, Bingkun
    Wang, Jin
    Tsoumakas, Grigorios
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 259
  • [6] MLAWSMOTE: Oversampling in Imbalanced Multi-label Classification with Missing Labels by Learning Label Correlation Matrix
    Mao, Jian
    Huang, Kai
    Liu, Jinming
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2024, 17 (01)
  • [7] Reverse-nearest neighborhood based oversampling for imbalanced, multi-label datasets
    Sadhukhan, Payel
    Palit, Sarbani
    PATTERN RECOGNITION LETTERS, 2019, 125 : 813 - 820
  • [8] Imbalanced Data Handling in Multi-label Aspect Categorization using Oversampling and Ensemble Learning
    Alnatara, Wildan Dicky
    Khodra, Masayu Leylia
    ICACSIS 2020: 2020 12TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2020, : 165 - 170
  • [9] MLCE: A Multi-Label Crotch Ensemble Method for Multi-Label Classification
    Yao, Yuan
    Li, Yan
    Ye, Yunming
    Li, Xutao
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (04)
  • [10] Multi-label Classification of Small Samples Using an Ensemble Technique
    Mahdavi-Shahri, Amirreza
    Karimian, Jamil
    Javadi, Azadeh
    Houshmand, Mahboobeh
    26TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE 2018), 2018, : 1708 - 1713