Multi-label borderline oversampling technique

Cited by: 10
Authors
Teng, Zeyu [1]
Cao, Peng [2, 3]
Huang, Min [1]
Gao, Zheming [1]
Wang, Xingwei [2]
Affiliations
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Liaoning, Peoples R China
[2] Northeastern Univ, Coll Comp Sci & Engn, Shenyang 110169, Liaoning, Peoples R China
[3] Northeastern Univ, Key Lab Intelligent Comp Med Image, Minist Educ, Shenyang 110169, Liaoning, Peoples R China
Keywords
Multi-label learning; Class imbalance; Borderline sample; Oversampling; CLASSIFICATION; IMBALANCE; RANKING; MACHINE; SMOTE
DOI
10.1016/j.patcog.2023.109953
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The class imbalance problem is common in multi-label classification (MLC) tasks. It has a non-negligible impact on classifier performance and has drawn extensive attention in recent years. Borderline oversampling has been widely used in single-label learning as a competitive technique for dealing with class imbalance. Nevertheless, the borderline samples in multi-label data sets (MLDs) have not been studied. Hence, this paper discusses the borderline samples in MLDs in depth and finds that they have different neighboring relationships with class borders, which gives them different roles in classifier training. Accordingly, they are divided into two types, named self-borderline samples and cross-borderline samples. Further, a novel MLD resampling approach called the Multi-Label Borderline Oversampling Technique (MLBOTE) is proposed for multi-label imbalanced learning. MLBOTE identifies three types of seed samples (interior, self-borderline, and cross-borderline samples) and designs a different oversampling mechanism for each. Meanwhile, it regards not only the minority classes but also the classes suffering from one-vs-rest imbalance as in need of oversampling. Experiments on eight data sets with nine MLC algorithms and three base classifiers are carried out to compare MLBOTE with several state-of-the-art MLD resampling techniques. The results show that MLBOTE outperforms the other methods in various scenarios.
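As a rough illustration of the two ideas the abstract builds on, the sketch below computes a one-vs-rest imbalance ratio for a single label and tags that label's positive samples with the neighbour-counting rule of single-label Borderline-SMOTE. It is not MLBOTE: the abstract does not give the definitions of self-borderline and cross-borderline samples, so none are attempted here. The function names, the toy data, and the choice of `k` are all assumptions made for the example.

```python
from math import dist

def one_vs_rest_ratio(Y, label):
    """Negatives-to-positives ratio for one column of a 0/1 label matrix."""
    pos = sum(row[label] for row in Y)
    neg = len(Y) - pos
    return float("inf") if pos == 0 else neg / pos

def categorize_positives(X, Y, label, k=3):
    """Tag each positive sample of `label` by how many of its k nearest
    neighbours are negative (single-label Borderline-SMOTE rule,
    applied to one label at a time; an assumption, not the paper's rule)."""
    tags = {}
    for i, x in enumerate(X):
        if Y[i][label] != 1:
            continue
        # Indices of the k nearest other samples by Euclidean distance.
        neighbours = sorted((j for j in range(len(X)) if j != i),
                            key=lambda j: dist(x, X[j]))[:k]
        negatives = sum(1 for j in neighbours if Y[j][label] == 0)
        if negatives == k:
            tags[i] = "noise"        # surrounded by the other class
        elif 2 * negatives >= k:
            tags[i] = "borderline"   # at least half the neighbours are negative
        else:
            tags[i] = "interior"     # mostly same-class neighbours
    return tags

# Toy data (hypothetical): ten 1-D samples, two labels per sample.
X = [(0.0,), (1.0,), (2.1,), (3.0,), (3.4,),
     (4.0,), (5.5,), (7.5,), (9.0,), (10.0,)]
Y = [[1, 0], [1, 0], [1, 1], [0, 1], [0, 1],
     [0, 1], [1, 0], [0, 0], [0, 0], [0, 0]]

print(one_vs_rest_ratio(Y, 0))
print(categorize_positives(X, Y, 0, k=3))
```

In a borderline oversampling scheme of this kind, samples tagged as borderline would typically serve as seeds for synthetic sample generation, while samples tagged as noise would be skipped.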
Pages: 17
Related papers
50 in total
  • [31] Privileged Label Enhancement with Multi-Label Learning
    Zhu, Wenfang
    Jia, Xiuyi
    Li, Weiwei
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2376 - 2382
  • [32] Label Construction for Multi-label Feature Selection
    Spolaor, Newton
    Monard, Maria Carolina
    Tsoumakas, Grigorios
    Lee, Huei Diana
    2014 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2014, : 247 - 252
  • [33] Label prompt for multi-label text classification
    Song, Rui
    Liu, Zelong
    Chen, Xingbing
    An, Haining
    Zhang, Zhiqi
    Wang, Xiaoguang
    Xu, Hao
    APPLIED INTELLIGENCE, 2023, 53 : 8761 - 8775
  • [34] Asymmetry label correlation for multi-label learning
    Bao, Jiachao
    Wang, Yibin
    Cheng, Yusheng
    APPLIED INTELLIGENCE, 2022, 52 (06) : 6093 - 6105
  • [35] Multi-label classification by exploiting label correlations
    Yu, Ying
    Pedrycz, Witold
    Miao, Duoqian
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (06) : 2989 - 3004
  • [36] Multi-Label Classification with Label Graph Superimposing
    Wang, Ya
    He, Dongliang
    Li, Fu
    Long, Xiang
    Zhou, Zhichao
    Ma, Jinwen
    Wen, Shilei
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12265 - 12272
  • [37] Multi-Label Confusion Tensor
    Krstinic, Damir
    Skelin, Ana Kuzmanic
    Slapnicar, Ivan
    Braovic, Maja
    IEEE ACCESS, 2024, 12 : 9860 - 9870
  • [38] On the generation of multi-label prototypes
    Bello, Marilyn
    Napoles, Gonzalo
    Vanhoof, Koen
    Bello, Rafael
    INTELLIGENT DATA ANALYSIS, 2020, 24 (S1) : S167 - S183
  • [39] On the Stratification of Multi-label Data
    Sechidis, Konstantinos
    Tsoumakas, Grigorios
    Vlahavas, Ioannis
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT III, 2011, 6913 : 145 - 158
  • [40] Compact Multi-Label Learning
    Shen, Xiaobo
    Liu, Weiwei
    Tsang, Ivor W.
    Sun, Quan-Sen
    Ong, Yew-Soon
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 4066 - 4073