Imbalance: Oversampling algorithms for imbalanced classification in R

Cited by: 59
Authors
Cordon, Ignacio [1]
Garcia, Salvador [1]
Fernandez, Alberto [1]
Herrera, Francisco [1]
Affiliations
[1] Univ Granada, DaSCI Andalusian Inst Data Sci & Computat Intelli, Granada, Spain
Keywords
Oversampling; Imbalanced classification; Machine learning; Preprocessing; SMOTE; Software
DOI
10.1016/j.knosys.2018.07.035
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Addressing imbalanced datasets in classification tasks is a relevant topic in research studies. The main reason is that, for standard classification algorithms, the success rate when identifying minority class instances may be adversely affected. Among the different solutions to cope with this problem, data-level techniques have shown robust behavior. In this paper, the novel imbalance package is introduced. Written in R and C++, and available from the CRAN repository, this library includes recent relevant oversampling algorithms to improve the quality of data in imbalanced datasets prior to performing a learning task. The main features of the package, as well as some illustrative examples of its use, are detailed throughout this manuscript. © 2018 Elsevier B.V. All rights reserved.
Pages: 329 - 341
Number of pages: 13
Related papers
50 in total
  • [31] FIAO: Feature Information Aggregation Oversampling for imbalanced data classification
    Wang, Fei
    Zheng, Ming
    Hu, Xiaowen
    Li, Hongchao
    Wang, Taochun
    Chen, Fulong
    APPLIED SOFT COMPUTING, 2024, 161
  • [32] Oversampling Methods for Classification of Imbalanced Breast Cancer Malignancy Data
    Krawczyk, Bartosz
    Jelen, Lukasz
    Krzyzak, Adam
    Fevens, Thomas
    COMPUTER VISION AND GRAPHICS, 2012, 7594 : 483 - 490
  • [33] Boundary Oversampling Based Graph Node Imbalance Classification Algorithm
    Wu, Tianhao
    Dong, Minggang
    Tan, Ruoqi
    Computer Engineering and Applications, 2024, 60 (13) : 92 - 101
  • [34] VCOS: A Novel Synergistic Oversampling Algorithm in Binary Imbalance Classification
    Zhang, Chunkai
    Zhou, Ting
    Deng, Yepeng
    IEEE ACCESS, 2019, 7 : 145435 - 145443
  • [35] A novel oversampling and feature selection hybrid algorithm for imbalanced data classification
    Feng, Fang
    Li, Kuan-Ching
    Yang, Erfu
    Zhou, Qingguo
    Han, Lihong
    Hussain, Amir
    Cai, Mingjiang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (03) : 3231 - 3267
  • [36] A Novel Rule-Based Oversampling Approach for Imbalanced Data Classification
    Zhang, Xiao
    Paz, Ivan
    Nebot, Angela
    37TH ANNUAL EUROPEAN SIMULATION AND MODELLING CONFERENCE 2023, ESM 2023, 2023 : 208 - 212
  • [37] Classification of Imbalanced Data by Oversampling in Kernel Space of Support Vector Machines
    Mathew, Josey
    Pang, Chee Khiang
    Luo, Ming
    Leong, Weng Hoe
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (09) : 4065 - 4076
  • [38] Imbalanced Classification via Feature Dictionary-Based Minority Oversampling
    Park, Minho
    Song, Hwa Jeon
    Kang, Dong-Oh
    IEEE ACCESS, 2022, 10 : 34236 - 34245
  • [39] Grouping-based Oversampling in Kernel Space for Imbalanced Data Classification
    Ren, Jinjun
    Wang, Yuping
    Cheung, Yiu-ming
    Gao, Xiao-Zhi
    Guo, Xiaofang
    PATTERN RECOGNITION, 2023, 133
  • [40] Efficient hybrid oversampling and intelligent undersampling for imbalanced big data classification
    Vairetti, Carla
    Assadi, Jose Luis
    Maldonado, Sebastian
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 246