Ensemble based adaptive over-sampling method for imbalanced data learning in computer aided detection of microaneurysm

被引:45
|
作者
Ren, Fulong [1 ,2 ]
Cao, Peng [1 ,2 ]
Li, Wei [2 ]
Zhao, Dazhe [1 ,2 ]
Zaiane, Osmar [3 ]
机构
[1] Northeastern Univ, Coll Comp Sci & Engn, Shenyang, Peoples R China
[2] Northeastern Univ, Key Lab Med Image Comp, Minist Educ, Shenyang, Peoples R China
[3] Univ Alberta, Comp Sci, Edmonton, AB, Canada
基金
中国国家自然科学基金;
关键词
Microaneurysm detection; Classification; False positive reduction; Imbalanced data learning; Ensemble learning; AUTOMATIC DETECTION; MACHINE;
D O I
10.1016/j.compmedimag.2016.07.011
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Diabetic retinopathy (DR) is a progressive disease, and its detection at an early stage is crucial for saving a patient's vision. An automated screening system for DR can help in reduce the chances of complete blindness due to DR along with lowering the work load on ophthalmologists. Among the earliest signs of DR are microaneurysms (MAs). However, current schemes for MA detection appear to report many false positives because detection algorithms have high sensitivity. Inevitably some non-MAs structures are labeled as MAs in the initial MAs identification step. This is a typical "class imbalance problem". Class imbalanced data has detrimental effects on the performance of conventional classifiers. In this work, we propose an ensemble based adaptive over-sampling algorithm for overcoming the class imbalance problem in the false positive reduction, and we use Boosting, Bagging, Random subspace as the ensemble framework to improve microaneurysm detection. The ensemble based over-sampling methods we proposed combine the strength of adaptive over-sampling and ensemble. The objective of the amalgamation of ensemble and adaptive over-sampling is to reduce the induction biases introduced from imbalanced data and to enhance the generalization classification performance of extreme learning machines (ELM). Experimental results show that our ASOBoost method has higher area under the ROC curve (AUC) and G-mean values than many existing class imbalance learning methods. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:54 / 67
页数:14
相关论文
共 50 条
  • [41] Unbalanced data classification based on over-sampling and integrated learning
    Zhang, Yongjun
    Jian, Xiaowen
    2021 ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS TECHNOLOGY AND COMPUTER SCIENCE (ACCTCS 2021), 2021, : 332 - 337
  • [42] A New Over-Sampling Approach: Random-SMOTE for Learning from Imbalanced Data Sets
    Dong, Yanjie
    Wang, Xuehua
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, 2011, 7091 : 343 - 352
  • [43] Weighted Ensemble with one-class Classification and Over-sampling and Instance selection (WECOI): An approach for learning from imbalanced data streams
    Czarnowski, Ireneusz
    JOURNAL OF COMPUTATIONAL SCIENCE, 2022, 61
  • [44] Cluster-Based Minority Over-Sampling for Imbalanced Datasets
    Puntumapon, Kamthorn
    Rakthamamon, Thanawin
    Waiyamai, Kitsana
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (12): : 3101 - 3109
  • [45] Affine combination-based over-sampling for imbalanced regression
    Li, Zhen-Zhen
    Huang, Niu
    Yi, Lun-Zhao
    Fu, Guang-Hui
    JOURNAL OF CHEMOMETRICS, 2024, 38 (03)
  • [46] Online Sequential Extreme Learning Machine with Under-Sampling and Over-Sampling for Imbalanced Big Data Classification
    Du, Jie
    Vong, Chi-Man
    Chang, Yajie
    Jiao, Yang
    PROCEEDINGS OF ELM-2016, 2018, 9 : 229 - 239
  • [48] Imbalanced Data Classification Method Based on Ensemble Learning
    Xiang, Yu
    Xie, Yongping
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, CSPS 2018, VOL III: SYSTEMS, 2020, 517 : 18 - 24
  • [49] Adaptive weighted over-sampling for imbalanced datasets based on density peaks clustering with heuristic filtering
    Tao, Xinmin
    Li, Qing
    Guo, Wenjie
    Ren, Chao
    He, Qing
    Liu, Rui
    Zou, JunRong
    INFORMATION SCIENCES, 2020, 519 : 43 - 73
  • [50] Diversity and Separable Metrics in Over-Sampling Technique for Imbalanced Data Classification
    Mahmoudi, Shadi
    Moradi, Parham
    Akhlaghian, Fardin
    Moradi, Rizan
    2014 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2014, : 152 - 158