A Novel Imbalanced Data Classification Method Based on Weakly Supervised Learning for Fault Diagnosis

被引:43
|
作者
Liu, Hui [1 ]
Liu, Zhenyu [1 ]
Jia, Weiqiang [1 ,2 ]
Zhang, Donghao [1 ]
Tan, Jianrong [1 ]
机构
[1] Zhejiang Univ, State Key Lab Comp Aided Design & Comp Graph, Hangzhou 310027, Peoples R China
[2] Zhejiang Lab, Hangzhou 311121, Peoples R China
基金
中国国家自然科学基金;
关键词
Fault diagnosis; Supervised learning; Support vector machines; Classification algorithms; Informatics; Prognostics and health management; Prediction algorithms; Bidirectional gated recurrent units (BGRU); class imbalance; support vector machine (SVM); weakly supervised learning; SMOTE;
D O I
10.1109/TII.2021.3084132
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The class imbalance problem has a huge impact on the performance of diagnostic models. When it occurs, the minority samples are easily ignored by classification models. Besides, the distribution of class imbalanced data differs from the actual data distribution, which makes it difficult for classifiers to learn an accurate decision boundary. To tackle the above issues, this article proposes a novel imbalanced data classification method based on weakly supervised learning. First, Bagging algorithm is employed to sample majority data randomly to generate several relatively balanced subsets, which are then used to train several support vector machine (SVM) classifiers. Next, these trained SVM classifiers are adopted to predict the labels of those unlabeled data, and samples that are predicted as minority class are added to the original dataset to reduce the imbalance ratio. The critical idea of this article is to introduce real-world samples into the imbalanced dataset by virtue of weakly supervised learning. In addition, bidirectional gated recurrent units are used to construct a diagnostic model for fault diagnosis, and a new weighted cross-entropy function is proposed as the loss function to reduce the impact of noise. Besides, it also increases the model's attention to the original minority samples. Furthermore, experimental evaluations of the proposed method are conducted on two datasets, i.e., Prognostics and Health Management challenge 2008 and 2010 datasets, and the experimental results demonstrate the effectiveness and superiority of the proposed method.
引用
收藏
页码:1583 / 1593
页数:11
相关论文
共 50 条
  • [31] INTELLIGENT BEARING FAULT DIAGNOSIS METHOD BASED ON HNR ENVELOPE AND CLASSIFICATION USING SUPERVISED MACHINE LEARNING ALGORITHMS
    Ouachtouk, Ilias
    El Hani, Soumia
    Dahi, Khalid
    ADVANCES IN ELECTRICAL AND ELECTRONIC ENGINEERING, 2021, 19 (04) : 282 - 294
  • [32] Imbalanced Data Fault Diagnosis Based on an Evolutionary Online Sequential Extreme Learning Machine
    Hao, Wei
    Liu, Feng
    SYMMETRY-BASEL, 2020, 12 (08):
  • [33] A novel deep metric learning model for imbalanced fault diagnosis and toward open-set classification
    Wang, Cunjun
    Xin, Cun
    Xu, Zili
    KNOWLEDGE-BASED SYSTEMS, 2021, 220
  • [34] Imbalanced Data Classification Method Based on LSSASMOTE
    Wang, Zhi
    Liu, Qicheng
    IEEE ACCESS, 2023, 11 : 32252 - 32260
  • [35] A Novel Method for Imbalanced Fault Diagnosis of Rotating Machinery Based on Generative Adversarial Networks
    Li, Zhenxiang
    Zheng, Taisheng
    Wang, Yang
    Cao, Zhi
    Guo, Zhiqi
    Fu, Hongyong
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
  • [36] Supervised Class Distribution Learning for GANs-based Imbalanced Classification
    Cai, Zixin
    Wang, Xinyue
    Zhou, Mingjie
    Xu, Jian
    Jing, Liping
    2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 41 - 50
  • [37] A bearing fault diagnosis method based on semi-supervised and transfer learning
    Zhang Z.
    Liu J.
    Huang L.
    Zhang X.
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2019, 45 (11): : 2291 - 2300
  • [38] Weakly Supervised Classification of Hyperspectral Image Based on Complementary Learning
    Huang, Lingbo
    Chen, Yushi
    He, Xin
    REMOTE SENSING, 2021, 13 (24)
  • [39] Imbalanced Learning of Fault Data Combined with Cloud Model and Ensemble Classification
    Ma S.
    Zhao R.
    Wu Y.
    Zhendong Ceshi Yu Zhenduan/Journal of Vibration, Measurement and Diagnosis, 2023, 43 (06): : 1114 - 1120and1243
  • [40] Imbalanced Node Classification Algorithm Based on Self-Supervised Learning
    Cui, Caixia
    Wang, Jie
    Pang, Tianjie
    Liang, Jiye
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (11): : 955 - 964