Gaussian Sampling Approach to deal with Imbalanced Telemetry Datasets in Industrial Applications

被引:0
|
作者
Galve, Sergio [1 ]
Puig, Vicenc [2 ]
Vilajosana, Xavi [1 ]
机构
[1] Univ Oberta Catalunya, Wireless Networks Res Lab, Castelldefels 08860, Barcelona, Spain
[2] UPC, CSIC, Inst Robot & Informat Ind, Barcelona, Spain
关键词
D O I
10.1109/MED59994.2023.10185829
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Practical implementation of data analytics in industrial environments has always been a problematic area because of data availability and quality. In this paper, a Gaussian sampling methodology is proposed to address the problem of imbalanced telemetry datasets that is one of the root causes that make modelling less reliable. By generating subsets that achieve homogeneous density distributions this problem is addressed. By comparing the impact of this method with the baseline case of random sampling, this paper aims to address this problem and propose a practical solution. A case study based on an industrial cooling device is used to assess and illustrate the proposed approach.
引用
收藏
页码:605 / 611
页数:7
相关论文
共 50 条
  • [41] Combination Approach of SMOTE and Biased-SVM for Imbalanced Datasets
    Wang He-Yong
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 228 - 231
  • [42] A hybrid under-sampling approach for mining unbalanced datasets: applications to banking and insurance
    Vasu, Madireddi
    Ravi, Vadlamani
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2011, 3 (01) : 75 - 105
  • [43] Gaussian sampling of lattices for cryptographic applications
    YuPu Hu
    Hao Lei
    FengHe Wang
    WenZheng Zhang
    Science China Information Sciences, 2014, 57 : 1 - 8
  • [44] Gaussian sampling of lattices for cryptographic applications
    HU YuPu
    LEI Hao
    WANG FengHe
    ZHANG WenZheng
    Science China(Information Sciences), 2014, 57 (07) : 154 - 161
  • [45] Gaussian sampling of lattices for cryptographic applications
    Hu YuPu
    Lei Hao
    Wang FengHe
    Zhang WenZheng
    SCIENCE CHINA-INFORMATION SCIENCES, 2014, 57 (07) : 1 - 8
  • [46] Cost-Sensitive Learning and Threshold-Moving Approach to Improve Industrial Lots Release Process on Imbalanced Datasets
    Lobo, Armindo
    Oliveira, Pedro
    Sampaio, Paulo
    Novais, Paulo
    19TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 2023, 583 : 280 - 290
  • [47] Applications of Autonomous Learning Multi Model System to Multiclass Imbalanced Datasets
    Seabra, Andre
    Ventura, Rodrigo
    Almeida, Rui Jorge
    Vieira, Susana
    Sousa, Joao M. C.
    2024 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, FUZZ-IEEE 2024, 2024,
  • [48] Adaptive over-sampling method for classification with application to imbalanced datasets in aluminum electrolysis
    Huang, Zhaoke
    Yang, Chunhua
    Chen, Xiaofang
    Huang, Keke
    Xie, Yongfang
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (11): : 7183 - 7199
  • [49] An Over-sampling Method Based on Probability Density Estimation for Imbalanced Datasets Classification
    Cao, Lu
    Zhai, Yi-Kui
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION PROCESSING (ICIIP'16), 2016,
  • [50] Adaptive over-sampling method for classification with application to imbalanced datasets in aluminum electrolysis
    Zhaoke Huang
    Chunhua Yang
    Xiaofang Chen
    Keke Huang
    Yongfang Xie
    Neural Computing and Applications, 2020, 32 : 7183 - 7199