Gaussian Sampling Approach to deal with Imbalanced Telemetry Datasets in Industrial Applications

Cited by: 0
Authors
Galve, Sergio [1]
Puig, Vicenc [2]
Vilajosana, Xavi [1]
Affiliations
[1] Univ Oberta Catalunya, Wireless Networks Res Lab, Castelldefels 08860, Barcelona, Spain
[2] UPC, CSIC, Inst Robot & Informat Ind, Barcelona, Spain
DOI
10.1109/MED59994.2023.10185829
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Practical implementation of data analytics in industrial environments has long been problematic because of data availability and quality issues. This paper proposes a Gaussian sampling methodology to address imbalanced telemetry datasets, one of the root causes of unreliable modelling. The method generates subsets with homogeneous density distributions, and its impact is compared against the baseline case of random sampling. A case study based on an industrial cooling device is used to assess and illustrate the proposed approach.
Pages: 605-611
Number of pages: 7