Data Augmentation of a Corrosion Dataset for Defect Growth Prediction of Pipelines Using Conditional Tabular Generative Adversarial Networks

被引:0
|
作者
Ma, Haonan [1 ]
Geng, Mengying [1 ]
Wang, Fan [1 ]
Zheng, Wenyue [1 ]
Ai, Yibo [1 ,2 ]
Zhang, Weidong [1 ]
机构
[1] Univ Sci & Technol Beijing, Natl Ctr Mat Serv Safety, Beijing 100083, Peoples R China
[2] Southern Marine Sci & Engn Guangdong Lab Zhuhai, Zhuhai 519082, Peoples R China
关键词
corrosion depth; data augmentation; CTGAN; corroded pipeline; machine learning; DYNAMIC-MODEL; DEPTH;
D O I
10.3390/ma17051142
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Due to corrosion characteristics, there are data scarcity and uneven distribution in corrosion datasets, and collecting high-quality data is time-consuming and sometimes difficult. Therefore, this work introduces a novel data augmentation strategy using a conditional tabular generative adversarial network (CTGAN) for enhancing corrosion datasets of pipelines. Firstly, the corrosion dataset is subjected to data cleaning and variable correlation analysis. The CTGAN is then used to generate external environmental factors as input variables for corrosion growth prediction, and a hybrid model based on machine learning is employed to generate corrosion depth as an output variable. The fake data are merged with the original data to form the synthetic dataset. Finally, the proposed data augmentation strategy is verified by analyzing the synthetic dataset using different visualization methods and evaluation indicators. The results show that the synthetic and original datasets have similar distributions, and the data augmentation strategy can learn the distribution of real corrosion data and sample fake data that are highly similar to the real data. Predictive models trained on the synthetic dataset perform better than predictive models trained using only the original dataset. In comparative tests, the proposed strategy outperformed other data generation methods.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Conditional Generative Adversarial Networks for Data Augmentation of a Neonatal Image Dataset
    Lyra, Simon
    Mustafa, Arian
    Rixen, Joeran
    Borik, Stefan
    Lueken, Markus
    Leonhardt, Steffen
    SENSORS, 2023, 23 (02)
  • [2] Conditional Generative Adversarial Networks with Adversarial Attack and Defense for Generative Data Augmentation
    Baek, Francis
    Kim, Daeho
    Park, Somin
    Kim, Hyoungkwan
    Lee, SangHyun
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2022, 36 (03)
  • [3] Data Augmentation using Conditional Generative Adversarial Networks for Robust Speech Recognition
    Sheng, Peiyao
    Yang, Zhuolin
    Hu, Hu
    Tan, Tian
    Qian, Yanmin
    2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 121 - 125
  • [4] Seismic Data Augmentation Based on Conditional Generative Adversarial Networks
    Li, Yuanming
    Ku, Bonhwa
    Zhang, Shou
    Ahn, Jae-Kwang
    Ko, Hanseok
    SENSORS, 2020, 20 (23) : 1 - 13
  • [5] Enhanced dataset synthesis using conditional generative adversarial networks
    Mert, Ahmet
    BIOMEDICAL ENGINEERING LETTERS, 2023, 13 (01) : 41 - 48
  • [6] Software Engineering for Dataset Augmentation using Generative Adversarial Networks
    Jahic, Benjamin
    Guelfi, Nicolas
    Ries, Benoit
    PROCEEDINGS OF 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2019), 2019, : 59 - 66
  • [7] Enhanced dataset synthesis using conditional generative adversarial networks
    Ahmet Mert
    Biomedical Engineering Letters, 2023, 13 : 41 - 48
  • [8] Auxiliary Conditional Generative Adversarial Networks for Image Data Set Augmentation
    Mudavathu, Kalpana Devi Bai
    Rao, V. P. Chandra Sekhara
    Ramana, K., V
    PROCEEDINGS OF THE 2018 3RD INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2018), 2018, : 263 - 269
  • [9] Utilizing conditional generative adversarial networks for data augmentation in logging evaluation
    Qiao, Lu
    He, Taohua
    Liu, Xianglong
    He, Jiayi
    Zeng, Qianghao
    Zhao, Ya
    Yang, Shengyu
    Hu, Qinhorng
    PHYSICS OF FLUIDS, 2025, 37 (03)
  • [10] Conditional Generative Adversarial Networks for Data Augmentation in Breast Cancer Classification
    Wong, Weng San
    Amer, Mohammed
    Maul, Tomas
    Liao, Iman Yi
    Ahmed, Amr
    RECENT ADVANCES ON SOFT COMPUTING AND DATA MINING (SCDM 2020), 2020, 978 : 392 - 402