A machine learning approach for corrosion small datasets

被引:52
|
作者
Sutojo, Totok [1 ,2 ]
Rustad, Supriadi [1 ,2 ]
Akrom, Muhamad [1 ,3 ]
Syukur, Abdul [2 ]
Shidik, Guruh Fajar [2 ]
Dipojono, Hermawan Kresno [3 ]
机构
[1] Dian Nuswantoro Univ, Fac Comp Sci, Res Ctr Mat Informat, Semarang 50131, Indonesia
[2] Dian Nuswantoro Univ, Fac Comp Sci, Doctoral Program Comp Sci, Semarang 50131, Indonesia
[3] Bandung Inst Technol, Adv Funct Mat Res Grp, Bandung 40132, Indonesia
关键词
INHIBITION EFFICIENCY; BENZIMIDAZOLE DERIVATIVES; GAS-INDUSTRY; MILD-STEEL; PREDICTION; DESIGN; MODEL; OIL;
D O I
10.1038/s41529-023-00336-7
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In this work, we developed a QSAR model using the K-Nearest Neighbor (KNN) algorithm to predict the corrosion inhibition performance of the inhibitor compound. To overcome the small dataset problems, virtual samples are generated and added to the training set using a Virtual Sample Generation (VSG) method. The generalizability of the proposed KNN + VSG model is verified by using six small datasets from references and comparing their prediction performances. The research shows that for the six datasets, the proposed model is able to make predictions with the best accuracy. Adding virtual samples to the training data helps the algorithm recognize feature-target relationship patterns, and therefore increases the number of chemical quantum parameters correlated with corrosion inhibition efficiency. This proposed method strengthens the prospect of ML for developing material designs, especially in the case of small datasets.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Particulate matter estimation using satellite datasets: a machine learning approach
    Verma, Sunita
    Sharma, Ajay
    Payra, Swagata
    Chaudhary, Neelam
    Mishra, Manoj
    Environmental Science and Pollution Research, 31 (58): : 66372 - 66387
  • [22] A machine learning model with human cognitive biases capable of learning from small and biased datasets
    Hidetaka Taniguchi
    Hiroshi Sato
    Tomohiro Shirakawa
    Scientific Reports, 8
  • [23] A machine learning model with human cognitive biases capable of learning from small and biased datasets
    Taniguchi, Hidetaka
    Sato, Hiroshi
    Shirakawa, Tomohiro
    SCIENTIFIC REPORTS, 2018, 8
  • [24] Making a big impact with small datasets using machine-learning approaches
    Choi, May Y.
    Ma, Christopher
    LANCET RHEUMATOLOGY, 2020, 2 (08): : E451 - E452
  • [25] Time-series failure prediction on small datasets using machine learning
    Maior, Caio B. S.
    Silva, Thaylon G.
    IEEE LATIN AMERICA TRANSACTIONS, 2024, 22 (05) : 362 - 371
  • [26] Effects of information and machine learning algorithms on word sense disambiguation with small datasets
    Leroy, G
    Rindflesch, TC
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2005, 74 (7-8) : 573 - 585
  • [27] A comprehensive machine learning framework for the prediction of therapeutic peptides from small datasets
    Otovic, Erik
    Kalafatovic, Daniela
    Mausa, Goran
    JOURNAL OF PEPTIDE SCIENCE, 2024, 30
  • [28] ILA4: Overcoming missing values in machine learning datasets - An inductive learning approach
    Elhassan, Ammar
    Abu-Soud, Saleh M.
    Alghanim, Firas
    Salameh, Walid
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (07) : 4284 - 4295
  • [29] Internet Traffic Classification using Machine Learning Approach: Datasets Validation Issues
    Ibrahim, Hamza Awad Hamza
    AL Zuobi, Omer Radhi Aqeel
    Al-Namari, Marwan A.
    MohamedAli, Gaafer
    Abdalla, Ali Ahmed Alfaki
    2016 CONFERENCE OF BASIC SCIENCES AND ENGINEERING STUDIES (SCGAC), 2016, : 158 - 166
  • [30] A voting-based machine learning approach for classifying biological and clinical datasets
    Negar Hossein-Nezhad Daneshvar
    Yosef Masoudi-Sobhanzadeh
    Yadollah Omidi
    BMC Bioinformatics, 24