Comparison of Gaussian process regression, partial least squares, random forest and support vector machines for a near infrared calibration of paracetamol samples

被引:4
|
作者
Sow, Aminata [1 ]
Traore, Issiaka [1 ]
Diallo, Tidiane [2 ,3 ]
Traore, Mohamed [4 ]
Ba, Abdramane [1 ]
机构
[1] Univ Sci Tech & Technol Bamako, Fac Sci & Tech FST, Lab Opt Spect & Sci Atmospher LOSSA, Bamako, Mali
[2] Univ Sci Tech & Technol Bamako, Fac Pharm, Dept Sci Medicament, Bamako, Mali
[3] Lab Natl Sante LNS, Bamako, Mali
[4] Ecole Natl Ingn Abderhamane Baba Toure, Bamako, Mali
关键词
Paracetamol; Near Infrared Spectroscopy; Data preprocessing; Nonlinear regression models; Linear regression techniques; COMPONENTS; TABLETS;
D O I
10.1016/j.rechem.2022.100508
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In this article, we analyze the near-infrared (NIR) spectra of fifty-eight (58) commercial tablets of 500 mg of paracetamol from different origins (that is, with different batch numbers) in the local markets in Bamako. The NIR spectra were recorded in the spectral range 930 nm-1700 nm. The samples are divided into forty-eight (48) samples forming the set of calibration (training set) and ten (10) samples used as the validation or test set. To perform multivariate calibration, we apply-three nonlinear regression techniques (Gaussian processes regression (GPR), Random Forest (RF), Support vector machine (KSVM)), along with the traditional linear partial leastsquares regression (PLSR) to several data pretreatments of the 58 samples. The results show that the three nonlinear regression calibrations have better prediction performance than PLS as far as RMSE is concerned. To decide the best regression model, we avoid R2 since this quantity is not a good parameter for this purpose. We will instead consider RMSE when comparing the different multivariate models. Additionally, to assess the impact of data preprocessing, we apply the above regression techniques to the original data, Multi-scattering correction (MSC), standard variate normalization (SNV) correction, smoothing correction, first derivative (FD), and second derivative correction (SD). The overall results reveal that Gaussian Processes Regression (GPR) applied to smooth correction gives the lowest RMSEP = 2.303053e-06 for validation (prediction) and RMSEC = 2.112316e-06 for calibration. In our investigation, one also notices that the developed GPR model is more accurate and exhibits enhanced behavior no matter which data preprocessing is used. All in all, GPR can be seen as an alternative powerful regression tool for NIR spectra of paracetamol samples. The statistical parameters of the proposed model are compared to the results of some other models reported in the literature.
引用
收藏
页数:7
相关论文
共 50 条
  • [21] Identification of chaotic process systems with least squares support vector machines
    Jemwa, GT
    Aldrich, C
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 2066 - 2071
  • [22] Mutual conversion of regression and classification based on least squares support vector machines
    Jiang, Jing-Qing
    Song, Chu-Yi
    Wu, Chun-Guo
    Liang, Yang-Chun
    Yang, Xiao-Wei
    Hao, Zhi-Feng
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 1, 2006, 3971 : 1010 - 1015
  • [23] Multi-output least-squares support vector regression machines
    Xu, Shuo
    An, Xin
    Qiao, Xiaodong
    Zhu, Lijun
    Li, Lin
    PATTERN RECOGNITION LETTERS, 2013, 34 (09) : 1078 - 1084
  • [24] An Improved Active Learning Sparse Least Squares Support Vector Machines for Regression
    Si Gangquan
    Shi Jianquan
    Guo Zhang
    Gao Hong
    2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, : 4558 - 4562
  • [26] Twin Least Squares Support Vector Regression of Heteroscedastic Gaussian Noise Model
    Zhang, Shiguang
    Liu, Chao
    Zhou, Ting
    Sun, Lin
    IEEE ACCESS, 2020, 8 : 94076 - 94088
  • [27] Forecasting electricity consumption: A comparison of regression analysis, neural networks and least squares support vector machines
    Kaytez, Fazil
    Taplamacioglu, M. Cengiz
    Cam, Ertugrul
    Hardalac, Firat
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2015, 67 : 431 - 438
  • [28] Prediction of Ion Drift Times for a Proteome-Wide Peptide Set Using Partial Least Squares Regression, Least-Squares Support Vector Machine and Gaussian Process
    Liu, Xiuhong
    Liang, Jun
    Fan, Jicai
    Shang, Zhicai
    QSAR & COMBINATORIAL SCIENCE, 2009, 28 (11-12): : 1386 - 1393
  • [29] A comparison of pruning algorithms for sparse least squares support vector machines
    Hoegaerts, L
    Suykens, JAK
    Vandewalle, J
    De Moor, B
    NEURAL INFORMATION PROCESSING, 2004, 3316 : 1247 - 1253
  • [30] Nonlinear Calibration of Thermocouple Sensor Using Least Squares Support Vector Regression
    Yu, Yaojun
    2010 INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT (CCCM2010), VOL III, 2010, : 410 - 413