Comparison of Gaussian process regression, partial least squares, random forest and support vector machines for a near infrared calibration of paracetamol samples

被引:4
|
作者
Sow, Aminata [1 ]
Traore, Issiaka [1 ]
Diallo, Tidiane [2 ,3 ]
Traore, Mohamed [4 ]
Ba, Abdramane [1 ]
机构
[1] Univ Sci Tech & Technol Bamako, Fac Sci & Tech FST, Lab Opt Spect & Sci Atmospher LOSSA, Bamako, Mali
[2] Univ Sci Tech & Technol Bamako, Fac Pharm, Dept Sci Medicament, Bamako, Mali
[3] Lab Natl Sante LNS, Bamako, Mali
[4] Ecole Natl Ingn Abderhamane Baba Toure, Bamako, Mali
关键词
Paracetamol; Near Infrared Spectroscopy; Data preprocessing; Nonlinear regression models; Linear regression techniques; COMPONENTS; TABLETS;
D O I
10.1016/j.rechem.2022.100508
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In this article, we analyze the near-infrared (NIR) spectra of fifty-eight (58) commercial tablets of 500 mg of paracetamol from different origins (that is, with different batch numbers) in the local markets in Bamako. The NIR spectra were recorded in the spectral range 930 nm-1700 nm. The samples are divided into forty-eight (48) samples forming the set of calibration (training set) and ten (10) samples used as the validation or test set. To perform multivariate calibration, we apply-three nonlinear regression techniques (Gaussian processes regression (GPR), Random Forest (RF), Support vector machine (KSVM)), along with the traditional linear partial leastsquares regression (PLSR) to several data pretreatments of the 58 samples. The results show that the three nonlinear regression calibrations have better prediction performance than PLS as far as RMSE is concerned. To decide the best regression model, we avoid R2 since this quantity is not a good parameter for this purpose. We will instead consider RMSE when comparing the different multivariate models. Additionally, to assess the impact of data preprocessing, we apply the above regression techniques to the original data, Multi-scattering correction (MSC), standard variate normalization (SNV) correction, smoothing correction, first derivative (FD), and second derivative correction (SD). The overall results reveal that Gaussian Processes Regression (GPR) applied to smooth correction gives the lowest RMSEP = 2.303053e-06 for validation (prediction) and RMSEC = 2.112316e-06 for calibration. In our investigation, one also notices that the developed GPR model is more accurate and exhibits enhanced behavior no matter which data preprocessing is used. All in all, GPR can be seen as an alternative powerful regression tool for NIR spectra of paracetamol samples. The statistical parameters of the proposed model are compared to the results of some other models reported in the literature.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Nonlinear Calibration of Thermocouple Sensor Using Least Squares Support Vector Regression
    Yu, Yaojun
    MANUFACTURING SCIENCE AND MATERIALS ENGINEERING, PTS 1 AND 2, 2012, 443-444 : 302 - 308
  • [32] Nonlinear Calibration of Thermocouple Sensor Based on Least Squares Support Vector Regression
    Zhang, Shengbo
    Dai, Qingling
    PROCEEDINGS OF THE 2015 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND AUTOMATION ENGINEERING, 2016, 42 : 5 - 10
  • [33] Least squares support vector regression for prediction of peak samples in time series
    Yuan, Cong-Gui
    Zhang, Xin-Zheng
    Kongzhi yu Juece/Control and Decision, 2012, 27 (11): : 1745 - 1750
  • [34] Application of Least Squares Support Vector Machines for Discrimination of Red Wine Using Visible and Near Infrared Spectroscopy
    Liu, Fei
    Wang, Li
    He, Yong
    2008 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEM AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2008, : 1002 - 1006
  • [35] Least-squares support vector machines and near infrared spectroscopy for quantification of common adulterants in powdered milk
    Borin, Alessandra
    Ferrao, Marco Flores
    Mello, Cesar
    Maretto, Danilo Althmann
    Poppi, Ronei Jesus
    ANALYTICA CHIMICA ACTA, 2006, 579 (01) : 25 - 32
  • [36] Forest coverage prediction based on least squares support vector regression algorithm
    Xiao, Fang
    TRENDS IN CIVIL ENGINEERING, PTS 1-4, 2012, 446-449 : 2978 - 2982
  • [37] Dynamic Nonlinear Partial Least Squares Modeling Using Gaussian Process Regression
    Liu, Hongbin
    Yang, Chong
    Carlsson, Bengt
    Qin, S. Joe
    Yoo, ChangKyoo
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2019, 58 (36) : 16676 - 16686
  • [38] Optimizing the tuning parameters of least squares support vector machines regression for NIR spectra
    Coen, T.
    Saeys, W.
    Ramon, H.
    De Baerdemaeker, J.
    JOURNAL OF CHEMOMETRICS, 2006, 20 (05) : 184 - 192
  • [39] Partial least squares regression calibration for determining wax content in processed flax fiber by near-infrared spectroscopy
    Sohn, M
    Himmelsbach, DS
    Morrison, WH
    Akin, DE
    Barton, FE
    APPLIED SPECTROSCOPY, 2006, 60 (04) : 437 - 440
  • [40] Rapid analysis of the Tanreqing injection by near-infrared spectroscopy combined with least squares support vector machine and Gaussian process modeling techniques
    Li, Wenlong
    Yan, Xu
    Pan, Jianchao
    Liu, Shaoyong
    Xue, Dongsheng
    Qu, Haibin
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2019, 218 : 271 - 280