Classification of high-speed gas chromatography-mass spectrometry data by principal component analysis coupled with piecewise alignment and feature selection

被引:33
|
作者
Watson, Nathanial E.
VanWingerden, Matthew M.
Pierce, Karisa M.
Wright, Bob W.
Synovec, Robert E.
机构
[1] Univ Washington, Dept Chem, Seattle, WA 98195 USA
[2] Pacific NW Natl Lab, Richland, WA 99352 USA
[3] US Mil Acad, Dept Chem & Life Sci, West Point, NY 10996 USA
关键词
alignment; gas chromatography; feature selection; principal component analysis; fuel; chemometrics; GC-MS;
D O I
10.1016/j.chroma.2006.06.087
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
A useful methodology is introduced for the analysis of data obtained via gas chromatography with mass spectrometry (GC-MS) utilizing a complete mass spectrum at each retention time interval in which a mass spectrum was collected. Principal component analysis (PCA) with preprocessing by both piecewise retention time alignment and analysis of variance (ANOVA) feature selection is applied to all mass channels collected. The methodology involves concatenating all concurrently measured individual m/z chromatograms from m/z 20 to 120 for each GC-MS separation into a row vector. All of the sample row vectors are incorporated into a matrix where each row is a sample vector. This matrix is piecewise aligned and reduced by ANOVA feature selection. Application of the preprocessing steps (retention time alignment and feature selection) to all mass channels collected during the chromatographic separation allows considerably more selective chemical information to be incorporated in the PCA classification, and is the primary novelty of the report. This methodology is objective and requires no knowledge of the specific analytes of interest, as in selective ion monitoring (SIM), and does not restrict the mass spectral data used, as in both SIM and total ion current (TIC) methods. Significantly, the methodology allows for the classification of data with low resolution in the chromatographic dimension because of the added selectivity from the complete mass spectral dimension. This allows for the successful classification of data over significantly decreased chromatographic separation times, since high-speed separations can be employed. The methodology is demonstrated through the analysis of a set of four differing gasoline samples that serve as model complex samples. For comparison, the gasoline samples are analyzed by GC-MS over both 10-min and 10-s separation times. The successfully classified 10-min GC-MS TIC data served as the benchmark analysis to compare to the 10-s data. When only alignment and feature selection was applied to the 10-s gasoline separations using GC-MS TIC data, PCA failed. PCA was successful for 10-s gasoline separations when the methodology was applied with all the m/z information. With ANOVA feature selection, chromatographic regions with Fisher ratios greater than 1500 were retained in a new matrix and subjected to PCA yielding successful classification for the 10-s separations. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:111 / 118
页数:8
相关论文
共 50 条
  • [1] Classification of gasoline data obtained by gas chromatography using a piecewise alignment algorithm combined with feature selection and principal component analysis
    Pierce, KM
    Hope, JL
    Johnson, KJ
    Wright, BW
    Synovec, RE
    JOURNAL OF CHROMATOGRAPHY A, 2005, 1096 (1-2) : 101 - 110
  • [2] High-speed, low-pressure gas chromatography-mass spectrometry for essential oil analysis
    Poynter, Samuel D. H.
    Shellie, Robert A.
    JOURNAL OF CHROMATOGRAPHY A, 2008, 1200 (01) : 28 - 33
  • [3] Alignment of gas chromatography-mass spectrometry data by landmark selection from complex chemical mixtures
    Krebs, MD
    Tingley, RD
    Zeskind, JE
    Holmboe, ME
    Kang, JM
    Davis, CE
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2006, 81 (01) : 74 - 81
  • [4] Principal component analysis for resolving coeluting substances in gas chromatography-mass spectrometry doping control analysis
    Statheropoulos, M
    Smaragdis, E
    Tzamtzis, N
    Georgakopoulos, C
    ANALYTICA CHIMICA ACTA, 1996, 331 (1-2) : 53 - 61
  • [5] Peak alignment of gas chromatography-mass spectrometry data with deep learning
    Li, Mike
    Wang, X. Rosalind
    JOURNAL OF CHROMATOGRAPHY A, 2019, 1604
  • [6] Classification of natural resins by liquid chromatography-mass spectrometry and gas chromatography-mass spectrometry using chemometric analysis
    Rhourrhi-Frih, B.
    West, C.
    Pasquier, L.
    Andre, P.
    Chaimbault, P.
    Lafosse, M.
    JOURNAL OF CHROMATOGRAPHY A, 2012, 1256 : 177 - 190
  • [7] Feature Selection and Analysis of Jet Fuel by Comprehensive Two-Dimensional Gas Chromatography-Mass Spectrometry
    Zhang J.-S.
    Zhang G.-N.
    Wang W.
    Zhang Y.
    Zhu J.
    Jiang H.
    Liu Z.-F.
    Journal of Chinese Mass Spectrometry Society, 2021, 42 (03): : 294 - 304
  • [8] Assessment of oil weathering by as chromatography-mass spectrometry, time warping and principal component analysis
    Malmquist, Linus M. V.
    Olsen, Rasmus R.
    Hansen, Asger B.
    Andersen, Ole
    Christensen, Jan H.
    JOURNAL OF CHROMATOGRAPHY A, 2007, 1164 (1-2) : 262 - 270
  • [9] Detection of batch effects in liquid chromatography-mass spectrometry metabolomic data using guided principal component analysis
    Kuligowski, J.
    Perez-Guaita, D.
    Lliso, I.
    Escobar, J.
    Leon, Z.
    Gombau, L.
    Solberg, R.
    Saugstad, O. D.
    Vento, M.
    Quintas, G.
    TALANTA, 2014, 130 : 442 - 448
  • [10] HIGH-SPEED ANALYSIS OF UNDERIVATIZED DRUGS BY GAS-CHROMATOGRAPHY MASS-SPECTROMETRY
    KOKANOVICH, JD
    SIMONICK, TF
    WATTS, VW
    HRC-JOURNAL OF HIGH RESOLUTION CHROMATOGRAPHY, 1989, 12 (01): : 45 - 48