Prediction of Vacuum Ultraviolet/Ultraviolet Gas-Phase Absorption Spectra Using Molecular Feature Representations and Machine Learning

被引:1
|
作者
Ho Manh, Linh [1 ]
Chen, Victoria C. P. [1 ]
Rosenberger, Jay [1 ]
Wang, Shouyi [1 ]
Yang, Yujing [1 ]
Schug, Kevin A. [2 ]
机构
[1] Univ Texas Arlington, Dept Ind Mfg & Syst Engn, Arlington, TX 76019 USA
[2] Univ Texas Arlington, Dept Chem & Biochem, Arlington, TX 76019 USA
基金
美国国家科学基金会;
关键词
ULTRAVIOLET SPECTROSCOPY; DECISION TREES; CHROMATOGRAPHY; CHEMISTRY;
D O I
10.1021/acs.jcim.4c00676
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Ultraviolet (UV) absorption spectroscopy is a widely used tool for quantitative and qualitative analyses of chemical compounds. In the gas phase, vacuum UV (VUV) and UV absorption spectra are specific and diagnostic for many small molecules. An accurate prediction of VUV/UV absorption spectra can aid the characterization of new or unknown molecules in areas such as fuels, forensics, and pharmaceutical research. An alternative to quantum chemical spectral prediction is the use of artificial intelligence. Here, different molecular feature representation techniques were used and developed to encode chemical structures for testing three machine learning models to predict gas-phase VUV/UV absorption spectra. Structure data files (.sdf) and VUV/UV absorption spectra for 1397 volatile and semivolatile chemical compounds were used to train and test the models. New molecular features (termed ABOCH) were introduced to better capture pi-bonding, aromaticity, and halogenation. The incorporation of these new features benefited spectral prediction and demonstrated superior performance compared to computationally intensive molecular-based deep learning methods. Of the machine learning methods, the use of a Random Forest regressor returned the best accuracy score with the shortest training time. The developed machine learning prediction model also outperformed spectral predictions based on the time-dependent density functional theory.
引用
收藏
页码:5547 / 5556
页数:10
相关论文
共 50 条
  • [1] GAS-PHASE ULTRAVIOLET PHOTOELECTRON-SPECTRA OF HYDROXYANTHRAQUINONES
    MILLEFIORI, S
    MILLEFIORI, A
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 1988, 44 (01): : 17 - 21
  • [2] ULTRAVIOLET-ABSORPTION SPECTRUM OF HCNO IN GAS-PHASE
    SHEASLEY, WD
    MATHEWS, CW
    JOURNAL OF MOLECULAR SPECTROSCOPY, 1972, 43 (03) : 467 - &
  • [3] Efficient gas-phase generation of coherent vacuum ultraviolet radiation
    Merriam, AJ
    Sharpe, SJ
    Xia, H
    Manuszak, D
    Yin, GY
    Harris, SE
    OPTICS LETTERS, 1999, 24 (09) : 625 - 627
  • [4] THE DETERMINATION OF NITRITE BY ULTRAVIOLET-ABSORPTION SPECTROMETRY IN THE GAS-PHASE
    SYTY, A
    SIMMONS, RA
    ANALYTICA CHIMICA ACTA, 1980, 120 (NOV) : 163 - 170
  • [5] ULTRAVIOLET PHOTO-ELECTRON SPECTRA OF GAS-PHASE LANTHANIDE CHLORIDES
    LEE, EPF
    POTTS, AW
    JOURNAL OF MOLECULAR STRUCTURE, 1982, 79 (1-4) : 305 - 308
  • [6] GAS-PHASE ULTRAVIOLET PHOTOELECTRON-SPECTRA OF AROMATIC AZOMETHINE COMPOUNDS
    FRAGALA, I
    MILLEFIORI, S
    RECCA, A
    JOURNAL OF CHEMICAL RESEARCH-S, 1980, (01): : 28 - 29
  • [7] OBSERVATION OF THE ULTRAVIOLET-ABSORPTION SPECTRUM OF PHENYL RADICAL IN THE GAS-PHASE
    IKEDA, N
    NAKASHIMA, N
    YOSHIHARA, K
    JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1985, 107 (11) : 3381 - 3382
  • [8] Quantitative Characterization of the Molecular Absorption Spectra in the Ultraviolet Region by Gas Chromatography
    Lagesson-Andrasko, Ludmila
    Lagesson, Verner
    Andrasko, Jan
    ANALYTICAL LETTERS, 2018, 51 (13) : 2073 - 2084
  • [9] Photofragmentation of gas-phase acetic acid and acetamide clusters in the vacuum ultraviolet region
    Berholts, Marta
    Myllynen, Hanna
    Kooser, Kuno
    Itala, Eero
    Granroth, Sari
    Levola, Helena
    Laksman, Joakim
    Oghbaiee, Shabnam
    Oostenrijk, Bart
    Nommiste, Ergo
    Kukk, Edwin
    JOURNAL OF CHEMICAL PHYSICS, 2017, 147 (19):
  • [10] FAR-ULTRAVIOLET ABSORPTION SPECTRA OF PHOSPHORUS COMPOUNDS IN GAS PHASE
    HALMANN, M
    JOURNAL OF THE CHEMICAL SOCIETY, 1963, (MAY): : 2853 - &