Deep Learning-Based Method for Compound Identification in NMR Spectra of Mixtures

被引:21
|
作者
Wei, Weiwei [1 ]
Liao, Yuxuan [2 ]
Wang, Yufei [2 ]
Wang, Shaoqi [2 ]
Du, Wen [1 ]
Lu, Hongmei [2 ]
Kong, Bo [1 ]
Yang, Huawu [3 ]
Zhang, Zhimin [2 ]
机构
[1] China Tobacco Hunan Ind Co Ltd, Technol Ctr, Changsha 410014, Peoples R China
[2] Cent South Univ, Coll Chem & Chem Engn, Changsha 410083, Peoples R China
[3] China Tobacco Hunan Ind Co Ltd, Flavors & Fragrances Res Inst, Technol Ctr, Changsha 410014, Peoples R China
来源
MOLECULES | 2022年 / 27卷 / 12期
关键词
deep learning; identification; NMR; mixture analysis; NUCLEAR-MAGNETIC-RESONANCE; METABOLITE IDENTIFICATION; COMPLEX-MIXTURES; METABOLOMICS; SPECTROSCOPY; RESOLUTION; DECONVOLUTION; PREDICTION; ALIGNMENT; ROBUST;
D O I
10.3390/molecules27123653
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Nuclear magnetic resonance (NMR) spectroscopy is highly unbiased and reproducible, which provides us a powerful tool to analyze mixtures consisting of small molecules. However, the compound identification in NMR spectra of mixtures is highly challenging because of chemical shift variations of the same compound in different mixtures and peak overlapping among molecules. Here, we present a pseudo-Siamese convolutional neural network method (pSCNN) to identify compounds in mixtures for NMR spectroscopy. A data augmentation method was implemented for the superposition of several NMR spectra sampled from a spectral database with random noises. The augmented dataset was split and used to train, validate and test the pSCNN model. Two experimental NMR datasets (flavor mixtures and additional flavor mixture) were acquired to benchmark its performance in real applications. The results show that the proposed method can achieve good performances in the augmented test set (ACC = 99.80%, TPR = 99.70% and FPR = 0.10%), the flavor mixtures dataset (ACC = 97.62%, TPR = 96.44% and FPR = 2.29%) and the additional flavor mixture dataset (ACC = 91.67%, TPR = 100.00% and FPR = 10.53%). We have demonstrated that the translational invariance of convolutional neural networks can solve the chemical shift variation problem in NMR spectra. In summary, pSCNN is an off-the-shelf method to identify compounds in mixtures for NMR spectroscopy because of its accuracy in compound identification and robustness to chemical shift variation.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Deep Learning-based Object Classification on Automotive Radar Spectra
    Patel, Kanil
    Ramhach, Kilian
    Visentin, Tristan
    Rusev, Daniel
    Pfeiffer, Michael
    Yang, Bin
    2019 IEEE RADAR CONFERENCE (RADARCONF), 2019,
  • [22] A Deep Learning-Based Method Facilitates scRNA-seq Cell Type Identification
    Wang, Xin
    Li, Zhuo
    Hang, Jie
    Xu, Ren
    Meng, Lin
    NEURAL COMPUTING FOR ADVANCED APPLICATIONS, NCAA 2024, PT I, 2025, 2181 : 171 - 185
  • [23] A deep learning-based method for assessing tricuspid regurgitation using continuous wave Doppler spectra
    Xie, Shenghua
    Liu, Han
    Su, Li
    Shen, Jie
    Miao, Junwang
    Huang, Duo
    Zhou, Mi
    Liu, Huiruo
    Li, Yan
    Yin, Lixue
    Shu, Qinglan
    Wang, Yi
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [24] Deep Learning-Based Facial Landmarks Localization Using Compound Scaling
    Colaco, Savina Jassica
    Han, Dong Seog
    IEEE ACCESS, 2022, 10 : 7653 - 7663
  • [25] Identification of Specific Substances in the FAIMS Spectra of Complex Mixtures Using Deep Learning
    Li, Hua
    Pan, Jiakai
    Zeng, Hongda
    Chen, Zhencheng
    Du, Xiaoxia
    Xiao, Wenxiang
    SENSORS, 2021, 21 (18)
  • [26] A deep learning-based constrained intelligent routing method
    Rao, Zheheng
    Xu, Yanyan
    Pan, Shaoming
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2021, 14 (04) : 2224 - 2235
  • [27] Deep Learning-Based Method for Classification of Sugarcane Varieties
    Kai, Priscila Marques
    de Oliveira, Bruna Mendes
    da Costa, Ronaldo Martins
    AGRONOMY-BASEL, 2022, 12 (11):
  • [28] A deep learning-based method for the design of microstructural materials
    Tan, Ren Kai
    Zhang, Nevin L.
    Ye, Wenjing
    STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2020, 61 (04) : 1417 - 1438
  • [29] A deep learning-based method for silkworm egg counting
    Shi, Hongkang
    Chen, Xiao
    Zhu, Minghui
    Li, Linbo
    Wu, Jianmei
    Zhang, Jianfei
    JOURNAL OF ASIA-PACIFIC ENTOMOLOGY, 2025, 28 (01)
  • [30] A Deep Learning-Based Innovative Points Extraction Method
    Yu, Tao
    Wang, Rui
    Zhan, Hongfei
    Lin, Yingjun
    Yu, Junhe
    ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 130 - 138