Deep Learning-Based Method for Compound Identification in NMR Spectra of Mixtures

被引:21
|
作者
Wei, Weiwei [1 ]
Liao, Yuxuan [2 ]
Wang, Yufei [2 ]
Wang, Shaoqi [2 ]
Du, Wen [1 ]
Lu, Hongmei [2 ]
Kong, Bo [1 ]
Yang, Huawu [3 ]
Zhang, Zhimin [2 ]
机构
[1] China Tobacco Hunan Ind Co Ltd, Technol Ctr, Changsha 410014, Peoples R China
[2] Cent South Univ, Coll Chem & Chem Engn, Changsha 410083, Peoples R China
[3] China Tobacco Hunan Ind Co Ltd, Flavors & Fragrances Res Inst, Technol Ctr, Changsha 410014, Peoples R China
来源
MOLECULES | 2022年 / 27卷 / 12期
关键词
deep learning; identification; NMR; mixture analysis; NUCLEAR-MAGNETIC-RESONANCE; METABOLITE IDENTIFICATION; COMPLEX-MIXTURES; METABOLOMICS; SPECTROSCOPY; RESOLUTION; DECONVOLUTION; PREDICTION; ALIGNMENT; ROBUST;
D O I
10.3390/molecules27123653
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Nuclear magnetic resonance (NMR) spectroscopy is highly unbiased and reproducible, which provides us a powerful tool to analyze mixtures consisting of small molecules. However, the compound identification in NMR spectra of mixtures is highly challenging because of chemical shift variations of the same compound in different mixtures and peak overlapping among molecules. Here, we present a pseudo-Siamese convolutional neural network method (pSCNN) to identify compounds in mixtures for NMR spectroscopy. A data augmentation method was implemented for the superposition of several NMR spectra sampled from a spectral database with random noises. The augmented dataset was split and used to train, validate and test the pSCNN model. Two experimental NMR datasets (flavor mixtures and additional flavor mixture) were acquired to benchmark its performance in real applications. The results show that the proposed method can achieve good performances in the augmented test set (ACC = 99.80%, TPR = 99.70% and FPR = 0.10%), the flavor mixtures dataset (ACC = 97.62%, TPR = 96.44% and FPR = 2.29%) and the additional flavor mixture dataset (ACC = 91.67%, TPR = 100.00% and FPR = 10.53%). We have demonstrated that the translational invariance of convolutional neural networks can solve the chemical shift variation problem in NMR spectra. In summary, pSCNN is an off-the-shelf method to identify compounds in mixtures for NMR spectroscopy because of its accuracy in compound identification and robustness to chemical shift variation.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] A deep learning-based constrained intelligent routing method
    Zheheng Rao
    Yanyan Xu
    Shaoming Pan
    Peer-to-Peer Networking and Applications, 2021, 14 : 2224 - 2235
  • [32] Deep learning-based inverse method for layout design
    Zhang, Yujie
    Ye, Wenjing
    STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2019, 60 (02) : 527 - 536
  • [33] Deep learning-based indirect bridge damage identification system
    Hajializadeh, Donya
    STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2023, 22 (02): : 897 - 912
  • [34] Deep Learning-Based Esophageal Tumor Identification on CT Slice
    Lu, S.
    Wang, J.
    Hu, W.
    Zhu, Z.
    Chen, Y.
    Yang, X.
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2023, 117 (02): : E691 - E692
  • [35] Deep learning-based identification of eyes at risk for glaucoma surgery
    Wang, Ruolin
    Bradley, Chris
    Herbert, Patrick
    Hou, Kaihua
    Ramulu, Pradeep
    Breininger, Katharina
    Unberath, Mathias
    Yohannan, Jithin
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [36] Deep Transfer Learning-Based Automated Identification of Bird Song
    Das, Nabanita
    Padhy, Neelamadhab
    Dey, Nilanjan
    Bhattacharya, Sudipta
    Tavares, Joao Manuel R. S.
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2023, 8 (04): : 33 - 45
  • [37] Deep learning-based lexical character identification in TV series
    Dalla Torre, Paola
    Fantozzi, Paolo
    Naldi, Maurizio
    DIGITAL SCHOLARSHIP IN THE HUMANITIES, 2023, 38 (04) : 1453 - 1465
  • [38] Deep Learning-based Osteocyte Identification And Quantification In Histological Images
    Lee, S.
    Childs, H. R.
    Tzoc-Pacheco, E.
    Anang, Y. Naa Ardua
    Doty, S. B.
    Lu, H. H.
    TISSUE ENGINEERING PART A, 2023, 29 (9-10)
  • [39] A Deep Learning-Based Unbalanced Force Identification of the Hypergravity Centrifuge
    Lin, Kuigeng
    Li, Yuke
    Wu, Yunhao
    Fu, Haoran
    Jiang, Jianqun
    Chen, Yunmin
    SENSORS, 2023, 23 (08)
  • [40] Deep learning-based approach for identification of diseases of maize crop
    Haque, Md Ashraful
    Marwaha, Sudeep
    Deb, Chandan Kumar
    Nigam, Sapna
    Arora, Alka
    Hooda, Karambir Singh
    Soujanya, P. Lakshmi
    Aggarwal, Sumit Kumar
    Lall, Brejesh
    Kumar, Mukesh
    Islam, Shahnawazul
    Panwar, Mohit
    Kumar, Prabhat
    Agrawal, R. C.
    SCIENTIFIC REPORTS, 2022, 12 (01)