TransPeakNet for solvent-aware 2D NMR prediction via multi-task pre-training and unsupervised learning

被引:0
|
作者
Li, Yunrui [1 ]
Xu, Hao [2 ]
Kumar, Ambrish [3 ]
Wang, Duo-Sheng [4 ]
Heiss, Christian [3 ]
Azadi, Parastoo [3 ]
Hong, Pengyu [1 ]
机构
[1] Brandeis Univ, Dept Comp Sci, Waltham, MA 02453 USA
[2] Harvard Med Sch, Dept Med, Boston, MA USA
[3] Univ Georgia, Complex Carbohydrate Res Ctr, Athens, GA USA
[4] Boston Coll, Dept Chem, Chestnut Hill, MA USA
来源
COMMUNICATIONS CHEMISTRY | 2025年 / 8卷 / 01期
基金
美国国家科学基金会;
关键词
CHEMICAL-SHIFTS; NEURAL-NETWORK; ASSIGNMENT; C-13; H-1; COMPUTATION; DATABASE; HOSE;
D O I
10.1038/s42004-025-01455-9
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Nuclear Magnetic Resonance (NMR) spectroscopy is essential for revealing molecular structure, electronic environment, and dynamics. Accurate NMR shift prediction allows researchers to validate structures by comparing predicted and observed shifts. While Machine Learning (ML) has improved one-dimensional (1D) NMR shift prediction, predicting 2D NMR remains challenging due to limited annotated data. To address this, we introduce an unsupervised training framework for predicting cross-peaks in 2D NMR, specifically Heteronuclear Single Quantum Coherence (HSQC). Our approach pretrains an ML model on an annotated 1D dataset of 1H and 13C shifts, then finetunes it in an unsupervised manner using unlabeled HSQC data, which simultaneously generates cross-peak annotations. Our model also adjusts for solvent effects. Evaluation on 479 expert-annotated HSQC spectra demonstrates our model's superiority over traditional methods (ChemDraw and Mestrenova), achieving Mean Absolute Errors (MAEs) of 2.05 ppm and 0.165 ppm for 13C shifts and 1H shifts respectively. Our algorithmic annotations show a 95.21% concordance with experts' assignments, underscoring the approach's potential for structural elucidation in fields like organic chemistry, pharmaceuticals, and natural products.
引用
收藏
页数:10
相关论文
共 32 条
  • [21] A Multi-Task Learning Framework for Emotion Recognition Using 2D Continuous Space
    Xia, Rui
    Liu, Yang
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2017, 8 (01) : 3 - 14
  • [22] Fine-Grained Driving Behavior Prediction via Context-Aware Multi-Task Inverse Reinforcement Learning
    Nishi, Kentaro
    Shimosaka, Masamichi
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 2281 - 2287
  • [23] Unsupervised Anomaly Detection in 3D Brain MRI using Deep Learning with Multi-Task Brain Age Prediction
    Bengs, Marcel
    Behrendt, Finn
    Laves, Max-Heinrich
    Krueger, Julia
    Opfer, Roland
    Schlaefer, Alexander
    MEDICAL IMAGING 2022: COMPUTER-AIDED DIAGNOSIS, 2022, 12033
  • [24] Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors
    Hou, Ji
    Dai, Xiaoliang
    He, Zijian
    Dai, Angela
    Niessner, Matthias
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13510 - 13519
  • [25] Prediction of 3D Dose Distribution and PTV Contour Via Deep Multi-Task Learning Network
    Guo, F.
    Kong, F.
    Li, Y.
    Zhou, L.
    Song, T.
    MEDICAL PHYSICS, 2018, 45 (06) : E223 - E223
  • [26] End-to-End Motion Planners through Multi-Task Learning for Mobile Robots with 2D LiDAR
    Hoshino, Satoshi
    Unuma, Kyohei
    2023 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION, SII, 2023,
  • [27] CONDENSE: Consistent 2D/3D Pre-training for Dense and Sparse Features from Multi-View Images
    Zhang, Xiaoshuai
    Wang, Zhicheng
    Zhou, Howard
    Ghosh, Soham
    Gnanapragasam, Danushen
    Jampani, Varun
    Su, Hao
    Guibas, Leonidas
    COMPUTER VISION - ECCV 2024, PT LIV, 2025, 15112 : 19 - 38
  • [28] EchoEFNet: Multi-task deep learning network for automatic calculation of left ventricular ejection fraction in 2D echocardiography
    Li, Honghe
    Wang, Yonghuai
    Qu, Mingjun
    Cao, Peng
    Feng, Chaolu
    Yang, Jinzhu
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 156
  • [29] Multi-Task Learning for Pulmonary Arterial Hypertension Prognosis Prediction via Memory Drift and Prior Prompt Learning on 3D Chest CT
    Yang, Guanyu
    He, Yuting
    Lv, Yang
    Chen, Yang
    Coatrieux, Jean-Louis
    Sun, Xiaoxuan
    Wang, Qiang
    Wei, Yongyue
    Li, Shuo
    Zhu, Yinsu
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (04) : 1967 - 1978
  • [30] Histogram of Oriented Gradients meet deep learning: A novel multi-task deep network for 2D surgical image semantic segmentation
    Bhattarai, Binod
    Subedi, Ronast
    Gaire, Rebati Raman
    Vazquez, Eduard
    Stoyanov, Danail
    MEDICAL IMAGE ANALYSIS, 2023, 85