Data Fusion-based Discovery (DAFdiscovery) pipeline to aid compound annotation and bioactive compound discovery across diverse spectral data

被引:9
|
作者
Borges, Ricardo Moreira [1 ]
Costa, Fernanda das Neves [1 ]
Chagas, Fernanda O. [1 ]
Teixeira, Andrew Magno [1 ]
Yoon, Jaewon [2 ]
Weiss, Marcio Barczyszyn [2 ]
Crnkovic, Camila Manoel [2 ]
Pilon, Alan Cesar [3 ]
Garrido, Bruno C. [4 ]
Betancur, Luz Adriana [5 ]
Forero, Abel M. [6 ,7 ,8 ]
Castellanos, Leonardo [6 ]
Ramos, Freddy A. [6 ]
Pupo, Monica T. [3 ]
Kuhn, Stefan [9 ]
机构
[1] Univ Fed Rio de Janeiro, Inst Pesquisas Prod Nat Walter Mors, Rio De Janeiro, Brazil
[2] Univ Sao Paulo, Fac Ciencias Farmaceut, Sao Paulo, Brazil
[3] Univ Sao Paulo, Fac Ciencias Farmaceut Ribeirao Preto, Sao Paulo, Brazil
[4] Organ Anal Lab, Chem Metrol Div, Inmetro, Brazil
[5] Univ Caldas, Dept Quim, Edificio Orlando Sierra, Caldas, Colombia
[6] Univ Nacl Colombia, Dept Quim, Sede Bogota, Bogota, Colombia
[7] Univ A Coruna, Dept Quim, Fac Ciencias, Coruna, Spain
[8] Univ A Coruna, Ctr Invest Cient Avanzadas CI CA, Coruna, Spain
[9] De Montfort Univ, Sch Comp Sci & Informat, Leicester, Leics, England
基金
巴西圣保罗研究基金会;
关键词
NMR;
D O I
10.1002/pca.3178
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Introduction Data Fusion-based Discovery (DAFdiscovery) is a pipeline designed to help users combine mass spectrometry (MS), nuclear magnetic resonance (NMR), and bioactivity data in a notebook-based application to accelerate annotation and discovery of bioactive compounds. It applies Statistical Total Correlation Spectroscopy (STOCSY) and Statistical HeteroSpectroscopy (SHY) calculation in their data using an easy-to-follow Jupyter Notebook. Method Different case studies are presented for benchmarking, and the resultant outputs are shown to aid natural products identification and discovery. The goal is to encourage users to acquire MS and NMR data from their samples (in replicated samples and fractions when available) and to explore their variance to highlight MS features, NMR peaks, and bioactivity that might be correlated to accelerated bioactive compound discovery or for annotation-identification studies. Results Different applications were demonstrated using data from different research groups, and it was shown that DAFdiscovery reproduced their findings using a more straightforward method. Conclusion DAFdiscovery has proven to be a simple-to-use method for different situations where data from different sources are required to be analyzed together.
引用
收藏
页码:48 / 55
页数:8
相关论文
共 42 条
  • [41] Rapid and accurate determination methods based on data fusion of laser-induced breakdown spectra and near-infrared spectra for main elemental contents in compound fertilizers
    Xu, Zhuopin
    Li, Xiaohong
    Cheng, Weimin
    Zhao, Guangxia
    Tang, Liwen
    Yang, Yang
    Wu, Yuejin
    Zhang, Pengfei
    Wang, Qi
    TALANTA, 2024, 266
  • [42] Application of an Exploratory Knowledge-Discovery Pipeline Based on Machine Learning to Multi-Scale OMICS Data to Characterise Myocardial Injury in a Cohort of Patients with Septic Shock: An Observational Study
    Pinto, Bernardo Bollen
    Ripoll, Vicent Ribas
    Subias-Beltran, Paula
    Herpain, Antoine
    Barlassina, Cristina
    Oliveira, Eliandre
    Pastorelli, Roberta
    Braga, Daniele
    Barcella, Matteo
    Subirats, Laia
    Bauza-Martinez, Julia
    Odena, Antonia
    Ferrario, Manuela
    Baselli, Giuseppe
    Aletti, Federico
    Bendjelid, Karim
    JOURNAL OF CLINICAL MEDICINE, 2021, 10 (19)