Deep learning for peptide identification from metaproteomics datasets

Cited by: 9
Authors
Feng, Shichao [1]
Sterzenbach, Ryan [2]
Guo, Xuan [1]
Affiliations
[1] Univ North Texas, Dept Comp Sci & Engn, 3940 N Elm St,Ste F290, Denton, TX 76207 USA
[2] Univ North Texas, Dept Biomed Engn, Denton, TX 76203 USA
Funding
National Institutes of Health (USA)
Keywords
Peptide identification; Deep learning; Tandem mass spectrometry; CNN; PROTEIN IDENTIFICATION; STATISTICAL-MODEL; MS/MS; CONFIDENCE; CHALLENGES; REVEALS;
DOI
10.1016/j.jprot.2021.104316
Chinese Library Classification (CLC)
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Metaproteomics is becoming widely used in microbiome research for gaining insights into the functional state of the microbial community. Current metaproteomics studies are generally based on high-throughput tandem mass spectrometry (MS/MS) coupled with liquid chromatography. In this paper, we propose a deep-learning-based algorithm, named DeepFilter, for improving peptide identifications from a collection of tandem mass spectra. The key advantage of DeepFilter is that it does not need ad hoc training or fine-tuning, as existing filtering tools do. DeepFilter is freely available under the GNU GPL license at https://github.com/Biocomputing-Research-Group/DeepFilter. Significance: The identification of peptides and proteins from MS data involves the computational procedure of searching MS/MS spectra against a predefined protein sequence database and assigning top-scored peptides to spectra. Existing computational tools are still far from being able to extract all the information from MS/MS data sets acquired from metaproteome samples. Systematic experimental results demonstrate that DeepFilter identified up to 12% more peptide-spectrum matches and 9% more proteins than existing filtering algorithms, including Percolator, Q-ranker, PeptideProphet, and iProphet, on marine and soil microbial metaproteome samples at a false discovery rate of 1%. The taxonomic analysis shows that DeepFilter found up to 7%, 10%, and 14% more species from marine, soil, and human gut samples, respectively, compared with existing filtering algorithms. Therefore, DeepFilter is believed to generalize properly to new, previously unseen peptide-spectrum matches and can be readily applied to peptide identification from metaproteomics data.
Pages: 14