Motivation: Peptidic natural products (PNPs) are considered a promising compound class that has many applications in medicine. Recently developed mass spectrometry-based pipelines are transforming PNP discovery into a high-throughput technology. However, the current computational methods for PNP identification via database search of mass spectra are still in their infancy and could be substantially improved. Results: Here we present NPS, a statistical learning-based approach for scoring PNP-spectrum matches. We incorporated NPS into two leading PNP discovery tools and benchmarked them on millions of natural product mass spectra. The results demonstrate more than 45% increase in the number of identified spectra and 20% more found PNPs at a false discovery rate of 1%.

Original languageEnglish
Article numberbtz374
Pages (from-to)I315-I323
Number of pages9
JournalBioinformatics
Volume35
Issue number14
DOIs
StatePublished - 15 Jul 2019

    Scopus subject areas

  • Computational Mathematics
  • Molecular Biology
  • Biochemistry
  • Statistics and Probability
  • Computer Science Applications
  • Computational Theory and Mathematics

    Research areas

  • TANDEM MASS-SPECTRA, DATABASE SEARCH, SPECTROMETRY, IDENTIFICATION, DEREPLICATION, DISCOVERY

ID: 43669243