DOI

  • Xiaowen Liu
  • Lennard J.M. Dekker
  • Si Wu
  • Martijn M. Vanduijn
  • Theo M. Luider
  • Nikola Tolić
  • Qiang Kou
  • Mikhail Dvorkin
  • Sonya Alexandrova
  • Kira Vyatkina
  • Ljiljana Paša-Tolić
  • Pavel A. Pevzner

There are two approaches for de novo protein sequencing: Edman degradation and mass spectrometry (MS). Existing MS-based methods characterize a novel protein by assembling tandem mass spectra of overlapping peptides generated from multiple proteolytic digestions of the protein. Because each tandem mass spectrum covers only a short peptide of the target protein, the key to high coverage protein sequencing is to find spectral pairs from overlapping peptides in order to assemble tandem mass spectra to long ones. However, overlapping regions of peptides may be too short to be confidently identified. High-resolution mass spectrometers have become accessible to many laboratories. These mass spectrometers are capable of analyzing molecules of large mass values, boosting the development of top-down MS. Top-down tandem mass spectra cover whole proteins. However, top-down tandem mass spectra, even combined, rarely provide full ion fragmentation coverage of a protein. We propose an algorithm, TBNovo, for de novo protein sequencing by combining top-down and bottom-up MS. In TBNovo, a top-down tandem mass spectrum is utilized as a scaffold, and bottom-up tandem mass spectra are aligned to the scaffold to increase sequence coverage. Experiments on data sets of two proteins showed that TBNovo achieved high sequence coverage and high sequence accuracy.

Язык оригиналаанглийский
Страницы (с-по)3241-3248
Число страниц8
ЖурналJournal of Proteome Research
Том13
Номер выпуска7
DOI
СостояниеОпубликовано - 3 июл 2014

    Предметные области Scopus

  • Биохимия
  • Химия (все)

ID: 100631086