Charles Explorer logo
🇬🇧

A Statistical Comparison of SimTandem with State-of-the-Art Peptide Identification Tools

Publication at Faculty of Mathematics and Physics |
2013

Abstract

The similarity search in theoretical mass spectra generated from protein sequence databases is a widely accepted approach for identification of peptides from query mass spectra generated by shotgun proteomics. Since query spectra contain many inaccuracies and the sizes of databases grow rapidly in recent years, demands on more accurate mass spectra similarities and on the utilization of database indexing techniques are still desirable.

We propose a statistical comparison of parameterized Hausdorff distance with freely available tools OMSSA, X!Tandem and with the cosine similarity. We show that a precursor mass filter in combination with a modification of previously proposed parameterized Hausdorff distance outperforms state-of-the-art tools in both - the speed of search and the number of identified peptide sequences (even though the q-value is only 0.001).

Our method is implemented in the freely available application SimTandem which can be used in the framework TOPP based on OpenMS.