Skip to content
2000
Volume 14, Issue 3
  • ISSN: 1568-0266
  • E-ISSN: 1873-4294

Abstract

Since the advent of high-throughput DNA sequencing technologies, the ever-increasing rate at which genomes have been published has generated new challenges notably at the level of genome annotation. Even if gene predictors and annotation softwares are more and more efficient, the ultimate validation is still in the observation of predicted gene product( s). Mass-spectrometry based proteomics provides the necessary high throughput technology to show evidences of protein presence and, from the identified sequences, confirmation or invalidation of predicted annotations. We review here different strategies used to perform a MS-based proteogenomics experiment with a bottom-up approach. We start from the strengths and weaknesses of the different database construction strategies, based on different genomic information (whole genome, ORF, cDNA, EST or RNA-Seq data), which are then used for matching mass spectra to peptides and proteins. We also review the important points to be considered for a correct statistical assessment of the peptide identifications. Finally, we provide references for tools used to map and visualize the peptide identifications back to the original genomic information.

Loading

Article metrics loading...

/content/journals/ctmc/10.2174/1568026613666131204105652
2014-02-01
2025-09-05
Loading full text...

Full text loading...

/content/journals/ctmc/10.2174/1568026613666131204105652
Loading

  • Article Type:
    Research Article
Keyword(s): bioinformatics; databases; gene annotation; mass-spectrometry; Proteogenomics; proteomics
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error
Please enter a valid_number test