Conclusion
- The comprehensive workflow identified approximately 70% more high-confidence peptides than the general search strategy.
- It increased the number of high-confidence protein identifications and high-confidence grouped protein identifications by approximately 63% and 44%, respectively, compared with the general search approach.
- It identified a large number of high-confidence peptides carrying multiple PTMs.
- The percentage of matched spectra improves significantly with the comprehensive search workflow.

References
1. Khoury GA, Baliban RC, Floudas CA. Proteome-wide post-translational modification statistics: frequency analysis and curation of the Swiss-Prot database. Sci Rep. Sep 13;1.
2. Schandorff S, Olsen JV, Bunkenborg J, Blagoev B, Zhang Y, Andersen JS, Mann M. A mass spectrometry-friendly database for cSNP identification. Nat Methods. 2007 Jun;4(6).

Overview
Purpose: Development of a comprehensive protein identification workflow that identifies more high-confidence peptide/protein IDs, including post-translational modifications, than traditional workflows.
Methods: Multiple search engines (e.g., SEQUEST and Mascot) were combined, with the PTM combination for each search node chosen according to relative PTM abundances in UniProtKB, taken from high-quality, manually curated, proteome-wide data [1].
Results: A substantial increase in high-confidence, Percolator-validated peptide/protein identifications compared with the standard SEQUEST and Mascot workflow.

Introduction
Mass spectrometry has become an established method for protein identification and characterization in recent years. The number of protein identifications obtained from complex biological samples depends on many factors, ranging from the data acquisition strategy to the MS/MS data searching method. Unfortunately, for any complex biological sample, only a fraction of the spectra generated yield confident peptide matches.
Many users overlook several factors in their data searching strategy, including an appropriate combination of post-translational modifications (PTMs), coding SNPs [2], protein isoforms, and iterative searching, that could help identify these unmatched spectra. We have developed a comprehensive protein identification workflow that yields a higher number of high-confidence peptide/protein IDs and also identifies multiple PTMs and partially cleaved peptides in a single run.

Methods
Comprehensive workflow development
We developed a comprehensive MS/MS searching workflow within Proteome Discoverer that uses a combination of multiple search engines (Figure 1) in an iterative fashion to maximize the number of protein/peptide identifications by considering the most frequently found PTMs [1], sequence isoforms of proteins, and partially cleaved peptides. The effects of several factors on peptide identification were explored and implemented in the process: protein isoforms, missed cleavage sites, semi-tryptic digestion and, most importantly, an appropriate combination of PTMs in each search node. The PTM combinations were chosen based on the UniProtKB relative abundance of each PTM, found experimentally and putatively, from high-quality, manually curated, proteome-wide data [1]. The workflows were tested on plasma and urine samples acquired on a hybrid Orbitrap mass spectrometer.

Results
Peptide Identification
We compared the results from our comprehensive searching workflow with those of the general search. On average, the number of high-confidence peptide identifications (FDR ≤ 0.01) increased by approximately 70% with the comprehensive workflow compared with general searches, whereas the number of medium-confidence peptide identifications (FDR ≤ 0.05) was twice that of the general searches (Figure 2).

FIGURE 2. Comprehensive workflow increases the number of peptide identifications.
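The peptide comparison above comes down to two steps: binning identifications by q-value and expressing the gain as a relative increase over the general search. The sketch below illustrates that arithmetic only. The q-values are hypothetical and are not data from the poster, the function names are made up for illustration, and treating "medium" as 0.01 < q ≤ 0.05 (exclusive of "high") is an assumption.

```python
from collections import Counter


def confidence_tier(q_value: float) -> str:
    """Bin a q-value using the poster's thresholds.

    Assumption: tiers are mutually exclusive, so "medium" means
    0.01 < q <= 0.05. The "low" label is a placeholder for everything else.
    """
    if q_value <= 0.01:
        return "high"
    if q_value <= 0.05:
        return "medium"
    return "low"


def percent_increase(general: int, comprehensive: int) -> float:
    """Relative gain of the comprehensive count over the general count, in %."""
    if general <= 0:
        raise ValueError("general count must be positive")
    return 100.0 * (comprehensive - general) / general


# Hypothetical q-values for one sample (illustration only).
general_q = [0.001, 0.008, 0.02, 0.04, 0.30]
comprehensive_q = [0.001, 0.002, 0.008, 0.009, 0.02, 0.03, 0.04, 0.30]

general_tiers = Counter(confidence_tier(q) for q in general_q)
comprehensive_tiers = Counter(confidence_tier(q) for q in comprehensive_q)

for tier in ("high", "medium"):
    gain = percent_increase(general_tiers[tier], comprehensive_tiers[tier])
    print(f"{tier}: {general_tiers[tier]} -> {comprehensive_tiers[tier]} ({gain:.1f}% increase)")
```

With real data, the same two functions would be applied per sample and the percentages averaged, which is how an "on average, approximately 70%" figure would typically be obtained.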
Improving mass spectrometry data searching workflow to maximize protein identifications
Shadab Ahmad (1), Amol Prakash (1), David Sarracino (1), Bryan Krastins (1), MingMing Ning (2), Barbara Frewen (1), Scott Peterman (1), Gregory Byram (1), Maryann S. Vogelsang (1), Gouri Vadali (1), Jennifer Sutton (1), Mary F. Lopez (1)
(1) Thermo Fisher Scientific, BRIMS (Biomarker Research in Mass Spectrometry), Cambridge, MA
(2) Massachusetts General Hospital, Boston, MA

SEQUEST and Percolator are registered trademarks of University of Washington. All other trademarks are the property of Thermo Fisher Scientific and its subsidiaries. This information is not intended to encourage use of these products in any manners that might infringe the intellectual property rights of others.

Sample Preparation
To evaluate the performance of the comprehensive workflow, we took four human samples from two different sources: (a) urine (one sample) and (b) plasma (three samples). Human urine and plasma samples were collected with full consent and approval. The samples were subjected to reduction and alkylation followed by digestion with trypsin.

Liquid Chromatography and Mass Spectrometry
The digested samples were separated on a C18 column with a 5-45% acetonitrile gradient in 0.1% formic acid on a nano-LC system. The urine sample (sample no. 1) and a plasma sample (sample no. 2) were run for 140 minutes and 90 minutes, respectively, and the data were acquired on an LTQ Orbitrap Velos MS with top 11 and top 10 data-dependent MS/MS, respectively, using CID fragmentation. Two further plasma samples (sample nos. 3 and 4) were run for 250 minutes and 240 minutes, respectively, and the data were acquired on a Thermo Q Exactive benchtop mass spectrometer with top 15 data-dependent MS/MS using HCD fragmentation.

Data Analysis
The acquired data were searched with Proteome Discoverer 1.4 (Thermo Fisher Scientific) using the comprehensive workflow and also using a general SEQUEST workflow with standard PTMs (oxidation of methionine as a dynamic modification and alkylation as a static modification) coupled with Percolator validation (General Search).

FIGURE 1. Structure of the comprehensive workflow.

The comprehensive workflow was found to increase the number of high-confidence proteins (FDR ≤ 0.01) by 63% and the number of high-confidence grouped proteins by 44% relative to the general search. Moreover, it increased the number of high-confidence group proteins (with at least two high-confidence peptides for every protein in the group) by 15% (Figure 3).

FIGURE 3. Comprehensive workflow increases the number of grouped protein identifications (with at least two peptide hits per protein).

The comprehensive workflow also identified several high-confidence peptides with multiple PTMs, which shows the importance of the right PTM combination in a search node (Table 1).

Table 1. Examples of peptides containing multiple PTMs from the comprehensive search.
Sequence | Modification | q-Value
RATTVTGTPCQDWAAQEPHR | R1(ADP-Ribosyl); G7(Myristoyl); C10(Carboxymethyl) | ≤0.001
VSHSPPPKQRSSPVTK | S2(Phospho); S4(Phospho); K8(Methyl); R10(Methyl) | ≤0.001
LLIYAASSLETGVPSR | Y4(Phospho); A6(Acetyl) | 0.007
LVRPEVDVMCTAFHDNEETFLK | M9(Oxidation); C10(Carboxymethyl); F13(Amidated); E17(Carboxy); F20(Amidated) | ≤0.001

We further investigated the matched and unmatched spectra for the general search and our comprehensive search. The percentage of matched spectra improves significantly with the comprehensive search workflow (Figure 4, Table 2).

FIGURE 4. Comprehensive workflow increases the number of matched spectra.

Table 2. Comparative table for matched spectra.
File | Total Spectra | Matched Spectra, General Search (FDR ≤ 0.05) | Matched Spectra, Comprehensive Search (FDR ≤ 0.05) | Matched Spectra, General Search (FDR ≤ 0.01) | Matched Spectra, Comprehensive Search (FDR ≤ 0.01)
Sample | | | 43.5% | 26.0% | 38.5%
Sample | | | 34.4% | 14.5% | 30.1%
Sample | | | 32.8% | 19.1% | 30.1%
Sample | | | 18.1% | 8.0% | 16.8%
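The modification annotations in Table 1 follow a residue, position, name pattern such as S2(Phospho), so each reported site can be checked against the peptide sequence. The sketch below is an illustrative consistency check, not part of the poster's workflow or of Proteome Discoverer. The function names and the regular expression are assumptions, and positions are taken to be 1-based.

```python
import re

# One annotation looks like "R1(ADP-Ribosyl)": residue letter, 1-based position, name.
MOD_PATTERN = re.compile(r"([A-Z])(\d+)\(([^)]+)\)")


def parse_modifications(annotation: str) -> list[tuple[str, int, str]]:
    """Split an annotation string into (residue, position, name) tuples."""
    return [(res, int(pos), name) for res, pos, name in MOD_PATTERN.findall(annotation)]


def mismatched_sites(sequence: str, annotation: str) -> list[tuple[str, int, str]]:
    """Return annotated sites whose residue does not match the sequence."""
    problems = []
    for residue, position, name in parse_modifications(annotation):
        in_range = 1 <= position <= len(sequence)
        if not in_range or sequence[position - 1] != residue:
            problems.append((residue, position, name))
    return problems


# Sequences and annotations as listed in Table 1.
TABLE_1 = [
    ("RATTVTGTPCQDWAAQEPHR", "R1(ADP-Ribosyl); G7(Myristoyl); C10(Carboxymethyl)"),
    ("VSHSPPPKQRSSPVTK", "S2(Phospho); S4(Phospho); K8(Methyl); R10(Methyl)"),
    ("LLIYAASSLETGVPSR", "Y4(Phospho); A6(Acetyl)"),
    ("LVRPEVDVMCTAFHDNEETFLK",
     "M9(Oxidation); C10(Carboxymethyl); F13(Amidated); E17(Carboxy); F20(Amidated)"),
]

for sequence, annotation in TABLE_1:
    problems = mismatched_sites(sequence, annotation)
    status = "consistent" if not problems else f"mismatches: {problems}"
    print(f"{sequence}: {status}")
```

A check like this only confirms that each annotated residue sits at the stated position. It says nothing about whether the modification is chemically plausible on that residue, which still needs manual inspection of the spectra.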