Constructing high resolution consensus spectra for a peptide library

Slides:



Advertisements
Similar presentations
Tandem MS (MS/MS) on the Q-ToF2
Advertisements

Kaizhong Zhang Department of Computer Science University of Western Ontario London, Ontario, Canada Joint work with Bin Ma, Gilles Lajoie, Amanda Doherty-Kirby,
In-depth Analysis of Protein Amino Acid Sequence and PTMs with High-resolution Mass Spectrometry Lian Yang 2 ; Baozhen Shan 1 ; Bin Ma 2 1 Bioinformatics.
Proteomics Informatics – Protein identification III: de novo sequencing (Week 6)
How to identify peptides October 2013 Gustavo de Souza IMM, OUS.
CSE182 CSE182-L12 Mass Spectrometry Peptide identification.
Protein Sequencing and Identification by Mass Spectrometry.
Fa 05CSE182 CSE182-L7 Protein sequencing and Mass Spectrometry.
Peptide Identification by Tandem Mass Spectrometry Behshad Behzadi April 2005.
Data Processing Algorithms for Analysis of High Resolution MSMS Spectra of Peptides with Complex Patterns of Posttranslational Modifications Shenheng Guan.
Building and Using Libraries of Peptide Ion Fragmentation Spectra S.E. Stein, L.E. Kilpatrick, M. Mautner, P. Neta, J. Roth National Institute of Standards.
Proteomics: A Challenge for Technology and Information Science CBCB Seminar, November 21, 2005 Tim Griffin Dept. Biochemistry, Molecular Biology and Biophysics.
ProReP - Protein Results Parser v3.0©
Lawrence Hunter, Ph.D. Director, Computational Bioscience Program University of Colorado School of Medicine
Mass spectrometry in proteomics Modified from: I519 Introduction to Bioinformatics, Fall, 2012.
Proteomics Informatics – Protein identification II: search engines and protein sequence databases (Week 5)
Proteomics Informatics Workshop Part I: Protein Identification
Previous Lecture: Regression and Correlation
De Novo Sequencing of MS Spectra
Scaffold Download free viewer:
My contact details and information about submitting samples for MS
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Evaluated Reference MS/MS Spectra Libraries Current and Future NIST Programs.
Protein sequencing and Mass Spectrometry. Sample Preparation Enzymatic Digestion (Trypsin) + Fractionation.
Tryptic digestion Proteomics Workflow for Gel-based and LC-coupled Mass Spectrometry Protein or peptide pre-fractionation is a prerequisite for the reduction.
Karl Clauser Proteomics and Biomarker Discovery Taming Errors for Peptides with Post-Translational Modifications Bioinformatics for MS Interest Group ASMS.
The dynamic nature of the proteome
PROTEIN STRUCTURE NAME: ANUSHA. INTRODUCTION Frederick Sanger was awarded his first Nobel Prize for determining the amino acid sequence of insulin, the.
INF380 - Proteomics-91 INF380 – Proteomics Chapter 9 – Identification and characterization by MS/MS The MS/MS identification problem can be formulated.
MS/MS Libraries of Identified Peptides and Recurring Spectra in Protein Digests Lisa Kilpatrick, Jeri Roth, Paul Rudnick, Xiaoyu Yang, Steve Stein Mass.
Acknowledgements This work is supported by NSF award DBI , and National Center for Glycomics and Glycoproteomics, funded by NIH/NCRR grant 5P41RR
Common parameters At the beginning one need to set up the parameters.
A Phospho-Peptide Spectrum Library for Improved Targeted Assays Barbara Frewen 1, Scott Peterman 1, John Sinclair 2, Claus Jorgensen 2, Amol Prakash 1,
Laxman Yetukuri T : Modeling of Proteomics Data
Temple University MASS SPECTROMETRY FURTHER INVESTIGATIONS Ilyana Mushaeva and Amber Moscato Department of Electrical and Computer Engineering Temple University.
Temple University MASS SPECTROMETRY FURTHER INVESTIGATIONS Ilyana Mushaeva and Amber Moscato Department of Electrical and Computer Engineering Temple University.
Temple University MASS SPECTROMETRY INTRODUCTION Ilyana Mushaeva and Amber Moscato Department of Electrical and Computer Engineering Temple University.
A Reference Library of Peptide Ion Fragmentation Spectra: Yeast S.E. Stein, L.E. Kilpatrick, P. Neta, Q.L. Pu, J. Roth, X. Yang National Institute of Standards.
Proteomics What is it? How is it done? Are there different kinds? Why would you want to do it (what can it tell you)?
CSE182 CSE182-L12 Mass Spectrometry Peptide identification.
A Reference Library of Peptide Ion Fragmentation Spectra Stephen Stein 1 ; Lisa Kilpatrick 2 ; Pedatsur Neta 1 ; Jeri Roth 1 ; Xiaoyu Yang 1 National Institute.
CSE182 CSE182-L11 Protein sequencing and Mass Spectrometry.
Peptide Identification via Tandem Mass Spectrometry Sorin Istrail.
Multiple flavors of mass analyzers Single MS (peptide fingerprinting): Identifies m/z of peptide only Peptide id’d by comparison to database, of predicted.
Overview of Mass Spectrometry
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
Tag-based Blind Identification of PTMs with Point Process Model 1 Chunmei Liu, 2 Bo Yan, 1 Yinglei Song, 2 Ying Xu, 1 Liming Cai 1 Dept. of Computer Science.
MC 13.3 Spectroscopy, Pt III 1 Introduction to Mass Spectrometry (cont) Principles of Electron-Impact Mass Spectrometry:  A mass spectrometer produces.
Proteomics Informatics (BMSC-GA 4437) Course Directors David Fenyö Kelly Ruggles Beatrix Ueberheide Contact information
김지형. Introduction precursor peptides are dynamically selected for fragmentation with exclusion to prevent repetitive acquisition of MS/MS spectra.
B Monoisotopic mass of neutral peptide M r (calc): Fixed modifications: Carbamidomethyl Ions score: 45 † Expect: ‡ Matches (red): 18/50.
MS Libraries for Forensics: DART-MS and GC-MS
Database Search Algorithm for Identification of Intact Cross-Links in Proteins and Peptides Using Tandem Mass Sepctrometry 신성호.
Mass Spectrometry 101 (continued) Hackert - CH 370 / 387D
Sample Preparation Enzymatic Digestion (Trypsin) + Fractionation.
A Database of Peak Annotations of Empirically Derived Mass Spectra
LC-MS/MS Identification of Impurities Present in Synthetic Peptide Drugs Dr Anna Meljon*, Dr Alan Thompson, Dr Osama Chahrour, and Dr John Malone Almac.
MassMatrix Search Results Explained
Analytical techniques
Bioinformatics Solutions Inc.
Proteomics Informatics David Fenyő
Interpretation of Mass Spectra I
Proteomics Informatics –
NoDupe algorithm to detect and group similar mass spectra.
Protein Identification Using Tandem Mass Spectrometry
Shotgun Proteomics in Neuroscience
Proteomics Informatics David Fenyő
Interpretation of Mass Spectra
Kuen-Pin Wu Institute of Information Science Academia Sinica
Presentation transcript:

Constructing high resolution consensus spectra for a peptide library Sergey L. Sheetlin, Yuri A. Mirokhin, Dmitrii V. Tchekhovskoi, Xiaoyu Yang, Stephen E. Stein NIST Mass Spectrometry Data Center, Biomolecular Measurement Division, National Institute of Standards and Technology

Tandem mass spectrometry Sample Ionization m/z ion sorting MS1 precursor ion CID fragmentation m/z ion sorting MS2 product ions Detection (measuring m/z) m/z: mass-to-charge ratio CID: Collision-Induced Dissociation MS: Mass Spectrum

Example of chromatogram (using Thermo Xcaliber Qual Browser) Relative abundance Time (min)

Example of MS1 spectrum (using Thermo Xcaliber Qual Browser) Relative abundance

Example of high resolution MS2 spectrum (shown with NIST MS search program) Relative abundance

Peptides Glu-Thr-Lys ETK Glutamylthreonyllysine C15H28N4O7 Short chains of amino acids connected by amide bonds Glu-Thr-Lys ETK Glutamylthreonyllysine C15H28N4O7

Peptide libraries NIST MS Search libraries of peptide spectra are available at http://chemdata.nist.gov/dokuwiki/doku.php?id=peptidew:cdownload

Nomenclature for peptide CID fragment ions R1 R2 Rn-1 Rn | | | | H2N-CH-CO-NH-CH-CO- …...-NH-CH-CO-NH-CH-CO2H I. A. Papayannopoulos. The interpretation of collision-induced dissociation mass spectra of peptides. Mass Spectrometry Review, 14:49–73, 1995.

Computation of fragment masses

Peptides monoisotopic masses 20 amino acids residues and their monoisotopic masses Ala Arg Asn Asp Cys Glu Gln Gly His Ile A R N D C E Q G H I 71.0371 156.1011 114.0429 115.0269 103.0092 129.0426 128.0586 57.0215 137.0589 113.0841 Leu Lys Met Phe Pro Ser Thr Trp Tyr Val L K M F P S T W Y V 113.0841 128.095 131.0405 147.0684 97.0528 87.032 101.0477 186.0793 163.0633 99.0684

Protein modifications for mass spectrometry In vivo Posttranslational modifications (PTM) are covalent modifications of proteins after its translation. PTMs play role in activity, function of proteins and their interaction with other molecules. Modifications caused by sample preparation. Modification name Composition Monoisotopic mass Carbamidomethyl C2H3NO 57.021464 iTRAQ4plex H12C413C3N15NO 144.102063 Oxidation O 15.994915 Deamidation H-1N-1O 0.984016 Phosphorylation HO3P 79.966331 Glu->pyro-Glu H-2O-1 -18.010565 Gln->pyro-Glu H-3N-1 -17.026549 Protein modifications for mass spectrometry www.unimod.org

Fragments neutral losses Common losses are H2O, NH3, CO, H3PO4,iTRAQ (H12C413C3N15NO). Relative abundance

Peptide “GHVIAAR”; charge 2; modification iTRAQ4plex on the first AA ‘G’ Relative abundance

Experimental and theoretical isotopic peaks Relative abundance Valkenborg, D., Mertens, I., Lemiere, F., Witters, E., Burzykowski, T.: The isotopic distribution conundrum. Mass Spectrom. Rev. 31(1), 96–109 (2012)

Annotation of peaks of experimental spectra

Experimental and theoretical densities Probability density function

Experimental and theoretical densities Probability density function

Experimental and theoretical densities Probability density function

Error of annotation of experimental peaks Probability density function

Statistics of different types of fragment ions (based on limited set of data)

Clustering experimental spectra Set of unidentified MS2 spectra Identification (peptide sequencing) Grouping results by charge, peptide, modifications, collision energy Filtering Clusters of replicate spectra

Peptide sequencing algorithms Database search: comparing theoretical spectra for sequences from a database with the query spectrum MS-GF+, Mascot etc. De novo sequencing: trying to find a peptide optimal in terms of some measure of similarity between its theoretical spectrum and the query spectrum PEAKS, NovoHMM etc. Library search: direct comparison of the query spectrum with identified spectra from a library NIST Mass Spectral Library etc.

Example of experimental spectra from the same cluster Relative abundance

Computing peaks of consensus spectra MS-GF+ 𝑑 1 𝑋 1 𝑑 2 𝑋 2 𝑑 3 Replicate spectra 𝑋 3 𝑑 5 𝑋 5 Consensus

Computing peaks of consensus spectra

Example of log-likelihood

Average fraction of replicates Hypothesis: “good” peaks of the consensus spectrum have properties similar to annotated peaks

Filtering peaks of consensus spectra Replicate number Is there a peak in the replicate corresponding to the given consensus peak with abundance A? 1 Yes 2 … N-1 No N Bernoulli distribution: Yes with probability p; No with probability 1-p

Density of peaks for different relative abundancies

Comparison of consensus and best replicate spectra

Further directions Adjusting the parameters of the method for optimal performance of the existing search algorithms Building peptide libraries of consensus spectra

Acknowledgements NIST MS Data Center Yuri A. Mirokhin Dmitrii V. Tchekhovskoi Xiaoyu Yang Stephen E. Stein William E. Wallace