Acknowledgements This work is supported by NSF award DBI-0642897, and National Center for Glycomics and Glycoproteomics, funded by NIH/NCRR grant 5P41RR018942.

Slides:



Advertisements
Similar presentations
Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
Advertisements

Proteomics Informatics – Protein characterization I: post-translational modifications (Week 10)
De novo glycan structure search with CID MS/MS spectra of native N-glycopeptides Hannu Peltoniemi
Analysis of human haptoglobin, digest with trypsin and Glu-C – six putative N-motif peptides. Glycopeptide separation by hydrophilic interaction liquid.
A Multi-PCA Approach to Glycan Biomarker Discovery using Mass Spectrometry Profile Data Anoop Mayampurath, Chuan-Yih Yu Info-690 (Glycoinformatics) Final.
In-depth Analysis of Protein Amino Acid Sequence and PTMs with High-resolution Mass Spectrometry Lian Yang 2 ; Baozhen Shan 1 ; Bin Ma 2 1 Bioinformatics.
ACCELERATING CLINICAL AND TRANSLATIONAL RESEARCH Metabolomics/Proteomics and Genomics at IUB Indiana CTSI – Purdue Retreat Monday,
N-Glycopeptide Identification from CID Tandem Mass Spectra using Glycan Databases and False Discovery Rate Estimation Kevin B. Chandler, Petr Pompach,
How to identify peptides October 2013 Gustavo de Souza IMM, OUS.
PepArML: A model-free, result-combining peptide identification arbiter via machine learning Xue Wu, Chau-Wen Tseng, Nathan Edwards University of Maryland,
Automatic annotation of N-glycan species in MALDI-TOF-TOF spectra for rapid profiling and comparing Chuan-Yih, Yu Capstone Advisor: Prof. Haixu.
Protein Sequencing and Identification by Mass Spectrometry.
Biomarker discovery by automatic annotation of N-glycan species in MALDI-TOF-TOF spectra Chuan-Yih, Yu Capstone Advisor: Prof. Haixu Tang Indiana.
Peptide Identification by Tandem Mass Spectrometry Behshad Behzadi April 2005.
Data Processing Algorithms for Analysis of High Resolution MSMS Spectra of Peptides with Complex Patterns of Posttranslational Modifications Shenheng Guan.
Smart Templates for Chemical Identification in GCxGC-MS QingPing Tao 1, Stephen E. Reichenbach 2, Mingtian Ni 3, Arvind Visvanathan 2, Michael Kok 2, Luke.
Sangtae Kim Ph.D. candidate University of California, San Diego
Proteomics: A Challenge for Technology and Information Science CBCB Seminar, November 21, 2005 Tim Griffin Dept. Biochemistry, Molecular Biology and Biophysics.
Biomarker discovery by automatic annotation of N-glycan species in MALDI-TOF-TOF spectra Chuan-Yih, Yu Capstone Advisor: Prof. Haixu Tang.
ProReP - Protein Results Parser v3.0©
Computational Methods for Biomarker Discovery in Proteomics and Glycomics Vijetha Vemulapalli School of Informatics Indiana University Capstone Advisor:
Proteomics Informatics – Protein identification II: search engines and protein sequence databases (Week 5)
Proteomics Informatics Workshop Part I: Protein Identification
Previous Lecture: Regression and Correlation
Mass Spectrometry. What are mass spectrometers? They are analytical tools used to measure the molecular weight of a sample. Accuracy – 0.01 % of the total.
FIGURE 5. Plot of peptide charge state ratios. Quality Control Concept Figure 6 shows a concept for the implementation of quality control as system suitability.
Material Measurement Laboratory Mass Spectral Database of Glycans and Glycopeptides in Therapeutic Drugs Maria Lorna A. De Leoz, Xinjian (Eric) Yan, Xiaoyu.
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
My contact details and information about submitting samples for MS
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Tryptic digestion Proteomics Workflow for Gel-based and LC-coupled Mass Spectrometry Protein or peptide pre-fractionation is a prerequisite for the reduction.
Production of polypeptides, Da, and middle-down analysis by LC-MSMS Catherine Fenselau 1, Joseph Cannon 1, Nathan Edwards 2, Karen Lohnes 1,
PROTEIN STRUCTURE NAME: ANUSHA. INTRODUCTION Frederick Sanger was awarded his first Nobel Prize for determining the amino acid sequence of insulin, the.
Introduction The GPM project (The Global Proteome Machine Organization) Salvador Martínez de Bartolomé Bioinformatics support –
Automatic annotation of N-glycans in MALDI-TOF spectra for rapid glycan profiling and comparison Chuan-Yih, Yu Capstone Presentation Advisor:
An introduction and possible applications Ariane Kahnt
Common parameters At the beginning one need to set up the parameters.
Analysis of Complex Proteomic Datasets Using Scaffold Free Scaffold Viewer can be downloaded at:
A Comprehensive Comparison of the de novo Sequencing Accuracies of PEAKS, BioAnalyst and PLGS Bin Ma 1 ; Amanda Doherty-Kirby 1 ; Aaron Booy 2 ; Bob Olafson.
A Phospho-Peptide Spectrum Library for Improved Targeted Assays Barbara Frewen 1, Scott Peterman 1, John Sinclair 2, Claus Jorgensen 2, Amol Prakash 1,
Laxman Yetukuri T : Modeling of Proteomics Data
Automatic Analysis of Ion Mobility Spectrometry – Mass Spectrometry (IMS-MS) Data Hyejin Yoon School of Informatics Indiana University Bloomington December.
PeptideProphet Explained Brian C. Searle Proteome Software Inc SW Bertha Blvd, Portland OR (503) An explanation.
CS 461b/661b: Bioinformatics Tools and Applications Software Algorithm Mathematical Models Biology Experiments and Data.
Knowledge Enabled Information and Services Science Glycomics project overview.
PEAKS: De Novo Sequencing using Tandem Mass Spectrometry Bin Ma Dept. of Computer Science University of Western Ontario.
Glycoprotein Microheterogeneity via N-Glycopeptide Identification Kevin Brown Chandler, Petr Pompach, Radoslav Goldman, Nathan Edwards Georgetown University.
CSE182 CSE182-L11 Protein sequencing and Mass Spectrometry.
EBI is an Outstation of the European Molecular Biology Laboratory. In silico analysis of accurate proteomics, complemented by selective isolation of peptides.
Data Management Support for Life Sciences or What can we do for the Life Sciences? Mourad Ouzzani
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
Proteomics Informatics (BMSC-GA 4437) Course Directors David Fenyö Kelly Ruggles Beatrix Ueberheide Contact information
Constructing high resolution consensus spectra for a peptide library
김지형. Introduction precursor peptides are dynamically selected for fragmentation with exclusion to prevent repetitive acquisition of MS/MS spectra.
Effects of Dissolution of Biomass in Ionic Liquids Using Direct Analysis in Real Time (DART) Mass Spectrometry Casey Howdieshell a, Darrin.
Quantitation using Pseudo-Isobaric Tags (QuPIT) and Quantitation using Pseudo-isobaric Amino acids in Cell culture (QuPAC) Parimal Samir Andrew J. Link.
Data independent acquisition methods for metabolomics Stephen Tate, Ron Bonner AB SCIEX, 71 Four Valley Drive, Concord, ON, L4K 4V8 Canada A high resolution.
Carbonyl-Reactive Tandem Mass Tags for the Proteome-Wide Quantification of N-Linked Glycans Hannes Hahne, Patrick Neubert, Karsten Kuhn, Chris Etienne,
Target Analyses in Parallel Reaction Monitoring Mode (PRM)
A Database of Peak Annotations of Empirically Derived Mass Spectra
LC-MS/MS Identification of Impurities Present in Synthetic Peptide Drugs Dr Anna Meljon*, Dr Alan Thompson, Dr Osama Chahrour, and Dr John Malone Almac.
Tandem MS.
Authors: Aruna Jyothi. M, Sanovar Bhargava, Hima Bindu. A, Subbarayudu
Bioinformatics Solutions Inc.
Proteomics Informatics David Fenyő
The Haystack Is Full of Needles: Technology Rescues Sugars!
Proteomics Informatics –
NoDupe algorithm to detect and group similar mass spectra.
Bioinformatics for Proteomics
Proteomics Informatics David Fenyő
Presentation transcript:

Acknowledgements This work is supported by NSF award DBI , and National Center for Glycomics and Glycoproteomics, funded by NIH/NCRR grant 5P41RR This work was also partially supported by the Indiana Metabolomics and Cytomics Initiative METACyt). An Integrative HCD/CID Scoring Scheme for Improved Characterization of Site-Specific Protein N -Glycosylation Anoop M. Mayampurath 1, Yin Wu 1, Zaneer M. Segu 2,3, Milos V. Novotny 2,3, Yehia Mehcref 2,3, Haixu Tang 1,3 1 School of Informatics, 2 Department of Chemistry and 3 National Center for Glycomics and Glycoproteomics Indiana University, Bloomington, IN  We present GlyPID 2.0, a software tool for accurate characterization of protein glycosylation through a combination of the following techniques Accurate precursor ion mass calculated using THRASH–based deisotoping methods [2, 3] A CID score calculated from Collision- Induced Dissociation (CID) spectra, similar to previous version of GlypID [1]. The model is based on spacing between fragment peaks indicative of glycan fragmentation. A new HCD score calculated from High- energy C-trap Dissociation (HCD) spectra scoring technique based on the presence of characterestic glycan peaks. HCD is available on the LTQ Orbitrap XL.  GlyPID also reports the type of glycosylation (High Mannose, Complex etc.).  The tool is open source and thus can be tailored to different needs. Ongoing and Future work Methods Introduction Overview Protein glycosylation is a common post- translational modification, estimated to occur in over 50% of human proteins. Platforms such as liquid chromatography with tandem mass spectrometry (LC/MS/MS) typically employ Collision-Induced Dissociation (CID) for the characterization of glycoproteins. We have previously developed GlyPID that enables the automatic mapping of N-glycolsylation sites and their microhetrogenties from LC/MS/MS data. This was achieved by combining the clusters of precursor ions and their fragmentation patterns. Here, we use the newly developed LTQ Orbitrap XL instrument that simultaneously performs high- energy-C-trap dissociation (HCD) and CID. An integrative scoring scheme is presented and implemented as part of GlypID 2.0 that combines HCD and CID data with the accurate monoisotopic precursor mass, thus allowing a confident characterization of protein glycosylation.  Accurate Precursor Ion Mass - Used DeconMSn [2] methodology to accurately detect precursor ion mass Precursor mono mass : Precursor CS : 3  HCD Scoring - The algorithm looks for the presence of seven characteristic monosacharride peaks from HCD fragments, (see example above) and models the number of detected characteristic peaks as a binomial distribution. - A p-value score is then calculated to assign confidence to the identification.  CID Scoring -The De Novo CID scoring algorithm finds all connected-monosaccharide paths in the CID spectrum using fragment mass differences[3]. -The largest oligosaccharide sequence subset identified using the dynamic programming algorithm to find the longest path in the spectrum graph (see example above) THRASH precursor  Prediction of the classes of N-glycosylation -The classes of N-glycosylation (High-Mannose, Complex-asaliated, Complex-saliated or Hybrid) can be predicted by matching observed HCD spectra against theoretical characteristic peak distribution. - As an example, high mannose will have a prominent peak at Hex (from Mannose) and a weak peak at Hex+GlcNAc. Results Class 137 m/zHexGlcNAc NeuAC (-H20)NeuAc Hex+ GlcNAc NeuAc+He x+GlcNAc High Mannose 0S000W0 Complex- Asaliated 0WM00S0 Complex- Saliated MMMMMMM Hybrid MSMMMMM S – Strong (0.99); M- Moderate ( ); W – weak ( ) Example (from glycosylated Fetuin sample datset) of integrative scoring for an ion (given in 2D view with monoisotopic mass on Y axis and LC scan on X axis) of mass Presence of monosaccharide peaks (in HCD spectra) and monosocharride path (in CID spectra) suggests that this could be a glycosylated peptide. References 1.Mayampurath et. al., DeconMSn: a software tool for accurate parent ion monoisotopic mass determination for tandem mass spectra. Bioinformatics Apr 1;24(7): Horn, et. al. Automated Reduction and Interpretation of High Resolution Electrospray Mass Spectra of Large Molecules. J. Am. Soc. Mass Spectrom. 2000, 11, Wu et. al. A Comp. Approach for the Identification of Site-Specific Protein Glycosylations Through Ion-Trap Mass Spectrometry, Lecture Notes in Comp Sci, 2007, 4532: Efforts are currently being made to combine the two scoring models together to provide a unified score, to increase glycosylation identification confidence by using the glycan type determined by HCD scoring model to validate the CID scoring model, to determine microheterogenties, and to implement user-friendly visualization controls. Software link: