Integration of Mass Spectrometry with High- throughput Protein Crystallography Tarun Gheyi SGX Pharmaceuticals, Inc. / NYSGXRC April 14, 2008.

Slides:



Advertisements
Similar presentations
Tandem MS (MS/MS) on the Q-ToF2
Advertisements

Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
From Genome to Proteome Juang RH (2004) BCbasics Systems Biology, Integrated Biology.
Mass Spectrometry Kyle Chau and Andrew Gioe. Computation of Molecular Mass -Mass Spectrum is a plot of intensity as a function of mass- charge ratio,
High Throughput Protein Domain Elucidation by Limited Proteolysis-Mass Spectrometry Jeff Bonanno and Xia Gao Structural GenomiX, Inc.
UC Mass Spectrometry Facility & Protein Characterization for Proteomics Core Proteomics Capabilities: Examples of Protein ID and Analysis of Modified Proteins.
Nanoliter Solid Phase Extraction (SPE) Using Chromatographic Hollow Fibers for Sample Preparation for Mass Spectrometry Mukta M. Shukla 1, Ashok K. Shukla.
Oligonucleotide analysis by Linear MALDI-TOF mass spectrometry Stephen J. Hattan , Kenneth C. Parker, Marvin L. Vestal; SimulTOF Systems, 60 Union Ave,
Analyzing Biological and Organic Polymers by MALDI-TOF Jonathan A. Karty, Ph.D.
How to identify peptides October 2013 Gustavo de Souza IMM, OUS.
Peptide Mass Fingerprinting
Acknowledgements: We thank the LSMBO and the Structural Biology and Genomics Dept. members. This work was supported by funds from SPINE EEC QLG2-CT ,
Introduction to Proteomics CSC Gene Expression and Proteomics Simon Cockell Bioinformatics Support Unit Feb 2008.
Mass Spectrometry in the Biosciences: Introduction to Mass Spectrometry and Its Uses in a Company Like Decode. Sigurður V. Smárason, Ph.D. New Technologies.
PROTEIN IDENTIFICATION BY MASS SPECTROMETRY. OBJECTIVES To become familiar with matrix assisted laser desorption ionization-time of flight mass spectrometry.
Protein & Peptide Analysis Linda Breci Chemistry Mass Spectrometry Facility University of Arizona MS Summer Workshop.
Beta-Galactosidase Digest from 500 fmole Loaded on a 1-D Gel A. B. C. Bovine Serum Albumin Digest from 250 fmole Loaded on a 1-D Gel Automation of In-Gel.
LC-MS Lecture 7.
RESULTS Figure 1: Glycoprotein INTRODUCTION EXPERIMENTAL In this poster, we have studied the action of different enzymes on glycoproteins or their glycopeptides.
LC/MS WORKSHOP IOWA STATE UNIVERSITY Kamel Harrata  Instrument Description  Data Acquisition  Data Processing.
Proteomics Informatics – Protein identification II: search engines and protein sequence databases (Week 5)
Announcements: Proposal resubmissions are due 4/23. It is recommended that students set up a meeting to discuss modifications for the final step of the.
Previous Lecture: Regression and Correlation
De Novo Sequencing of MS Spectra
HOW MASS SPECTROMETRY CAN IMPROVE YOUR RESEARCH
Mass Spectrometry. What are mass spectrometers? They are analytical tools used to measure the molecular weight of a sample. Accuracy – 0.01 % of the total.
FIGURE 5. Plot of peptide charge state ratios. Quality Control Concept Figure 6 shows a concept for the implementation of quality control as system suitability.
My contact details and information about submitting samples for MS
Tryptic digestion Proteomics Workflow for Gel-based and LC-coupled Mass Spectrometry Protein or peptide pre-fractionation is a prerequisite for the reduction.
Comparison of chicken light and dark meat using LC MALDI-TOF mass spectrometry as a model system for biomarker discovery WP 651 Jie Du; Stephen J. Hattan.
Introduction : Standard methodologies for enzymatic digestions have changed little in the past 40 years. The same process for sample incubation with trypsin,
PSI Data Management and Reporting: Expectations, Standards and Utility J. Michael Sauder Director, Bioinformatics NYSGXRC Project Leader.
Production of polypeptides, Da, and middle-down analysis by LC-MSMS Catherine Fenselau 1, Joseph Cannon 1, Nathan Edwards 2, Karen Lohnes 1,
Chapter 9 Mass Spectrometry (MS) -Microbial Functional Genomics 조광평 CBBL.
PROTEIN STRUCTURE NAME: ANUSHA. INTRODUCTION Frederick Sanger was awarded his first Nobel Prize for determining the amino acid sequence of insulin, the.
Mass Spectrometry I Basic Data Processing. Mass spectrometry A mass spectrometer measures molecular masses. The mass unit is called dalton, which is 1/12.
Common parameters At the beginning one need to set up the parameters.
-Aluminum housing with o-ring seal allows for washing (salt removal), plate cleaning and regeneration enabling for multiple use -Sample elution for MS.
Additional file 1 1.1Workflow of large-scale proteomic analysis of normal human kidney glomerulus 1.2Detailed procedure of LC-MS/MS analysis Additional.
Protein-protein interactions “The Interactome” Yeast two-hybrid analysis Yeast two-hybrid analysis Protein chips Protein chips Biochemical purification/Mass.
A new "Molecular Scanner" design for interfacing gel electrophoresis with MALDI-TOF ThP Stephen J. Hattan; Kenneth C. Parker; Marvin L. Vestal SimulTof.
Lecture 9. Functional Genomics at the Protein Level: Proteomics.
In-Gel Digestion Why In-Gel Digest?
Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function.
Software Project MassAnalyst Roeland Luitwieler Marnix Kammer April 24, 2006.
Proteomics What is it? How is it done? Are there different kinds? Why would you want to do it (what can it tell you)?
Pulsed Field Gel Electrophoresis In normal electrophoresis - electrophoretic mobility is independent of molecular weight for large DNA (> 50 kbp) elongate.
Improving the Detection of Hydrophilic Peptides for Increased Protein Sequence Coverage and Enhanced Proteomic Analyses Brian S. Hampton 1 and Amos H.
A New Strategy of Protein Identification in Proteomics Xinmin Yin CS Dept. Ball State Univ.
EBI is an Outstation of the European Molecular Biology Laboratory. In silico analysis of accurate proteomics, complemented by selective isolation of peptides.
Separates charged atoms or molecules according to their mass-to-charge ratio Mass Spectrometry Frequently.
Chem. 133 – 4/26 Lecture. Announcements Return graded quiz and additional problem Lab – Lab report deadlines (2:4 – Thursday) Today’s Lecture – Mass Spectrometry.
2014 생화학 실험 (1) 6주차 실험조교 : 류 지 연 Yonsei Proteome Research Center 산학협동관 421호
RANIA MOHAMED EL-SHARKAWY Lecturer of clinical chemistry Medical Research Institute, Alexandria University MEDICAL RESEARCH INSTITUTE– ALEXANDRIA UNIVERSITY.
Date of download: 6/24/2016 Copyright © The American College of Cardiology. All rights reserved. From: Proteomic Strategies in the Search of New Biomarkers.
Yonsei Proteome Research Center Peptide Mass Finger-Printing Part II. MALDI-TOF 2013 생화학 실험 (1) 6 주차 자료 임종선 조교 내선 6625.
An introduction to Mass spectrometry. What is mass spectrometry? Analytical tool measuring molecular weight (MW) of sample Only picomolar concentrations.
Peptide de novo sequencing Peptide de novo sequencing is the analytical process that derives a peptide’s amino acid sequence from its tandem mass spectrum.
Post translational modification n- acetylation Peptide Mass Fingerprinting (PMF) is an analytical technique for identifying unknown protein. Proteins to.
Peptide Mass Finger-Printing Part II. MALDI-TOF
Mass Spectrometry 101 (continued) Hackert - CH 370 / 387D
The Syllabus. The Syllabus Safety First !!! Students will not be allowed into the lab without proper attire. Proper attire is designed for your protection.
A sensitive and repeatable method for characterization of sulfonamides and trimethoprim in honey using QuEChERS extracts with Liquid-Chromatography-Tandem.
2 Dimensional Gel Electrophoresis
Figure SI-15. Detailed experimental procedures.
Bioinformatics Solutions Inc.
Interpretation of Mass Spectra I
Mass Spectrometry THE MAIN USE OF MS IN ORG CHEM IS:
Shotgun Proteomics in Neuroscience
Interpretation of Mass Spectra
Presentation transcript:

Integration of Mass Spectrometry with High- throughput Protein Crystallography Tarun Gheyi SGX Pharmaceuticals, Inc. / NYSGXRC April 14, 2008

Introduction In an effort to support various groups in the platform (protein production, crystallization and crystallography), we have made our goals simple: 1. Use as small amount of protein as we can 2. Provide as much information as possible on the “precious” protein sample 3. Reduce the analysis time to synchronize with the high-throughput activities in the platform

Introduction MS: established tool for analysis of bio-molecules in solution. MALDI-TOF MS (2000 ppm) + SDS-gel electrophoresis An accurate measure of sample purity ESI-MS (200 ppm) Mass accuracy At SGX, as part of the NYSGXRC activity, MALDI-MS and ESI-MS are routinely used to monitor the quality of proteins prior to initiation of crystallization trials

MS Lab-Instrumentation LC-ESI-MS (single quad; accurate mass) MALDI-TOF (Voyager DE-RP; Linear Mode) MALDI-TOF (Voyager DE-STR; Reflectron) capLC-ESI-Q-Iontrap (MS/MS analysis)

MS Criteria Intact MS analysis Mass accuracy: ± 260 Da (~ 2 mutations) If dM > 260 Da then protein characterization by tandem MS is performed to verify that it is intended protein. If yes, then DNA sequencing is done to identify all mutations Purity: ≥ 80 % If contaminants are present with the intended protein then an extra purification step (Mono Q- column) is included if there is sufficient protein

Intact protein MS analysis-Purity Representative sample that Pass MS status Clone 9252b1BCt9p1, PID 10963, Pool 1

Intact protein MS analysis-Purity Clone 8662a5KWg2h1, PID 10553, Pool 1 Status: Failed Mass Spec

Intact protein MS analysis-Purity Clone 10120g2BSt20p1, PID 11490, Pool 1 Status: “Passed” Mass Spec BUT

Intact protein MS analysis-Crystals Da MALDI-MS spectrum showing the MW of the protein crystallized Crystal was washed with the buffer of the crystallization condition to remove non-crystallized protein and PEG Thin layer technique is used to spot sample on the MALDI plate that usually is more sensitive than other techniques for large biomolecules.

Intact protein MS analysis-SeMet incorporation Standard M9 HY SeMet 60mg/L HY SeMet 90mg/L Mass Spectrum showing the affect of concentration of SeMet in M9 and HY media

Intact protein MS analysis- SeMet incorporation Mass Spectrum showing the affect of concentration of SeMet in HY media mg/L Conclusion Protein with a “High” solubility rating and less than 10 Methionines in its sequence can have nearly full incorporation at 90 mg/L SeMet. A protein with more than 10 Methionines in its sequence can have nearly full incorporation at a concentration of 120 mg/L of SeMet.

Conclusions-Intact protein MS analysis Sample purity and identity is routinely monitored on protein samples If crystals are obtained on a sample where contaminants were also observed with the intended protein, MS analysis of the crystals can be performed (on request basis) to confirm it’s identity. Percent SeMet incorporation is routinely monitored. Heterogeneity due to unknown PTMs are routinely monitored and pursued accordingly (Please check poster with title “Bottlenecks/Solutions for the Amidohydrolase Protein Superfamily” for more information)

MS/MS analysis is performed on all protein samples that have mass discrepancies, as observed by ESI-MS, to determine their true identity. A batch of 15 samples (15 ug each sample) is simultaneously subjected to trypsin digestion (30:1, protein:enzyme) for 14 hrs at 37°C. MS/MS analysis is performed on the batch of 15 digested samples using using ESI-quadrupole-ion trap mass spectrometer with online capillary-HPLC. An example as a representative of different sources of mass discrepancies will be discussed further in this presentation. Tandem Mass Spectrometry (MS/MS)

HPLC instrument conditions Instrument: (Agilent Technologies 1100 series). Flow Rate: 5 uL/min Column: Zorbax 300SB C-18; 3.5 uM particle size ; 150 X 0.3 mm Solvent A: 95% H 2 O, 5% Acetonitrile, 0.1% Formic Acid Solvent B: 5% H 2 O, 95% Acetonitrile, 0.1% Formic Acid Gradient: 0-10 min: 100% A min: 0-100% B min: 100% B min: 100% A High Performance Liquid Chromatography (HPLC)-Tandem Mass Spectrometry (MS/MS)

MS/MS instrument conditions Instrument: Finnigan LCQ DECA (Thermoquest, San Jose, CA, USA) ion-trap mass analyzer equipped with ESI source. The experimental conditions were as follows: Duration of experiment90 min Number of scan events6 MS mass range Da Default charge state2 Normalization collision energy35% (of the maximum) Activation Q25 eV Activation time30 msec Activating gasHelium ESI capillary tip voltage4.20 kV ESI source temperature180ºC Data dependent MS/MS mode is used. Sequence-specific ions of the amino acids are used to search a non-redundant protein database with the TurboSequest search engine (Bioworks 3.3) to identify proteins. The amino acid sequence of the identified protein is searched against in-house SGX_Gold database to identify the targets.

MS/MS analysis- Identification of “Mix-ups” dM of – 8026 Da was observed in Clone 10337p1BCt11p1 PID Pool 1. Trypsin digestion and MS/MS analysis followed by a database search identified this protein as clone 9436c1BCt12p1. Subsequently, the theoretical mass of clone 9436c1BCt12p1 also matched with the observed mass of this clone. In the similar fashion, a total of 56 protein samples were identified and were associated with the right clones. The MS/MS spectrum of a doubly charged ion at m/z The peptide was identified as AWTPAIAVEVLNSVR (MW Da) that belongs to clone 9436c.

A dM of Da was observed in Clone 9257a1BCt12p1 PID Pool 1 and MS/MS analysis identified the tryptic peptides as part of the intended protein. DNA and glycerol stock sequencing did not identify any mutations or errors and hence the dM observed was attributed to in-cell proteolysis. Another observation that supported this argument was none of the peptides that were identified covered the C-terminal sequence. Moreover, by accurate mass measurement the observed mass matched with A[24-426]H. Using this method a total of 25 protein samples were identified and were associated with right sequence. MS/MS analysis- Adventitious proteolysis The MS/MS spectrum of a doubly charged ion at m/z The peptide was identified as IWNGYSPLGLR (MW Da) that belongs to clone 9257a.

MS/MS analysis- Molecular Biology Errors dM of Da was observed in Clone 11012k2BCt2p1 PID Pool 1. MS/MS analysis identified it as an intended clone. Plasmid and glycerol stock sequencing identified that there was a DNA deletion in the vector His tag causing a frame shift that read through the stop until the next one was encountered ~2150 Da downstream. Using this method a total of 29 protein samples were identified that were followed by sequencing and rectified sequences were uploaded in LIMS. The MS/MS spectrum of a doubly charged ion at m/z The peptide was identified as AM#TLHLLDLSPER (MW Da) that belongs to clone 11012k.

Conclusions-Tandem Mass Spectrometry In 2007, ~1400 protein samples were analyzed by ESI and MALDI-MS. Out of these 120 samples of questionable identity were further analyzed by MS/MS. Using mass spectrometry we have identified the following common problems that can lead to mass discrepancies and number of structures thus benefited from it: Results indicated that these fit into the above groups as follows: 1. Clone “mix-up”: 56 of which 8 resulted in structures; 2. Cloning, etc. artifacts: 29 of which 3 resulted in structures; 3. Truncation by proteolysis: 25 of which 3 resulted in structures; 4. Unwanted E. coli contaminant: 10, none of which were pursued for structure determination. In conclusion, our standard MS quality control analysis procedures contributed substantially to the success of 14 of a total 158 NYSGXRC PDB depositions during calendar 2007 and allowed us to avoid wasting effort on 10 inadvertently purified protein samples.

In the end In an effort to support various groups in the platform (protein production, crystallization and crystallography), we have made our goals simple: Use as less sample as we can-Never used more than 100ug of sample under study Provide as much information possible on the “precious” protein sample-Protein purity, Identity, PTM study, crystal analysis, SeMet incorporation, In-gel trypsinization MS/MS analysis, Identification of mix-ups, In-cell proteolysis, Molecular Biology errors etc Reduce the analysis time to synchronize with the high- throughput activities in the platform-use of high throughput MS approach helps us to reduce analysis time

Acknowledgement This work was supported by SGX Pharmaceuticals, Inc. and NIH Grant U54 GM (Principal Investigator: Stephen K. Burley)