Top-down characterization of proteins in bacteria with unsequenced genomes Colin Wynne Catherine Fenselau University of Maryland, College Park Nathan Edwards.

Slides:



Advertisements
Similar presentations
Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
Advertisements

Proteomics and Glycoproteomics (Bio-)Informatics of Protein Isoforms Nathan Edwards Department of Biochemistry and Molecular & Cellular Biology Georgetown.
UC Mass Spectrometry Facility & Protein Characterization for Proteomics Core Proteomics Capabilities: Examples of Protein ID and Analysis of Modified Proteins.
In-depth Analysis of Protein Amino Acid Sequence and PTMs with High-resolution Mass Spectrometry Lian Yang 2 ; Baozhen Shan 1 ; Bin Ma 2 1 Bioinformatics.
N-Glycopeptide Identification from CID Tandem Mass Spectra using Glycan Databases and False Discovery Rate Estimation Kevin B. Chandler, Petr Pompach,
Generalized Protein Parsimony and Spectral Counting for Functional Enrichment Analysis Nathan Edwards Department of Biochemistry and Molecular & Cellular.
Improving the Sensitivity of Peptide Identification for Genome Annotation Nathan Edwards Department of Biochemistry and Molecular & Cellular Biology Georgetown.
PepArML: A model-free, result-combining peptide identification arbiter via machine learning Xue Wu, Chau-Wen Tseng, Nathan Edwards University of Maryland,
Peptide Identification by Tandem Mass Spectrometry Behshad Behzadi April 2005.
Sangtae Kim Ph.D. candidate University of California, San Diego
Proteomics Informatics – Protein identification II: search engines and protein sequence databases (Week 5)
Proteomics Informatics Workshop Part I: Protein Identification
Previous Lecture: Regression and Correlation
Scaffold Download free viewer:
My contact details and information about submitting samples for MS
Proteomic Characterization of Alternative Splicing and Coding Polymorphism Nathan Edwards Center for Bioinformatics and Computational Biology University.
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Evaluated Reference MS/MS Spectra Libraries Current and Future NIST Programs.
Karl Clauser Proteomics and Biomarker Discovery Taming Errors for Peptides with Post-Translational Modifications Bioinformatics for MS Interest Group ASMS.
Introduction Recent research has proposed rapid and robust identification of intact microorganisms using matrix assisted laser desorption/ ionization time-of-flight.
Production of polypeptides, Da, and middle-down analysis by LC-MSMS Catherine Fenselau 1, Joseph Cannon 1, Nathan Edwards 2, Karen Lohnes 1,
Improving Genome Annotation using Proteomics Nathan Edwards Center for Bioinformatics and Computational Biology University of Maryland, College Park.
Proteomic Characterization of Alternative Splicing and Coding Polymorphism Nathan Edwards Center for Bioinformatics and Computational Biology University.
Nathan Edwards Center for Bioinformatics and Computational Biology
Top-down characterization of proteins in bacteria with unsequenced genomes Nathan Edwards Georgetown University Medical Center.
Illinois Bio-Grid Grid Computing The Illinois Bio-Grid Alexander B. Schilling, Ph.D. University of Chicago Proteomics Core Lab
INF380 - Proteomics-91 INF380 – Proteomics Chapter 9 – Identification and characterization by MS/MS The MS/MS identification problem can be formulated.
Direct Experimental Observation of Functional Protein Isoforms by Tandem Mass Spectrometry Nathan Edwards Center for Bioinformatics and Computational Biology.
Generating Peptide Candidates from Protein Sequence Databases for Protein Identification via Mass Spectrometry Nathan Edwards Informatics Research.
Acknowledgements This work is supported by NSF award DBI , and National Center for Glycomics and Glycoproteomics, funded by NIH/NCRR grant 5P41RR
Common parameters At the beginning one need to set up the parameters.
Analysis of Complex Proteomic Datasets Using Scaffold Free Scaffold Viewer can be downloaded at:
Protein and Peptide Sequencing by FTMS Susan Martin.
Laxman Yetukuri T : Modeling of Proteomics Data
Search Engine Result Combining Nathan Edwards Department of Biochemistry and Molecular & Cellular Biology Georgetown University Medical Center.
Protein bioinformatics and systems biology Nathan Edwards Department of Biochemistry and Molecular & Cellular Biology Georgetown University Medical Center.
Panel Discussion: Reference Databases Nathan Edwards Georgetown University Medical Center.
Improving the Sensitivity of Peptide Identification for Genome Annotation Nathan Edwards Department of Biochemistry and Molecular & Cellular Biology Georgetown.
Precursor m/z double charge iTRAQ ratio 114: iTRAQ ratio 116: iTRAQ ratio 121: Precursor m/z triple charge iTRAQ.
Proteomic Characterization of Alternative Splicing and Coding Polymorphism Nathan Edwards Center for Bioinformatics and Computational Biology University.
Poster produced by Faculty & Curriculum Support (FACS), Georgetown University Medical Center Application of meta-search, grid-computing, and machine-learning.
Glycoprotein Microheterogeneity via N-Glycopeptide Identification Kevin Brown Chandler, Petr Pompach, Radoslav Goldman, Nathan Edwards Georgetown University.
CSE182 CSE182-L11 Protein sequencing and Mass Spectrometry.
Novel Peptide Identification using ESTs and Genomic Sequence Nathan Edwards Center for Bioinformatics and Computational Biology University of Maryland,
Improving the Sensitivity of Peptide Identification Nathan Edwards Department of Biochemistry and Molecular & Cellular Biology Georgetown University Medical.
Faster, more sensitive peptide identification from tandem mass spectra by sequence database compression Nathan J. Edwards Center for Bioinformatics & Computational.
Eat Raw & Fresh: Introducing isotopic Mass-to-charge Ratio and Envelope Fingerprinting (iMEF) and ProteinGoggle for Protein Database Search Zhixin(Michael)
Overview of Mass Spectrometry
EBI is an Outstation of the European Molecular Biology Laboratory. In silico analysis of accurate proteomics, complemented by selective isolation of peptides.
Aggressive Enumeration of Peptide Sequences for MS/MS Peptide Identification Nathan Edwards Center for Bioinformatics and Computational Biology.
Improving the Sensitivity of Peptide Identification by Meta-Search, Grid-Computing, and Machine-Learning Nathan Edwards Georgetown University Medical Center.
Improving the Sensitivity of Peptide Identification for Genome Annotation Nathan Edwards Department of Biochemistry and Molecular & Cellular Biology Georgetown.
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
Novel Peptide Identification using ESTs and Genomic Sequence Nathan Edwards Center for Bioinformatics and Computational Biology University of Maryland,
Deducing protein composition from complex protein preparations by MALDI without peptide separation.. TP #419 Kenneth C. Parker SimulTof Corporation, Sudbury,
Constructing high resolution consensus spectra for a peptide library
김지형. Introduction precursor peptides are dynamically selected for fragmentation with exclusion to prevent repetitive acquisition of MS/MS spectra.
a) b) c) d) e)
Post translational modification n- acetylation Peptide Mass Fingerprinting (PMF) is an analytical technique for identifying unknown protein. Proteins to.
Mass Spectrometry 101 (continued) Hackert - CH 370 / 387D
Protein/Peptide Quantification
Proteomics Informatics David Fenyő
Proteomics Informatics –
Top-down protein identification.
2D-LC-MS/MS analysis of tryptic digest of HEK293-SUMO3 cells (2 μg inj
Proteomics Informatics David Fenyő
Operation manual of AI SIDA
Presentation transcript:

Top-down characterization of proteins in bacteria with unsequenced genomes Colin Wynne Catherine Fenselau University of Maryland, College Park Nathan Edwards Georgetown University Medical Center

2 Microorganism Identification Important application of mass spectrometry Match spectra with sequence for identity Many bacteria will never be sequenced... Pathogen simulants, for example...but many have – about 1000 to date. Can we use the available sequence to identify proteins from unsequenced bacteria? Yes, for some proteins in some organisms! Yersinia rohdei, Erwinia herbicola, Enterobacter cloacae

3 Intact protein LC-MS/MS Crude cell lysate Capilary HPLC C8 column LTQ-Orbitrap XL Precursor scan: 400 m/z Data-dependent precursor selection: 5 most abundant ions 10 second dynamic exclusion Charge-state +3 or greater CID product ion scan 400 m/z

4 CID Protein Fragmentation Spectrum from Y. rohdei

5 Enterobacteriaceae Protein Sequences Exhaustive set of all Enterobacteriaceae protein sequences from Swiss-Prot, TrEMBL, RefSeq, Genbank, and CMR Plus, Glimmer3 predictions on Enterobacteriaceae genomes from RefSeq Primary and alternative translation start-sites Filter for intact mass in range 1 kDa – 20 kDa 253,626 distinct protein sequences, 256 species Derived from "Rapid Microorganism Identification Database" (RMIDb.org) infrastructure.

6 ProSightPC 2.0 Product ion scan decharging Enabled by high-resolution fragment ion measurements THRASH algorithm implementation Absolute mass search mode 15 ppm fragment ion match tolerance 250 Da precursor ion match tolerance "Single-click" analysis of entire LC-MS/MS datafile.

7 CID Protein Fragmentation Spectrum from Y. rohdei Match to Y. pestis 50S Ribosomal Protein L32

8 Identified E. herbicola proteins 30S Ribosomal Protein S19 m/z , z 15+, E-value 1.96e-16, Δ Six proteins identified with |Δ| < 0.02

9 DNA-binding protein HU-alpha m/z , z 13+, E-value 7.5e-26, Δ Eight proteins identified with "large" |Δ| Identified E. herbicola proteins

10 DNA-binding protein HU-alpha m/z , z 13+, E-value 1.91e-58, Δ 0.11 Use "Sequence Gazer" to find mass shift Identified E. herbicola proteins

11 DNA-binding protein HU-alpha m/z , z 13+, E-value 7.5e-26, Δ Extract N- and C-terminus sequence supported by at least 3 b- or y-ions Identified E. herbicola proteins

12 E. herbicola protein sequences

13 E. herbicola sequences found in other species

14 Phylogenetic placement of E. herbicola Phylogram Cladogram phylogeny.fr – "One-Click"

15 Genome Annotation Correction Serratia proteamaculans CSR, RPS19 Citrobacter koseri RPL32 Enterobacter sakazakii RPS21 RPL30 Enterobacter sakazakii Sodalis glossinidius Photorhabdus luminescens* Erwinia tasmaniensis Enterobacter sp. 638 Some spectra match Glimmer predictions only!

16 Conclusions Protein identification for unsequenced organisms. Identification and localization for sequence mutations and post-translational modifications. Extraction of confidently established sequence suitable for phylogenetic analysis. Genome annotation correction. New paradigm for phylogenetic analysis?

17 Acknowledgements Dr. Catherine Fenselau Colin Wynne, Joe Cannon University of Maryland Biochemistry Dr. Yan Wang University of Maryland Proteomics Core Dr. Art Delcher University of Maryland CBCB Funding: NIH/NCI

18 Shared "Biomarker" Proteins

19 Shared "Biomarker" Proteins