A New Strategy of Protein Identification in Proteomics Xinmin Yin CS Dept. Ball State Univ.

Slides:



Advertisements
Similar presentations
Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
Advertisements

From Genome to Proteome Juang RH (2004) BCbasics Systems Biology, Integrated Biology.
In-depth Analysis of Protein Amino Acid Sequence and PTMs with High-resolution Mass Spectrometry Lian Yang 2 ; Baozhen Shan 1 ; Bin Ma 2 1 Bioinformatics.
Molecular Genetics DNA RNA Protein Phenotype Genome Gene
How to identify peptides October 2013 Gustavo de Souza IMM, OUS.
Peptide Mass Fingerprinting
Mass Spectrometry in a drug discovery setting Claus Andersen Senior Scientist Sienabiotech Spa.
De Novo Sequencing v.s. Database Search Bin Ma School of Computer Science University of Waterloo Ontario, Canada.
Proteomics The proteome is larger than the genome due to alternative splicing and protein modification. As we have said before we need to know All protein-protein.
阮雪芬 Department of Life Aug 6, 2004
PROTEIN IDENTIFICATION BY MASS SPECTROMETRY. OBJECTIVES To become familiar with matrix assisted laser desorption ionization-time of flight mass spectrometry.
Proteomics: A Challenge for Technology and Information Science CBCB Seminar, November 21, 2005 Tim Griffin Dept. Biochemistry, Molecular Biology and Biophysics.
Announcements: Proposal resubmissions are due 4/23. It is recommended that students set up a meeting to discuss modifications for the final step of the.
Previous Lecture: Regression and Correlation
Scaffold Download free viewer:
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
My contact details and information about submitting samples for MS
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Evaluated Reference MS/MS Spectra Libraries Current and Future NIST Programs.
Proteome.
ESI and MALDI LC/MS-MS Approaches for Larger Scale Protein Identification and Quantification: Are They Equivalent? 1P. Juhasz, 1A. Falick,1A. Graber, 1S.
GSAT501 - proteomics Name, home-town Students – previous lab experience –Lab you hope to end up in? Teachers – what is your current project.
INF380 - Proteomics-91 INF380 – Proteomics Chapter 9 – Identification and characterization by MS/MS The MS/MS identification problem can be formulated.
Finish up array applications Move on to proteomics Protein microarrays.
Proteomics and annotation. Definition of proteomics Study of all the proteins in an organism Derived from genomics all the DNA in an organsim On some.
CS 461b/661b: Bioinformatics Tools and Applications Software Algorithm Mathematical Models Biology Experiments and Data.
High throughput Protein Measurement Techniques Harin Kanani.
Lecture 9. Functional Genomics at the Protein Level: Proteomics.
In-Gel Digestion Why In-Gel Digest?
Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function.
Software Project MassAnalyst Roeland Luitwieler Marnix Kammer April 24, 2006.
Proteomics What is it? How is it done? Are there different kinds? Why would you want to do it (what can it tell you)?
Central dogma: the story of life RNA DNA Protein.
1 From Mendel to Genomics Historically –Identify or create mutations, follow inheritance –Determine linkage, create maps Now: Genomics –Not just a gene,
Overview of Mass Spectrometry
EBI is an Outstation of the European Molecular Biology Laboratory. In silico analysis of accurate proteomics, complemented by selective isolation of peptides.
Separates charged atoms or molecules according to their mass-to-charge ratio Mass Spectrometry Frequently.
Proteome and Gene Expression Analysis Chapter 15 & 16.
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
Novel Peptide Identification using ESTs and Genomic Sequence Nathan Edwards Center for Bioinformatics and Computational Biology University of Maryland,
Salamanca, March 16th 2010 Participants: Laboratori de Proteomica-HUVH Servicio de Proteómica-CNB-CSIC Participants: Laboratori de Proteomica-HUVH Servicio.
Deducing protein composition from complex protein preparations by MALDI without peptide separation.. TP #419 Kenneth C. Parker SimulTof Corporation, Sudbury,
Proteomics Informatics (BMSC-GA 4437) Course Directors David Fenyö Kelly Ruggles Beatrix Ueberheide Contact information
2014 생화학 실험 (1) 6주차 실험조교 : 류 지 연 Yonsei Proteome Research Center 산학협동관 421호
Ch. 10 Molecular Biology of the Gene 10.6 – 10.8 The flow of genetic information from DNA to RNA to Protein.
Using Scaffold OHRI Proteomics Core Facility. This presentation is intended for Core Facility internal training purposes only.
Proteomics: Technology and Cell Signaling Presenter: Ido Tal Advisor: Prof. Michal Linial י " ג סיון תשע " ה.
RANIA MOHAMED EL-SHARKAWY Lecturer of clinical chemistry Medical Research Institute, Alexandria University MEDICAL RESEARCH INSTITUTE– ALEXANDRIA UNIVERSITY.
Date of download: 6/24/2016 Copyright © The American College of Cardiology. All rights reserved. From: Proteomic Strategies in the Search of New Biomarkers.
Yonsei Proteome Research Center Peptide Mass Finger-Printing Part II. MALDI-TOF 2013 생화학 실험 (1) 6 주차 자료 임종선 조교 내선 6625.
Post translational modification n- acetylation Peptide Mass Fingerprinting (PMF) is an analytical technique for identifying unknown protein. Proteins to.
Mass Spectrometry makes it possible to measure protein/peptide masses (actually mass/charge ratio) with great accuracy Major uses Protein and peptide identification.
Ribosomes and Protein Synthesis
Bottom-Up Proteomics Data collection
The Covalent Structure of Proteins
The Syllabus. The Syllabus Safety First !!! Students will not be allowed into the lab without proper attire. Proper attire is designed for your protection.
Gene Expression.
“Proteomics is a science that focuses on the study of proteins: their roles, their structures, their localization, their interactions, and other factors.”
Predicting Active Site Residue Annotations in the Pfam Database
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Proteomics Informatics David Fenyő
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Protein Synthesis Lecture 5
Volume 20, Issue 12, Pages (December 2013)
Mass Spectrometry THE MAIN USE OF MS IN ORG CHEM IS:
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Proteomics Informatics David Fenyő
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

A New Strategy of Protein Identification in Proteomics Xinmin Yin CS Dept. Ball State Univ.

Contents 1.Background. 2.Problems. 3.Proposal. 4.Conclusion.

What’s Proteomics ? Use of quantitative protein-level measurement of gene expression to characterize biological process and to decipher the mechanisms of gene expression control.

What’s Proteomics ? DNA RNA PROTEIN transcription translation

Current Research in Proteomics Quantitative measurement of proteins expressed in a cell * protein identification * protein quantification

Experimental Flow Chart of Protein Identification cell proteins peptides mass & sequence data processing

Laser Detector1 E1 MS1 E2 Peptides MS2 Detector2 Schematic of MALDI MS/MS Matrix Assisted Laser-induced Desorption Ionization

An Example of MALDI MS/MS Spectrum

Problem of Scaling up MALDI MS/MS to Proteomics Number of peptides is huge Human genome (~60,000 proteins) 2.6 X 10 6 peptides It will take THREE MONTHS to identify all proteins in a human cell.

My Proposal Avoid using MS2 if a peptide can be identified in MS1. Reduce size of peptide database during data analysis To develop a new data analysis method

Facts of MALDI MS/MS Experiment 40% peptides are unique in mass when mass accuracy of MALDI is better than 10ppm. Analyzing MS1 spectrum is much easier than analyzing MS2 spectrum. It takes about 1 to 2 min for analysis of a single MS2 spectrum through database.

My Proposal Using local protein database for data analysis. Developing a peptide filter. Automating data transfer from MS1 to MS2

Local Protein Database Download protein database from public sites. Digest proteins in computer. Select peptides according experimental methods, and store them in memory. Calculate peptide chemical modification. PURPOSE: 1. Make peptide database small according to experimental methods 2. Do online data processing

Generating Peptide Database Protein Database Enzyme Digestion Peptide Database Each protein can produce 30 ~ 100 peptides

Peptide Filter Identify peptide by mass alone. Sort identified peptides according to proteins. Delete rest peptides of the identified protein from peptide database. Pass unidentified mass information to MS2. PURPOSE: 1. Do online data processing 2. Reduce peptides database size 2. Reduce number of peptides passed to MS2

Identifying Peptide by Mass Alone M th M exp MM  M = M th – M exp If (  M <  ) N pep ++ N pep + + N pep : number of mass matched peptides  : mass accuracy of MS1

Reducing Size of Peptide Database If (N pep == 1) Peptide is unique; Deleting peptides from the same protein Else Send peptide to MS2 for sequence information.

FLOWCHARTFLOWCHART

Program Design Computer Language: C++ Classes:Protein Linked list Peptide

Conclusion 1. My program will generate a list of peptides from the protein database. 2. Peptide identification is carried out while doing experiment. 3. Data analysis time will be significantly reduced. 4. This program will make identification of proteins from a cell possible and reliable.