Understanding Proteomics through Bioinformatics Chris Evelo BiGCaT Bioinformatics Group – BMT-TU/e & UM Masterclass Nutrigenomics; May 11 2004.

Slides:



Advertisements
Similar presentations
Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
Advertisements

Recombinant DNA Technology
BiGCaT Bioinformatics Hunting strategy of the bigcat.
BioASP Roadshow Maastricht; May Brought to you by: BioASP and BiGCaT Bioinformatics.
Characterization of Transcriptional Regulatory Networks controlling plant cell adaptation to environmental stresses.
Oncomine Database Lauren Smalls-Mantey Georgia Institute of Technology June 19, 2006 Note: This presentation contains animation.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Peter Tsai Bioinformatics Institute, University of Auckland
WAGENINGE N CENTRE FOR FOOD SCIENCES How You Can Contribute Wageningen Centre for Food Sciences is an alliance of research and food industry partners to.
The Sense of Sequense The Sense of Sequense Chris Evelo BiGCaT Bioinformatics Universiteit Maastricht.
The European Nutrigenomics Organisation Understanding what you find in the context of what is already known Chris Evelo BiGCaT Bioinformatics Maastricht.
Toxicology in the omics era. Chris Evelo BiGCaT Bioinformatics Group – BMT-TU/e & UM.
Bioinformatics Needs for the post-genomic era Dr. Erik Bongcam-Rudloff The Linnaeus Centre for Bioinformatics.
Tutorial 7 Genome browser. Free, open source, on-line broswer for genomes Contains ~100 genomes, from nematodes to human. Many tools that can be used.
Gene Expression And Regulation Bioinformatics January 11, 2006 D. A. McClellan
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
Introduction to BioInformatics GCB/CIS535
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
CSE182-L12 Gene Finding.
CHAPTER 15 Microbial Genomics Genomic Cloning Techniques Vectors for Genomic Cloning and Sequencing MS2, RNA virus nt sequenced in 1976 X17, ssDNA.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Proteomics: A Challenge for Technology and Information Science CBCB Seminar, November 21, 2005 Tim Griffin Dept. Biochemistry, Molecular Biology and Biophysics.
Transcriptional profiling I – microarrays and proteomics
1 Characterization, Amplification, Expression Screening of libraries Amplification of DNA (PCR) Analysis of DNA (Sequencing) Chemical Synthesis of DNA.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Microarrays: Theory and Application By Rich Jenkins MS Student of Zoo4670/5670 Year 2004.
Protein and Function Databases
Bioinformatics page 12, part of ch. 21 Cell and Mol Biol Lab.
Why microarrays in a bioinformatics class? Design of chips Quantitation of signals Integration of the data Extraction of groups of genes with linked expression.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
High Throughput Sequencing
A combination of the words Proteomics and Genomics. Proteogenomics commonly refer to studies that use proteomic information, often derived from mass spectrometry,
Microarray Gene Expression Data Analysis A.Venkatesh CBBL Functional Genomics Chapter: 07.
Biotechnology SB2.f – Examine the use of DNA technology in forensics, medicine and agriculture.
The European Nutrigenomics Organisation Deciding and acting on quality of microarray experiments in genomics Chris Evelo BiGCaT Bioinformatics Maastricht.
Detecting enriched regions (Chip- seq, RIP-seq) Statistical evaluation of enriched regions Data displayed in Genome Browser Detection of enriched motifs.
Finish up array applications Move on to proteomics Protein microarrays.
Remember the limitations? –You must know the sequence of the primer sites to use PCR –How do you go about sequencing regions of a genome about which you.
MPL Identification of alternative spliced mRNA variants related to cancers by genome-wide ESTs alignment KIM DAE SOO Oncogene Apr.
Biological Databases Biology outside the lab. Why do we need Bioinfomatics? Over the past few decades, major advances in the field of molecular biology,
The European Nutrigenomics Organisation Using pathway information to understand genomics results Chris Evelo BiGCaT Bioinformatics.
Systems Biology through Pathway Statistics Chris Evelo BiGCaT Bioinformatics Group – BMT-TU/e & UM Diepenbeek; May
Biological databases Exercises. Discovery of distinct sequence databases using ensembl.
RNA-Seq Primer Understanding the RNA-Seq evidence tracks on the GEP UCSC Genome Browser Wilson Leung08/2014.
BIOLOGICAL DATABASES. BIOLOGICAL DATA Bioinformatics is the science of Storing, Extracting, Organizing, Analyzing, and Interpreting information in biological.
Central dogma: the story of life RNA DNA Protein.
The European Nutrigenomics Organisation Pathway content improvement. How to store an expert’s brain and use it to understand omics. Chris Evelo NuGO WP7.
Microarray analysis Quantitation of Gene Expression Expression Data to Networks BIO520 BioinformaticsJim Lund Reading: Ch 16.
Bioinformatics support at School of Biological Sciences
GeWorkbench Overview Support Team Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT and Harvard.
Gene Expression Analysis Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
Around the triangle Chris Evelo BiGCaT Bioinformatics Maastricht May arrays QTLs paths.
Genomic Signal Processing Dr. C.Q. Chang Dept. of EEE.
1 Genomics Advances in 1990 ’ s Gene –Expressed sequence tag (EST) –Sequence database Information –Public accessible –Browser-based, user-friendly bioinformatics.
Transcriptome What is it - genome wide transcript abundance How do you obtain it - Arrays + MPSS What do you do with it when you have it - ?
Biotechnology and Bioinformatics: Bioinformatics Essential Idea: Bioinformatics is the use of computers to analyze sequence data in biological research.
Notes: Human Genome (Right side page)
Post translational modification n- acetylation Peptide Mass Fingerprinting (PMF) is an analytical technique for identifying unknown protein. Proteins to.
The Transcriptional Landscape of the Mammalian Genome
Post-GWAS and Mechanistic Analyses
Advanced PGDB Editing: Regulation GO Terms
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Proteomics Informatics David Fenyő
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Integrative omic approaches for the study of host–pathogen interactions Integrative omic approaches for the study of host–pathogen interactions (A) Proteomic.
Proteomics Informatics David Fenyő
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Changes of gene expression induced by 1 mmol/L valproic acid in vitro.
Presentation transcript:

Understanding Proteomics through Bioinformatics Chris Evelo BiGCaT Bioinformatics Group – BMT-TU/e & UM Masterclass Nutrigenomics; May

BiGCaT Bioinformatics Where the cat hunts

BiGCaT Bioinformatics, bridge between two universities Universiteit Maastricht Patients, Experiments, Arrays and Loads of Data TU/e Ideas & Experience in Data Handling BiGCaT

BiGCaT Bioinformatics, between two research fields Cardiovascular Research Nutritional & Environmental Research BiGCaT

If transcriptomics is: The study of genome wide gene expression on the transcriptional level Where genome wide means: >20K genes. And transcriptional level means that somehow >20K mRNA sequences have to be analyzed And >20K expression values have to be filtered, normalized, replicate treated, clustered and understood Thus no transcriptomics without bioinformatics

Gene expression arrays Microarrays: relative fluorescense signals. Identification. Macroarrays: absolute radioactive signal. Validation.

Then proteomics would be: The study of genome wide gene expression on the translational level Where genome wide would mean: >20K proteins. Then proteomics does not yet exist! Does it already need bioinformatics?

Identification of proteins found (method annotation) Antibody techniques: build in. You know what the antigen is or you wouldn’t use it. Mass identification: Fragment libraries derived from Swissprot Not normally a user (scientist) problem. Or practically build in as well. No current need for bioinformatics But please use Swissprot ID’s!!

Data filtering and normalization Appears to become a problem on antibody arrays (see yesterdays presentation by Rachelle van Haaften). Start with expertise from mRNA microarrays. Use bioinformatics to improve techniques Not to cover up problems

2 time Expr. level Clustering: find proteins with same expression patterns T1 signal T2 signal Left hand picture shows expression patterns for 2 proteins (these should probably end up in the same cluster). Right hand picture shows the expression vector for one protein for the first 2 dimensions. Can be normalized by amplitude (circle) or relatively (square).

Clustering and grouping of proteins with parallel expressions Fancy techniques clustering, principal component analysis, self organizing maps, etc. etc. But… Only useful for high numbers (and maybe not even then) Currently not important for proteomics But might be useful in combined mRNA/protein studies

Two things left Functional understanding of proteomics results Understanding protein modifications

Functional understanding Map changed proteins (quantitatively or qualitatively) to known pathways. Or use information from the Gene Ontology (GO) database Steal and smartly adapt a transcriptomics tool: GenMapp/MappfinderMappfinder Let me show you an example from a simple nutrigenomics (starvation) study. Data from Johan Renes.

Understanding protein modifications Map changed proteins (quantitatively or qualitatively) to known pathways. Or use information from the Gene Ontology (GO) database Steal and smartly adapt a transcriptomics tool: GenMapp/MappfinderMappfinder Let me show you an example from a simple nutrigenomics (starvation) study. Data from Johan Renes.

Protein variants derived from single genes Phosphorylation? Modification? Alternative splicing?Phosphorylation? Alternative splicing? Modification?

Understanding modifications Look up the protein in SwissProt For instance: – Glyceraldehyde 3-phosphate dehydrogenase Glyceraldehyde 3-phosphate dehydrogenase – Pyruvate kinase (note splice variants) Pyruvate kinase Or use Prosite Search For instance:Prosite Search – Glyceraldehyde 3-phosphate dehydrogenase with: PKC phosphorylation site and: its own GAPDH pattern Glyceraldehyde 3-phosphate dehydrogenasePKC phosphorylation siteits own GAPDH pattern Bioinformatics helps to see the possibilities

We should start developing Bioinformatics for Proteomics Now - To help improve the techniques - To make the most of the data - To prevent drowning in data in the future - And to really understand all that transcriptomics stuff