Presentation is loading. Please wait.

Presentation is loading. Please wait.

The European Nutrigenomics Organisation Understanding what you find in the context of what is already known Chris Evelo BiGCaT Bioinformatics Maastricht.

Similar presentations


Presentation on theme: "The European Nutrigenomics Organisation Understanding what you find in the context of what is already known Chris Evelo BiGCaT Bioinformatics Maastricht."— Presentation transcript:

1 the European Nutrigenomics Organisation Understanding what you find in the context of what is already known Chris Evelo BiGCaT Bioinformatics Maastricht The bioinformatics of proteomics

2 BiGCaT Bioinformatics To see the pattern might save you a lot of trouble

3 the European Nutrigenomics Organisation The transfer of information from DNA to protein. The transfer proceeds by means of an RNA intermediate called messenger RNA (mRNA). In procaryotic cells the process is simpler than in eucaryotic cells. In eucaryotes the coding regions of the DNA (in the exons,shown in color) are separated by noncoding regions (the introns). As indicated, these introns must be removed by an enzymatically catalyzed RNA-splicing reaction to form the mRNA. From: Alberts et al. Molecular Biology of the Cell, 3rd edn. Gene Expression

4 the European Nutrigenomics Organisation Transcriptomics Study of genome wide gene expression on the transcriptional level: >20k mRNA sequences must be annotated >20k expression values must be filtered, normalized, replicate treated, clustered and understood Therefore: No transcriptomics without bioinformatics

5 the European Nutrigenomics Organisation Proteomics would be Study of genome wide gene expression on the translational level Where genome wide would mean: >20K proteins. Then proteomics does not exist yet! Does it already need bioinformatics?

6 The genomics workflow

7 the European Nutrigenomics Organisation Identification Antibody techniques: build in. You know what the antigen is or you wouldn’t use it. Mass identification: Fragment libraries derived from UniProt Not normally a user (scientist) problem. Or practically: build in as well. No current need for bioinformatics But please use UniProt ID’s!!

8 the European Nutrigenomics Organisation Upcoming challenge Tiling arrays

9 the European Nutrigenomics Organisation Upcoming challenge Phosphorylation? Modification? Alternative splicing?Phosphorylation? Alternative splicing? Modification?

10 the European Nutrigenomics Organisation Do this exon wise?

11 the European Nutrigenomics Organisation Understanding Modifications Look up the protein in UniProt/ENSEMBL For instance: –Glyceraldehyde 3-phosphate dehydrogenaseGlyceraldehyde 3-phosphate dehydrogenase –Pyruvate kinase (note splice variants)Pyruvate kinase Or use Prosite Search For instance:Prosite Search –Glyceraldehyde 3-phosphate dehydrogenase with: PKC phosphorylation site and: its own GAPDH patternGlyceraldehyde 3-phosphate dehydrogenasePKC phosphorylation siteits own GAPDH pattern Bioinformatics helps to see the possibilities

12 the European Nutrigenomics Organisation Data filtering and normalization Use expertise from microarrays Use to find problems not to cover up From bad to acceptable: a bad move! Antibody example

13 Example of QC: Antibody Microarray BD Biosciences (Clontech) Chip-based technology Monoclonal antibodies printed at high density on a glass slide Profiling hundreds of proteins Analyses virtually any biological sample (cells, whole tissue and body fluids)

14 Content of antibody array

15

16 Two slides with flipped samples

17 Internally normalized results Sampling method controls for differences in labeling efficiency Internally Normalized Ratio can be calculated (represents the relative abundance of an antigen in sample A relative to that of sample B)

18 First arrays did not look good...

19 Array 2

20 Array 3

21 Technique improvement...

22

23 Less background problems but also less signal…

24 Spotfire analysis showed: Technique needs improvements! Location of the antibodies on the Microarray Some high background antibodies Procedure Normalization method

25 The genomics workflow

26 the European Nutrigenomics Organisation T1 signal T2 signal Clustering 2 time Expr. level Patterns for 2 proteins (these should probably end up in the same cluster). Expression vector for one protein for the first 2 dimensions. Normalized by amplitude (circle) or relatively (square).

27 the European Nutrigenomics Organisation Hierarchical Clustering

28 the European Nutrigenomics Organisation Fancy techniques clustering, principal component analysis, self organizing maps, etc. etc. But… Only useful for high numbers (and maybe not even then) Limited use for proteomics (low numbers) Might be useful in combined mRNA/protein studies

29 The genomics workflow

30 the European Nutrigenomics Organisation  Array - B a c k g r o u n d -

31 the European Nutrigenomics Organisation Understanding  Array data Typical procedure 1.Annotate the reporters with something useful (UniProt!) 2.Sort based on fold change 3.Search for your favorite genes/proteins 4.Throw away 95% of the array

32 the European Nutrigenomics Organisation

33 Understanding  Array data Typical procedure 1.Annotate the reporters with something useful (UniProt!) 2.Sort based on fold change 3.Search for your favorite genes/proteins 4.Throw away 95% of the array

34 the European Nutrigenomics Organisation Understanding  Array data “Advanced” procedures oGene clustering or principal component analysis oGet groups of genes with parallel expression patterns oUseful for diagnosis oNot adding much to understanding (unless combined)

35 the European Nutrigenomics Organisation Mapping Annotation/ coupling

36 the European Nutrigenomics Organisation Best known: GenMAPP Free, academic initiative with editable mapps, collaborates with NuGO

37 the European Nutrigenomics Organisation Best known: GenMAPP Full content of GO database Textbook like local mapps Geneboxes with active backpages, coupled to online databases Visualize anything numerical (fold changes on arrays, p-values, present calls, proteomics results) Update mapps yourself

38 the European Nutrigenomics Organisation GenMAPP: Full GO content

39 the European Nutrigenomics Organisation GenMAPP: Textbook like maps Extensive backpages present with links to online databases

40 the European Nutrigenomics Organisation 2D gels of 3T3-L1 (pre)-adipocytes Enlarged sections gels derived from: A: 3T3-L1 pre- adipocytes, B: 3T3-L1 adipocytes, C: 3T3-L1 adipocytes with caloric restriction D: 3T3-L1 adipocytes with caloric restriction and TNF-a.

41 the European Nutrigenomics Organisation GenMAPP: visualize anything numerical Example Proteomics results (2D gels with GC-MS identification). Fasting/feeding study shows regulation of glycolysis (data from Johan Renes, UM). Other useful things: - p-values, present calls - presence in clusters - presence in QTLs

42 the European Nutrigenomics Organisation MAPPfinder Ranks mapps where relatively many changes occur Useful to find unexpected pathways Statistics hardly developed

43 the European Nutrigenomics Organisation MAPPfinder z-score Number of genes/proteins changed on this mapp Expected number of changes Standard deviation of observed number many dependencies to overcome

44 the European Nutrigenomics Organisation MAPPfinder Next example from heart failure study (Schroen et al. Circ Res; 2004 95: 506-514)

45 the European Nutrigenomics Organisation GenMAPP: Full GO content

46 the European Nutrigenomics Organisation Update mapps yourself You can do anything. E.g. add genes, annotation, backpage information, graphics Next page shows a combination of metabolic mapps. “The Nutrigenomics Masterpiece” created by Milka Sokolović (AMC Amsterdam)

47

48 the European Nutrigenomics Organisation Scientist know GenMapp Advantages: Free, Runs on (high end) MS Windows, Relatively easy to use, Reasonable visualization, Some pathway statistics, Interesting content (Including GO, KEGG), Content editable, Adopting standards (e.g. BioPax), Open source.

49 the European Nutrigenomics Organisation Scientist know GenMapp Disadvantages: Small academic initiative, uncertain lifespan No info on reactions, metabolites, location No change (e.g. time course) visualization Hard to cope with ambiguous reporters (we are working on that) Content could be better!

50 the European Nutrigenomics Organisation Datasources GenMAPP local MAPPs: Largely created by a single postdoc (Dr.Kam Dahlquist).

51 the European Nutrigenomics Organisation Metacore example GeneGo, Inc Systems Reconstruction TM Technology www.genego.com

52 the European Nutrigenomics Organisation AgilentAffymetrixProteomicSAGE Concurrent visualization of different data types

53 the European Nutrigenomics Organisation GeneGo: primitive view of multiple conditions Can you really see what happens?

54 the European Nutrigenomics Organisation Build new network using Metacore TM from GeneGO Around p53 protein Making use of biological DB Filtered to reduce complexity: –for ‘rat ortholog’ –for ‘transcriptional regulation’ –for ‘liver’

55 the European Nutrigenomics Organisation

56 Filtering needed to reduce complexity

57 the European Nutrigenomics Organisation

58 The future We should develop Bioinformatics for Proteomics Now To help improve the techniques To make the most of the data To prevent drowning in data in the future And to really understand all that transcriptomics stuff


Download ppt "The European Nutrigenomics Organisation Understanding what you find in the context of what is already known Chris Evelo BiGCaT Bioinformatics Maastricht."

Similar presentations


Ads by Google