The challenge of bioinformatics Chris Glasbey Biomathematics & Statistics Scotland
Talk plan 1. DNA 2. mRNA 3. Protein 4. Genetic networks
1. DNA
Frank Wright et al BioSS
1.DNA
TOPALi
2. mRNA Prepare cDNA targets Label with fluorescent dyes Combine Equal Amounts Hybridise for hours Scanning
2. mRNA Scanner’s PMT setting is one of the sources of contamination. Scanner’s setting is to be raised to a certain level to make the weakly expressed genes visible. This may cause highly expressed genes to get censored (at 2 16 –1= 65535) expression values.
2. mRNA Censored spot Imputed values With GTI (Edinburgh)
2. mRNA Multiple scans
Mizan Khondoker
2. mRNA Jim McNicol
3. Proteins Electrophoresis gel Lars Pedersen DTU, Denmark
3. Proteins Protein separation by 1.pH 2.Mol. Wt.
3. Proteins gel 1 gel 2 How to compare gels 1 and 2?
3. Proteins John Gustafsson, Chalmers University, Sweden WARP
3. Proteins Two gels superimposed (in different colours)
3. Proteins Statistical Design 3 complete reps of 15 treatment combinations. (3 ecotypes by 5 heavy metals) Maximum of 1400 protein spots per gel Statistical Analyses Filter data – remove spots with low intensity values and low quality scores (leaving ~290 spots) Individual proteins – ANOVA, main effects and interactions
3. Proteins Principal Components Analysis Identify groups of proteins that are affected in a consistent manner by treatments Protein identity Loadings Jim McNicol
4. Genetic networks
Is it possible to infer the network from gene expression data such as these? Dirk Husmeier
4. Genetic networks Bayesian network
4. Genetic networks truth inferred
“I genuinely believe that we are living through the greatest intellectual moment in human history.” (Matt Ridley, Genome, 1999) “Grand Unified Systems Biology”