Statistical Bioinformatics Genomics Transcriptomics Proteomics Systems Biology.

Slides:



Advertisements
Similar presentations
Martin John Bishop UK HGMP Resource Centre Hinxton Cambridge CB10 1 SB
Advertisements

Machine learning methods for the analysis of heterogeneous, multi- source data Ilkka Huopaniemi Statistical machine learning and.
Characterization of Transcriptional Regulatory Networks controlling plant cell adaptation to environmental stresses.
Gene Regulation in Eukaryotic Cells. Gene regulation is complex Regulation, and therefore, expression of a gene is complex. Regulation of these genes.
Transcriptomics Breakout. Topics Discussed Transcriptomics Applications and Challenges For Each Systems Biology Project –Host and Pathogen Bacteria Viruses.
1 MicroArray -- Data Analysis Cecilia Hansen & Dirk Repsilber Bioinformatics - 10p, October 2001.
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
CISC667, F05, Lec26, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) Genetic networks and gene expression data.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
Introduction to BioInformatics GCB/CIS535
GCB/CIS 535 Microarray Topics John Tobias November 8th, 2004.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
CISC667, F05, Lec24, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) DNA Microarray, 2d gel, MSMS, yeast 2-hybrid.
Problems at the interface between statistics, biomathematics and bioinformatics Chris Glasbey Biomathematics & Statistics Scotland.
25 and 27 February, 2004 Chapter 6C Proteomics Structural and Functional Characterization in the Post- genomic era.
Applications of protomic Presented By: Muhammad Rizwan Roll no: Department of Bioinformatics.
Proteomics Understanding Proteins in the Postgenomic Era.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Paola CASTAGNOLI Maria FOTI Microarrays. Applicazioni nella genomica funzionale e nel genotyping DIPARTIMENTO DI BIOTECNOLOGIE E BIOSCIENZE.
Bayesian integration of biological prior knowledge into the reconstruction of gene regulatory networks Dirk Husmeier Adriano V. Werhli.
Statistical Bioinformatics QTL mapping Analysis of DNA sequence alignments Postgenomic data integration Systems biology.
A number of slides taken/modified from:
Biotechnology Overview
Paper in “Evolution and Development” The Evolution of Signaling Pathways In Animal Development Andr é Pires-daSilva & Ralf J. Sommer Nature Reviews Genetics,
Biotechnology SB2.f – Examine the use of DNA technology in forensics, medicine and agriculture.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
CDNA Microarrays MB206.
Genomics in Drug Organon, Oss Tim Hulsen.
Gene expression and DNA microarrays Old methods. New methods based on genome sequence. –DNA Microarrays Reading assignment - handout –Chapter ,
Gene Regulatory Network Inference. Progress in Disease Treatment  Personalized medicine is becoming more prevalent for several kinds of cancer treatment.
Introduction to Bioinformatics Spring 2002 Adapted from Irit Orr Course at WIS.
Reverse engineering gene regulatory networks Dirk Husmeier Adriano Werhli Marco Grzegorczyk.
Learning regulatory networks from postgenomic data and prior knowledge Dirk Husmeier 1) Biomathematics & Statistics Scotland 2) Centre for Systems Biology.
Sequence analysis – an overview A.Krishnamachari
1 RNA Bioinformatics Genes and Secondary Structure Anne Haake Rhys Price Jones & Tex Thompson.
The challenge of bioinformatics Chris Glasbey Biomathematics & Statistics Scotland.
Discovering the Correlation Between Evolutionary Genomics and Protein-Protein Interaction Rezaul Kabir and Brett Thompson
Problem Limited number of experimental replications. Postgenomic data intrinsically noisy. Poor network reconstruction.
Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John.
Epidemiology 217 Molecular and Genetic Epidemiology Bioinformatics & Proteomics John Witte.
Central dogma: the story of life RNA DNA Protein.
Learning Bayesian networks from postgenomic data with an improved structure MCMC sampling scheme Dirk Husmeier Marco Grzegorczyk 1) Biomathematics & Statistics.
KEY CONCEPT Biotechnology relies on cutting DNA at specific places.
IMPROVED RECONSTRUCTION OF IN SILICO GENE REGULATORY NETWORKS BY INTEGRATING KNOCKOUT AND PERTURBATION DATA Yip, K. Y., Alexander, R. P., Yan, K. K., &
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
Recombination breakpoints Family Inheritance Me vs. my brother My dad (my Y)Mom’s dad (uncle’s Y) Human ancestry Disease risk Genomics: Regions  mechanisms.
Bioinformatics and Computational Biology
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Identification of co-expression networks by comparison of a multitude of different functional states of genome activity Marc Bonin 1, Stephan Flemming.
Reverse engineering of regulatory networks Dirk Husmeier & Adriano Werhli.
Shankar Subramaniam University of California at San Diego Data to Biology.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Post-genomic Virology The impact of bioinformatics, microarrays and proteomics on investigating host and pathogen interactions Steven Masson.
1 Genomics Advances in 1990 ’ s Gene –Expressed sequence tag (EST) –Sequence database Information –Public accessible –Browser-based, user-friendly bioinformatics.
HISPIG – A Discriminative Model Refinement Approach with Iterations for Detecting Regulatory Regions Takuma Tsukahara
Effect of Alcohol on Brain Development NormalFetal Alcohol Syndrome.
9.1 Manipulating DNA KEY CONCEPT Biotechnology relies on cutting DNA at specific places.
Biotechnology Kline FHS. What can biotechnology do? Reunite families? Identify a criminal? Find your baby daddy? Clone your pet that died? Make new vaccines?
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Biotechnology.
New genes can be added to an organism’s DNA.
Scientists use several techniques to manipulate DNA.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Introduction to Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

Statistical Bioinformatics Genomics Transcriptomics Proteomics Systems Biology

Statistical Bioinformatics Genomics Transcriptomics Proteomics Systems Biology

Multiple Sequence Alignment (MSA)

Multiple Sequence Alignments (MSA):

Some past forces shaping MSAs Divergence of sequences by speciation and nucleotide substitution (Phylogenetics). Horizontal gene transfer (recombination), especially in bacteria and viruses.

TOPALi v.1 Recombination detection Frank Wright,Iain Milne & Dirk Husmeier

TOPALi applied to Roseburia and Eubacterium sequences

Some past forces shaping MSAs Divergence of sequences by speciation and nucleotide substitution (Phylogenetics). Horizontal gene transfer (recombination), especially in bacteria and viruses. Selective pressure acting on functional domains.

TOPALi v2 Future plans Detect genomic regions under selective pressure  functional domains in proteins Methodology development: combined prediction of breakpoints due to recombination and evolutionary rate change. Improved phylogenetic analysis Investigate use of UK GRID computational resources for faster analyses

Statistical Bioinformatics Genomics Transcriptomics Proteomics Systems Biology

Genes differently expressed between two conditions –Affymetrix microarray Mouse liver experiment –Low fat diet vs high fat diet (6 per group) –Plot of log-fold change vs. average log intensity. –Points far away from the horizontal line seem “differentially expressed”. –Which are significant?

Statistical Methods (SAM, Limma,…) help to detect significant genes BUT: Many methods assume that the variances in both groups are the same If this is not the case: –Algorithms might give wrong answers –The definition of “differential expression” becomes more difficult

Claus Mayer (BioSS) More complex statistical tests for detecting differential gene expression. Situations where standard assumptions are violated. Allows for different variance-covariance structures in both populations.

Statistical Bioinformatics Genomics Transcriptomics Proteomics Systems Biology

Proteomics: 2-D Gels gel 1 gel 2 How to compare gels 1 and 2?

Chris Glasbey: Nonlinear Warping John Gustafsson, Chalmers University, Sweden WARP

2-D Gel Comparison Two gels superimposed (in different colours)

Proteomics: 2-D Gel Interpretation Graham Horgan Identify spots which differ between treatments using variance and covariance information from other spots  differently expressed proteins Assessment of associations between spot densities and physiological variables.

Statistical Bioinformatics Genomics Transcriptomics Proteomics Systems Biology

Detect active pathways in a “known” network Network of protein-protein and protein-DNA interactions “known” from the literature Gene expression profiling for different conditions –Bacterial strains: promoting - preventing inflammation –Mice on a low-fat vs. high-fat diet Can we identify different pathways associated with these conditions? We need a robust method –Expression data: noisy, missing values –Post-translational modifications

Cytokine Network Collaboration with SCGTI Interferon Pathway –Cytokines –Pivotal role in modulating the innate and adaptive mammalian immune system Network of protein-protein and protein-DNA interactions from the literature Two gene expression times series from bone marrow-derived macrophages in mice –Infected with cytomegalovirus –Infected and treated with IFN-gamma

Reverse Engineering of Regulatory Networks Can we learn the network structure from postgenomic data themselves? Statistical methods to distinguish between –Direct correlations –Indirect correlations Challenge: Distinguish between –Correlations –Causal interactions Breaking symmetries with active interventions: –Gene knockouts (VIGs, RNAi)

Evaluation: Raf signalling pathway Cellular signalling network of 11 phosphorylated proteins and phospholipids in human immune systems cell Laboratory data from cytometry experiments –Down-sampled to 100 measurements –Sample size indicative of microarray experiments Two types of experiments: –Passive observations –Active interventions (gene knockouts) Literature: “gold-standard” network