CBS Ph.D. course, 14. april 2005 Using Microarrays to Study Cell Cycle Regulation Ulrik de Lichtenberg Center for Biological Sequence Analysis The Technical.

Slides:



Advertisements
Similar presentations
Microarray statistical validation and functional annotation
Advertisements

STRING Prediction of protein networks through integration of diverse large-scale data sets Lars Juhl Jensen EMBL Heidelberg.
Announcements Monday, April 16: the cell cycle, pp Wednesday, April 18: protein synthesis, pp Friday, April 20: protein targeting,
Cell and Molecular Biology Behrouz Mahmoudi Cell cycle 1.
D ISCOVERING REGULATORY AND SIGNALLING CIRCUITS IN MOLECULAR INTERACTION NETWORK Ideker Bioinformatics 2002 Presented by: Omrit Zemach April Seminar.
Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
The STRING database Michael Kuhn EMBL Heidelberg.
Timothy H. W. Chan, Calum MacAulay, Wan Lam, Stephen Lam, Kim Lonergan, Steven Jones, Marco Marra, Raymond T. Ng Department of Computer Science, University.
Work Process Using Enrich Load biological data Check enrichment of crossed data sets Extract statistically significant results Multiple hypothesis correction.
STRING Modeling of biological systems through cross-species data integration.
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
Detecting Differentially Expressed Genes Pengyu Hong 09/13/2005.
Gene expression analysis summary Where are we now?
Gene Regulation in Eukaryotes Same basic idea, but more intricate than in prokaryotes Why? 1.Genes have to respond to both environmental and physiological.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
DNA Microarray Bioinformatics - #27612 Normalization and Statistical Analysis.
Public data - available for projects 6 data sets: –Human Tissues –Leukemia –Spike-in –FARO compendium – Yeast Cell Cycle –Yeast Rosetta Find one yourself.
Differentially expressed genes
Yeast Dataset Analysis Hongli Li Final Project Computer Science Department UMASS Lowell.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
Introduction to BioInformatics GCB/CIS535
Bio277 Lab 3: Finding Transcription Factor Binding Motifs Adapted from a Lab Written by Prof Terry Speed Jess Mar Department of Biostatistics Quackenbush.
10. Lecture SS 20005Cell Simulations1 V10: Proteome during the Yeast Cell Cycle Bioinformatics, 21, 1164 (2005) Science 307, 724 (2005)
Evaluation of Signaling Cascades Based on the Weights from Microarray and ChIP-seq Data by Zerrin Işık Volkan Atalay Rengül Çetin-Atalay Middle East Technical.
Microarray analysis 2 Golan Yona. 2) Analysis of co-expression Search for similarly expressed genes experiment1 experiment2 experiment3 ……….. Gene i:
Inferring the nature of the gene network connectivity Dynamic modeling of gene expression data Neal S. Holter, Amos Maritan, Marek Cieplak, Nina V. Fedoroff,
Review of important points from the NCBI lectures. –Example slides Review the two types of microarray platforms. –Spotted arrays –Affymetrix Specific examples.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Comparative Expression Moran Yassour +=. Goal Build a multi-species gene-coexpression network Find functions of unknown genes Discover how the genes.
Why microarrays in a bioinformatics class? Design of chips Quantitation of signals Integration of the data Extraction of groups of genes with linked expression.
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
Analysis of microarray data
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Computational Molecular Biology Biochem 218 – BioMedical Informatics Gene Regulatory.
Genome of the week - Deinococcus radiodurans Highly resistant to DNA damage –Most radiation resistant organism known Multiple genetic elements –2 chromosomes,
What is bioinformatics?. What are bioinformaticians up to, actually? Manage molecular biological data –Store in databases, organise, formalise, describe...
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
CSCE555 Bioinformatics Lecture 16 Identifying Differentially Expressed Genes from microarray data Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun.
A New Oklahoma Bioinformatics Company. Microarray and Bioinformatics.
Finish up array applications Move on to proteomics Protein microarrays.
Intelligent systems in bioinformatics Introduction to the course.
Using Bayesian Networks to Analyze Whole-Genome Expression Data Nir Friedman Iftach Nachman Dana Pe’er Institute of Computer Science, The Hebrew University.
Unraveling condition specific gene transcriptional regulatory networks in Saccharomyces cerevisiae Speaker: Chunhui Cai.
Bioinformatics Expression profiling and functional genomics Part II: Differential expression Ad 27/11/2006.
Analysis of the yeast transcriptional regulatory network.
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
Identification of cell cycle-related regulatory motifs using a kernel canonical correlation analysis Presented by Rhee, Je-Keun Graduate Program in Bioinformatics.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Introduction to Statistical Analysis of Gene Expression Data Feng Hong Beespace meeting April 20, 2005.
Data Mining the Yeast Genome Expression and Sequence Data Alvis Brazma European Bioinformatics Institute.
Central dogma: the story of life RNA DNA Protein.
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
Extracting binary signals from microarray time-course data Debashis Sahoo 1, David L. Dill 2, Rob Tibshirani 3 and Sylvia K. Plevritis 4 1 Department of.
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
Cluster validation Integration ICES Bioinformatics.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
GO enrichment and GOrilla
Lecture 10: Cell cycle Dr. Mamoun Ahram Faculty of Medicine
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
Motif Search and RNA Structure Prediction Lesson 9.
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
1 Genomics Advances in 1990 ’ s Gene –Expressed sequence tag (EST) –Sequence database Information –Public accessible –Browser-based, user-friendly bioinformatics.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Microarray: An Introduction
Analyzing circadian expression data by harmonic regression based on autoregressive spectral estimation Rendong Yang and Zhen Su Division of Bioinformatics,
Inferring Regulatory Networks from Gene Expression Data BMI/CS 776 Mark Craven April 2002.
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
Presentation transcript:

CBS Ph.D. course, 14. april 2005 Using Microarrays to Study Cell Cycle Regulation Ulrik de Lichtenberg Center for Biological Sequence Analysis The Technical University of Denmark

Outline Introduction to cell division and the cell cycle Cell culture synchronization and arrest methods Microarrays and periodically expressed genes Our own research

The Origin of Life How does one cell become many?

The cell cycle

The cell cycle – a hot scientific topic Papers about microarray analysis of the cell cycle have been extensively cited Spellman et al., 1998 (yeast) 697 citations Cho et al., 1998 (yeast) 385 citations Cho et al., 2002 (human) 61 citations Leland Hartwell Tim Hunt Sir Paul Nurse The 2001 Nobel Prize in Physiology or Medicine "for their discoveries of key regulators of the cell cycle“ Annual number of new articles in MEDLINE about “cell cycle” Cell cycle related research topics: Cancer, apoptosis, development, metabolic engineering, biomass increase, fertility, protein degradation, phosphorylation, signalling, etc.

A complex biological system

Evolutionary Conservation & Check-points The core machinery of the cell cycle is largely conserved throughout eukaryotes (from yeast to humans) Especially the multi-cellular organisms have extensive and sophisticated control mechanisms (check-points) that check if one step (e.g. DNA replication) is completed and error-free, before proceeding to the next steps (e.g. segregation of the chromosomes)

Different levels of control in the cell The activity of a protein depends on –regulation of its transcription (by activator/inhibitors) –its mRNA stability –the efficiency of translation (from mRNA to protein) –stability of the mature protein –the subcellular localization of the protein –regulation of the protein function by binding of activator and inhibitors –regulation of the protein function by post-translational modification (e.g. phosphorylation) –targeted degradation of the protein (e.g. by ubiquitination)

The “just-in-time” principle Some products you use all the time, while others are only needed once in a while People tend to buy or order things right before they need them…

Just-in-time synthesis in the cell cycle

Periodically expressed genes G1 SG2M G1 SG2M G1 SG2M G1 SG2M

Outline Introduction to cell division and the cell cycle Cell culture synchronization and arrest methods Microarrays and periodically expressed genes Our own research

In a normal culture, cells grow out of synchrony

Arrest-and-release synchronization Temperature sensitive mutants – unable to pass a certain point in the cell cycle at high temperatures Alpha factor – all cells arrest in the same stage until a factor is supplied to the culture

Outline Introduction to cell division and the cell cycle Cell culture synchronization and arrest methods Microarrays and periodically expressed genes Our own research

Using microarrays with synchronized cells Synchrone yeast culture Microarrays Gene expressionExpression Profile Take samples from a synchronized cell culture Hybridize each sample to a microarray chip Normalize to make the chips comparable String the data-points together to a profile Look for genes whose expression is periodic

Time-series versus comparative studies Most microarray experiments compare measurements from condition A with condition B to find those genes that are differentially expressed –the typical analysis is a statistical test, e.g. t-test Time-series experiments are usually aimed at studying how gene expression changes over time (i.e. identifying genes that show a specific pattern of expression) –you can’t make a t-test, instead you have to look for some pattern!

Look for sine/cosine-like waves G1 SG2M G1 SG2M G1 SG2M G1 SG2M

Measuring Periodicity Fourier-scoring: extracting the ”signal” at the interdivision frequency.

Fourier score distribution

The thresholding problem YeastHuman

Sensitivity versus Specificity edible poisonous Sensitivity: how many of the edible mushrooms do you accept Specificity: how many of the mushrooms you accept are edible

Specificity Sensitivity Sensitivity versus Specificity find little, of which most is right find a lot, of which most is right find little, of which most is wrong find a lot, of which most is wrong Sensitivity: how many of the edible mushrooms do you accept Specificity: how many of the mushrooms you accept are edible edible poisonous x

The thresholding problem Every threshold corresponds to a sensitivity and a specificity There is (almost always) a trade-off between the two

Outline Introduction to cell division and the cell cycle Cell culture synchronization and arrest methods Microarrays and periodically expressed genes Our own research

Benchmark of computational methods Many different computational methods have been developed to find periodically expressed genes, but we were the first to compare them to see which ones work the best!

Benchmarking of periodicity analysis methods Computational Methods Cho et al., 1998 –visual inspection Spellman et al., 1998 –fourier scoring and correlation Zhao et al., 2001 –single pulse model Johansson et al., 2003 –partial least squares regression Luan & Li, 2004 –cubic spline fitting Lu et al., 2004 –Bayesian, mixture model de Lichtenberg et al., 2004/2005 –permutation based statistics Benchmark sets B1113 known periodic genes from small scale experiments B2352 genes bound by at least one of nine known cell cycle transcription factors, excluding those in set B1 B3518 genes annotated as “cell cycle and DNA processing” in MIPS, excluding “meiosis” and those in set B1 How do we benchmark? We assume that the benchmark sets B1-B3 are enriched in cell cycle regulated genes. We then benchmarks the ability of each computational method to retrieve these genes from the data (rank them high)

Comparison of different methods --- random Cho et al. Spellman et al. Zhao et al. Johansson et al. Luan & Li Lu et al. de Lichtenberg et al.

Our periodicity analysis of microarray data “is the gene significantly regulated?” –p(regulation): compare actual standard deviation to randomly sampled background distribution “is the expression pattern periodic?” –p(periodicity): compare the actual fourier signal to distribution made by randomly shuffling the data point within each profile Combined score with penalty function –Combi score: Combination of p-values for regulation and periodicity, with penalty terms that require both components to be significant.

Regulation versus periodicity --- random regulation alone periodicity alone combined combined + re-norm combined + consistency

Better methods give better overlap Different experimental conditions, such as cell culture synchronization methods Handling of samples, amplification, hybridization, etc. DNA microarray data are noisy and replicates rarely overlap completely. Signal-to-noise ratio low for weakly expressed –and regulated genes Why not perfect overlap?

Finding the weakly expressed genes List of putative cell cycle regulated genes

Validation of the predictions Used a temperature sensitive mutant yeast strain to synchronize a cell culture in a batch fermentor Took out samples every 20 min Treated and froze the samples Extracted RNA from the samples hybridized to custom made DNA microarrays, where the probes were designed with our own software tool, OligoWiz Performed the raw data treatment and normalization Analyzed the data with our own permutation- based method, which is good at finding weakly expressed –or regulated genes.

Experimental setup - sampling –Samples were shaken with ice (2°C I a few sec) –Centrifuged 1-1/2 min. –Supernatant discarded –Frozen in liquid N2 (-196°C) –Stored at -80°C until RNA extraction

Monitoring synchrony The culture exhibited synchrony with an interdivision time of ~148 minutes, meaning that the experiment covers two entire cell cycles

The results

New genes are weakly expressed

Examples of new discoveries

Mapping gene expression on networks cell cycle microarra y data Physical PPI interactions with confidence scores Expand the set of proteins to include non-periodic proteins that are strongly connected to periodic proteins. Raw DataNode selection List of periodically expressed proteins with peak time Interactions Require compatible compartments and high confidence Extract cell cycle network

Dynamics of protein complex formation

CBS, BioCentrum-DTU, Denmark Søren Brunak (Professor, Center director) Steen Knudsen (Professor) Rasmus Wernersson (synchronization, sampling, probe/array design) Thomas Skøt Jensen (design & data analysis) Henrik Bjørn Nielsen (array normalization & data analysis) Anders Fausbøll (data analysis) Flemming Bryde Hansen (sampling) EMBL, Heidelberg, Germany Lars Juhl Jensen (Prediction and benchmarking) Peer Bork (group leader) Thanks to all the other people involved!

New Weakly Expressed Cell Cycle Regulated Genes in Yeast Ulrik de Lichtenberg, Rasmus Wernersson, Thomas Skøt Jensen, Henrik Bjørn Nielsen, Anders Fausbøll, Peer Schmidt, Flemming Bryde Hansen, Steen Knudsen and Søren Brunak Submitted, 2005 Dynamic Complex Formation During the Yeast Cell Cycle Ulrik de Lichtenberg, Lars Juhl Jensen, Søren Brunak and Peer Bork Science, 307 (5710), , 2005 [PubMed]PubMed Comparison of Computational Methods for the Identification of Cell Cycle Regulated Genes Ulrik de Lichtenberg, Lars Juhl Jensen, Anders Fausbøll, Thomas Skøt Jensen, Peer Bork and Søren Brunak Bioinformatics, 2004 (advance online access) [PubMed]PubMed Protein Feature Based Identification of Cell Cycle Regulated Proteins in Yeast Ulrik de Lichtenberg, Thomas Skøt Jensen, Lars Juhl Jensen and Søren Brunak Journal of Molecular Biology, 239(4), , 2003 [PubMed]PubMed Further reading…