Download presentation
Presentation is loading. Please wait.
Published byAnna Osborne Modified over 9 years ago
1
Regulatory Genomics Lecture 2 November 2012 Yitzhak (Tzachi) Pilpel 1
2
Course requirements Attendance and participation Two reading assignments A final take home papers reading-based exam website No meeting next week on Nov 15 th 2
3
Expression regulation of genes determines complex spatio-temporal patterns 3
4
Monitor expression during cell cycle Time mRNA expression level G1 S G2 M 4
5
Time-point 1 Time-point 3 Time-point 2 -1.8 -1.3 -0.8 -0.3 0.2 0.7 1.2 123 -2 -1.5 -0.5 0 0.5 1 1.5 123 -1.5 -0.5 0 0.5 1 1.5 123 Time -point Normalized Expression Normalized Expression Normalized Expression Genes can be clustered based on time-dependent expression profiles 5
6
The K-means algorithm Start with random positions of centroids. Iteration = 0 6
7
K-means Start with random positions of centroids. Assign data points to centroids Iteration = 1 7
8
K-means Start with random positions of centroids. Assign data points to centroids. Move centroids to center of assigned points. Iteration = 1 8
9
K-means Start with random positions of centroids. Assign data points to centroids. Move centroids to center of assigned points. Iterate till minimal cost. Iteration = 3 9
10
An expression cluster
11
1D and 2D clustering of gene expression data
12
Hierarchical clustering
13
How to join sets? f edc b a
14
How to measure a distance between expression profiles? 14 Gene x Gene y t1 t2 t3 Gene x Gene y t4 t5
15
Clustering the data http://home.dei.polimi.it/matteucc/Clustering/tutorial_html/AppletH.html http://home.dei.polimi.it/matteucc/Clustering/tutorial_html/AppletKM.html Try these two applets at home (needs java)
16
The common distance matrices 16
17
Promoter Motifs and expression profiles CGGCCCCGCGGA CTCCTCCCCCCCTTCTGGCCAATCA ATGTACGGGTG 17
18
Formaldehyde crosslinks living yeast cells Binding site TF Binding site Inside the yeast nucleus: ChIP - chromatin immunoprecipitation Reversal of the crosslinks to separate DNA segments from proteins, and fluorescence labeling of each pool separately (enriched DNA) hybridization to DNA array of all yeast intergenic sequences (unenriched DNA) TF = epitope tag = TF of interest Harvest and sonicate; results in DNA fragments (some of which are bound to proteins) 18
19
P-value 0.05 35,365 interactions P-value 0.01 12,040 interactions P-value 0.005 8,190 interactions P-value 0.001 3,985 interactions P-value, or confidence level, for each spot in array The total number of protein-DNA interactions in the location analysis data set, using a range of P value thresholds: A P-value was selected which minimizes false positives, at the expense of gaining false negatives. P-value 0.001 19
20
Genome-wide Distribution of Transcriptional Regulators Promoter regions of 2343 of 6270 yeast genes (37%) were bound by 1 or more of the 106 transcriptional regulators (P=.001) Avg.: regulator binds 38 promoter regions At P= 0.001, significantly more intergenic regions bind 4 or more regulators than expected by chance 20
21
Network Motifs 21
22
Network Motifs 22
23
Network Motifs in the Yeast Regulatory Network -Based on algorithmic analyses performed in Matlab; http://jura.wi.mit.edu/cgi- bin/young_public/navframe.cgi?s=17&f=networkmotif 10 3 49 90 81 18 8 23 Protein Gene
24
The Cell Cycle Transcriptional Regulatory Network: Various stages of cell cycle Blue boxes represent sets of genes bound by a common set of regulators. Each box is positioned according to the time of peak expression levels for the genes represented by the box. Ovals represent regulators, connected to genes they regulate Length of arc defines the period of activity of that regulator 24
25
Network of Transcriptional Regulators Binding to Genes Encoding Other Transcriptional Regulators 25
26
Network of Transcriptional Regulators Binding to Genes Encoding Other Transcriptional Regulators 26
27
Network of Transcriptional Regulators Binding to Genes Encoding Other Transcriptional Regulators 27
28
DNAmRNAProtein Inactive DNA The Central Dogma of Molecular Biology Expressing the genome RNA 28
29
Translation consists of initiation, elongation and termination 5’ 3’STOP Codon Anti-codon 29
30
The ribosome attachment site determines initiation rate E. coli Yeast 30
31
A consensus for S. cerevisiae ribosome attachment sites? position relative to ATG 100% 0% sequence How good is it as a “ribosomal attachment site” ? ribosomal attachment site score 31
32
5’ 3’ CTG CGC GCG CAG GCG 32
33
Rank ribosomal attachment site score The sequence adaptation score of proteins in yeast CRP good score bad score 33
34
Multiple codons for the same amino acid C1 C2 C3 C4 C5 C6 Serine: UCU UCC UCA UCG AGC AGU Cysteine: UGU UGC Methionine: UGG STOP: UAA, UAG UGA 34
35
G T R Y E C Q A S F D C1C1C1C1C1C1C1C1C1C1C1 C2C2C2C2C2C2C2C2C2C2C2 C1C1C2C1C1C2C1C1C2C1C1 C2C2C2C2C1C1C1C1C1C1C1 C1C1C1C1C1C1C1C2C2C2C2 For a hypothetical protein of 300 amino acids with two-codon each, There are 2^300 possible nucleotide sequences These variants will code for the same protein, and are thus considered “synonymous”. Indeed evolution would easily exchange between them But are they all really equivalent?? 35
36
Selection of codons might affect: Accuracy Throughput Costs Folding RNA-structure 36
37
W i /W max if W i 0 w i = w mean else { dos Reis et al. NAR 2004 The tRNA Adaptation Index (tAI) ATCCCAAAATCGAAT … … … A simple model for translation efficiency Wobble Interaction 37
38
Supply demand and charging 38
39
How the RNA structure influences translation? ? 39
40
No correlation between CAI and protein expression Positive correlation between structure’s energy and expression The 5’ window needs to be un-folded for high expression Protein abundance Protein abundance Conclusions from synthetic library 40
41
Formaldehyde crosslinks living yeast cells Binding site TF Binding site Inside the yeast nucleus: ChIP - chromatin immunoprecipitation Reversal of the crosslinks to separate DNA segments from proteins, and fluorescence labeling of each pool separately (enriched DNA) hybridization to DNA array of all yeast intergenic sequences (unenriched DNA) TF = epitope tag = TF of interest Harvest and sonicate; results in DNA fragments (some of which are bound to proteins) 41
42
A genome-wide method to measure translation efficiency (Ingolia Science 2009) 42
43
Translational response to starvation 43
44
DNAmRNAProtein Inactive DNA The Central Dogma of Molecular Biology Expressing the genome RNA 44
45
mRNA abundance Option 1Option 2Option 3Option 4 Production degradation 45
46
Relationship between gene expression levels and mRNA decay rates across genes. A study in human population examined decay and steady-state mRNA level variation across people. Found strong negative or positive correlations between mRNA level and decay rates. Fast responding genes show “discordant” relation suggesting that increased expression is often accompanied by increased decay rate
47
The various phases are coupled 47
48
At the hardware level (post- transcription: RNA binding proteins) G1 1 1 1 0 G2 1 0 0 1 G3 0 1 1 1 48
49
At the hardware level (post- transcription: microRNA) G1 1 1 1 0 G2 1 0 0 1 G3 0 1 1 1 RISC 49
50
Yang CGFR 16:397, 2005 50
51
Computational approaches to find microRNA genes MiRscan (Lim, et al. 2003) –Scan to find conserved hairpin structures in both C. elegans and C. briggsae. –Using known microRNA genes (50) as training set. 51
52
What is the effect of over expression of a miR? 52
53
53 None-Coding RNAs are often co- targeted with their own targets for various cellular needs
54
miR-124 decreases similarly the abundance and translation of mRNA targets 54
55
microRNA expression profiles classify human cancers Lu et al. Nature 435: 834, 2005 Samples (patients) miRs 55
56
Gene expression is noisy 56
57
Fluorescence distribution shapes 57
58
The cell intrinsic and extrinsic contributions to noise 58
59
DNA RNA Protein Regulation by transcription factors RNA Polymerase Ribosome Extrinsic Intrinsic Chromatin remodeling Transcription process Translation process Φ Protein degradation The actual intrinsic and extrinsic sources of noise: Extrinsic – variation in copy numbers of molecules among cells; Intrinsic: stochastic events 59
60
A theoretical approach 60 DNAmRNAProtein
61
The ratio of transcription to translation should affect noise 61
62
Transcription bursts should affect noise 62
63
Can noise be useful?
64
The native net shows longer and more duration-diverse competence periods
65
Native networks does better on a wider range of extracellular [DNA] The trade-off: High competence allows finding solutions, but reduces growth rate
66
Questions about noise What are the sources of noise? How is noise regulated in cells How is it tolerated by the biological systems that need to be noise free? When is noise advantageous /deleterious/ neutral? 66
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.