Genomics of Gene Regulation ANSC 497B Ross Hardison Nov. 10, 2009.

Slides:



Advertisements
Similar presentations
Methods to read out regulatory functions
Advertisements

Epigenetics Xiaole Shirley Liu STAT115, STAT215, BIO298, BIST520.
Regulomics II: Epigenetics and the histone code Jim Noonan GENE760.
Functional Non-Coding DNA Part II DNA Regulatory Elements BNFO 602/691 Biological Sequence Analysis Mark Reimers, VIPBG.
Transcriptional regulation in Eukaryotes The regulatory elements of bacterial, yeast, and human genes.
Inferring Transcriptional Regulation Using Transctiptomics Carsten O. Daub September 1 st, 2014 StratCan Summer School 2014 Vår Gård, Saltsjöbaden.
Outline Questions from last lecture? P. 40 questions on Pax6 gene Mechanism of Transcription Activation –Transcription Regulatory elements Comparison between.
Polycomb Complexes & Epigenetics Adam Baim MBB306 10/2/07.
3B1 Gene regulation results in differential GENE EXPRESSION, LEADING TO CELL SPECIALIZATION.
Understanding the Human Genome: Lessons from the ENCODE project
Speaker: HU Xue-Jia Supervisor: WU Yun-Dong Date: 19/12/2013.
[Bejerano Fall10/11] 1 Thank you for the midterm feedback! Projects will be assigned shortly.
[Bejerano Spr06/07] 1 TTh 11:00-12:15 in Clark S361 Profs: Serafim Batzoglou, Gill Bejerano TAs: George Asimenos, Cory McLean.
[Bejerano Aut07/08] 1 MW 11:00-12:15 in Redwood G19 Profs: Serafim Batzoglou, Gill Bejerano TA: Cory McLean.
CS 374: Relating the Genetic Code to Gene Expression Sandeep Chinchali.
[Bejerano Fall09/10] 1 Thank you for the midterm feedback!
[Bejerano Aut08/09] 1 MW 11:00-12:15 in Beckman B302 Profs: Serafim Batzoglou, Gill Bejerano TAs: Cory McLean, Aaron Wenger.
“An integrated encyclopedia of DNA elements in the human genome” ENCODE Project Consortium. Nature 2012 Sep 6; 489: Michael M. Hoffman University.
[BejeranoFall13/14] 1 MW 12:50-2:05pm in Beckman B302 Profs: Serafim Batzoglou & Gill Bejerano TAs: Harendra Guturu & Panos.
Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007.
1 1 - Lectures.GersteinLab.org Overview of ENCODE Elements Mark Gerstein for the "ENCODE TEAM"
Molecular Biology Topics for Second Part of Molecular Biology Course: Structural Organization of Eukaryotic Chromosomes Spring 2009 BI 425 UOG.
Chromatin Structure & Gene Expression The Histone Code.
Current Topics in Genomics and Epigenomics – Lecture 2.
An Introduction to ENCODE Mark Reimers, VIPBG (borrowing heavily from John Stamatoyannopoulos and the ENCODE papers)
The Genome is Organized in Chromatin. Nucleosome Breathing, Opening, and Gaping.
Eukaryotic Gene Expression Managing the Complexities of Controlling Eukaryotic Genes.
* only 17% of SNPs implicated in freshwater adaptation map to coding sequences Many, many mapping studies find prevalent noncoding QTLs.
Genomics and High Throughput Sequencing Technologies: Applications Jim Noonan Department of Genetics.
ChIP-on-Chip and Differential Location Analysis Junguk Hur School of Informatics October 4, 2005.
Regulation of Gene Expression Chapter 18. Warm Up Explain the difference between a missense and a nonsense mutation. What is a silent mutation? QUIZ TOMORROW:
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Copyright © 2009 Pearson Education, Inc. Regulation of Gene Expression in Eukaryotes Chapter 17 Lecture Concepts of Genetics Tenth Edition.
Dynamics of epigenetic states during erythroid differentiation Ross Hardison July 20, 2010 Barcelona.
Eukaryotic Gene Expression Managing the Complexities of Controlling Eukaryotic Genes.
Controlling Chromatin Structure
Regulation of eukaryotic genes Gene silencing Enhancers Activators Functional domains of activators.
Recombination breakpoints Family Inheritance Me vs. my brother My dad (my Y)Mom’s dad (uncle’s Y) Human ancestry Disease risk Genomics: Regions  mechanisms.
Genomics of Gene Regulation Genomic and Proteomic Approaches to Heart, Lung, Blood and Sleep Disorders Jackson Laboratories Ross Hardison September 9,
Genomics of Gene Regulation
Overview of ENCODE Elements
CS173 Lecture 9: Transcriptional regulation III
Biol 456/656 Molecular Epigenetics Lecture #5 Wed. Sept 2, 2015.
Outline Molecular Cell Biology Assessment Review from last lecture Role of nucleoporins in transcription Activators and Repressors Epigenetic mechanisms.
Eukaryotic Gene Expression
Genomics 2015/16 Silvia del Burgo. + Same genome for all cells that arise from single fertilized egg, Identity?  Epigenomic signatures + Epigenomics:
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
Enhancers and 3D genomics Noam Bar RESEARCH METHODS IN COMPUTATIONAL BIOLOGY.
Understanding GWAS SNPs Xiaole Shirley Liu Stat 115/215.
Integrative Genomics. Double-helix DNA strands are separated in the gene coding region Which enzyme detects the beginning of a gene ? RNA Polymerase (multi-subunit.
Control of Gene Expression in Bacteria
The Chromatin State The scientific quest to decipher the histone code Lior Zimmerman.
Epigenetics Continued
Functional Elements in the Human Genome
Introduction to Genetic Analysis
Dynamic epigenetic enhancer signatures reveal key transcription factors associated with monocytic differentiation states by Thu-Hang Pham, Christopher.
High-Resolution Profiling of Histone Methylations in the Human Genome
High-Resolution Profiling of Histone Methylations in the Human Genome
Control of the Embryonic Stem Cell State
Volume 67, Issue 6, Pages e6 (September 2017)
Evolution of Alu Elements toward Enhancers
Volume 39, Issue 6, Pages (September 2010)
Volume 132, Issue 2, Pages (January 2008)
Volume 132, Issue 6, Pages (March 2008)
Genome Regulation by Polycomb and Trithorax Proteins
Adam C. Wilkinson, Hiromitsu Nakauchi, Berthold Göttgens  Cell Systems 
Volume 7, Issue 2, Pages (August 2010)
Genetic and Epigenetic Regulation of Human lincRNA Gene Expression
The 3D Genome in Transcriptional Regulation and Pluripotency
Derek de Rie and Imad Abuessaisa Presented by: Cassandra Derrick
Presentation transcript:

Genomics of Gene Regulation ANSC 497B Ross Hardison Nov. 10, 2009

DNA sequences involved in regulation of gene transcription Protein-DNA interactions Chromatin effects

Distinct classes of regulatory regions Maston G, Evans S and Green M (2006) Annu Rev Genomics Hum Genetics 7:29-59 Act in cis, affecting expression of a gene on the same chromosome. Cis-regulatory modules (CRMs)

General features of promoters A promoter is the DNA sequence required for correct initiation of transcription It affects the amount of product from a gene, but does not affect the structure of the product. Most promoters are at the 5’ end of the gene. Maston, Evans & Green (2006) Ann Rev Genomics & Human Genetics, 7:29-59 TATA box + Initiator: Core or minimal promoter. Site of assembly of preinitiation complex Upstream regulatory elements: Regulate efficiency of utilization of minimal promoter RNA polymerase II

Conventional view of eukaryotic gene promoters Maston, Evans & Green (2006) Ann Rev Genomics & Human Genetics, 7:29-59

Most promoters in mammals are CpG islands TATA, no CpG island About 10% of promoters CpG island, no TATA About 90% of promoters Carninci … Hayashizaki (2006) Nature Genetics 38:626

Differences in specificity of start sites for transcription for TATA vs CpG island promoters Carninci … Hayashizaki (2006) Nature Genetics 38:626 Fraction of mRNAs

Enhancers Cis-acting sequences that cause an increase in expression of a gene Act independently of position and orientation with respect to the gene. Pennacchio et al., gov/ Tested UCE Over half of ultraconserved noncoding sequences are developmental enhancers Pennacchio et al. (2006) Nature 444: lacZUCEprluciferaseprCRM About half of the enhancers predicted by interspecies alignments are validated in erythroid cells Wang et al. (2006) Genome Research 16:

CRMs are clusters of specific binding sites for transcription factors Hardison (2002) on-line textbook Working with Molecular Genetics

Enhancers can occur in a variety of positions with respect to genes Transcription unitP Ex1Ex2 Enhancer Adjacent Downstream Internal Distal Upstream

Silencer Cis-acting sequences that cause a decrease in gene expression Similar to enhancer but has an opposite effect on gene expression Gene repression - inactive chromatin structure (heterochromatin) SIR proteins (Silent Information Regulators) Nucleates assembly of multi-protein complex –hypoacetylated N-terminal tails of histones H3 and H4 –methylated N-terminal tail of H3 (Lys 9)

Insulators and boundaries A boundary in chromatin marks a transition from open to closed chromatin An insulator blocks activation of promoter by an enhancer –Requires CTCF Example: HS4 from chick HBB complex has both functions neoRPr Enhancer Insu- lator Neo-resistant colonies % of maximum Silencer

Repression by PcG proteins via chromatin modification Polycomb Group (PcG) Repressor Complex 2: ESC, E(Z), NURF-55, and PcG repressor SU(Z)12 Methylates K27 of Histone H3 via the SET domain of E(Z) me3 H3 N-tailK27 OFF

trx group (trxG) proteins activate via chromatin changes SWI/SNF nucleosome remodeling Histone H3 and H4 acetylation Methylation of K4 in histone H3 –Trx in Drosophila, MLL in humans 3 Me1,2,3 H3 N-tail K4 ON

Histone modifications modulate chromatin structure Uta-Maria Bauer H3K27me3H3K4me2, 3

Repressed and active chromatin Dustin Schones and Keiji Zhao (2008) Nature Reviews Genetics 9: 179

Biochemical features of DNA in CRMs Pol IIa Pol II Coactivators Accessible to cleavage: DNase hypersensitive site Bound by specific transcription factors Associated with RNA polymerase and general transcription factors Nucleosomes with histone modifications: Acetylation of H3 and H4 Methylation of H3K4 Lack of methylation at H3K27 or H3K9 … Clusters of binding site motifs

Methods in Genomics of Gene Regulation

Chromatin immunoprecipitation: Greatly enrich for DNA occupied by a protein Elaine Mardis (2007) Nature Methods 4:

ChIP-chip: High throughput mapping of DNA sequences occupied by protein Bing Ren’s lab

Enrichment of sequence tags reveals function Barbara Wold & Richard M Myers (2008) “Sequence Census Methods” Nature Methods 5:19-21

Illumina (Solexa) short read sequencing - 8 lanes per run - 10 M to 20 M reads of 36 nucleotides (or longer) per run. - 1 lane can produce enough reads to map locations of a transcription factor in a mammalian genome.

Example of ChIP-seq ChIP vs NRSF = neuron-restrictive silencing factor Jurkat human lymphoblast line NPAS4 encodes neuronal PAS domain protein 4 Johnson DS, Mortazavi A, Myers RM, Wold B. (2007) Genome-Wide Mapping of in Vivo Protein-DNA Interactions. Science 316:

ChIP-seq for chromatin modifications Dustin Schones and Keiji Zhao (2008) Nature Reviews Genetics 9: 179

Histone modifications around HBB locus Known CRMs UCSC genes DNase hyper- sensitive sites Polycomb trithorax Transcription associated mark

Distributions at all GenCode TSSs Birney et al. (2007) Nature 477: Symmetrical distribution of: - H3K4me3, H3K4me2 - H3Ac, H4Ac, DHS - E2F1, E2F4, Myc, Pol II

Distribution of histone modifications and factor binding around regulatory regions Promoters –H3K4me3, H3K4me2 –E2F1, E2F4, Myc, Pol II Distal HSs –H3K4me1: enhancers –CTCF: insulators Birney et al. (2007) Nature, 447:

Enhancers predicted from chromatin signatures (2009) Nature 459:

Enhancer predictions in human cells

Characteristics and validation of predicted enhancers

Data Resources for Genomics of Gene Regulation

UCSC Genome Browser Visualize data described in publications, e.g. –Expression data Affymetrix gene arrays, GNF, Su et al –Regulation Kim et al. 2005, PICs (TAF1) Kim et al., 2008, CTCF Boyle et al., 2008, DNase hypersensitive sites Heintzman et al., 2009, Enhancers predicted by H3K4me1 Mikkelsen et al., 2007, Chromatin modifications in pluripotent and lineage-committed cells ENCODE project, Production phase –Expression Affy high density tiling arrays RNA-seq from several sources (CSHL, Helicos) –Regulation Broad histone modifications HAIB DNA methylation Open Chromatin UW DNase HS HAIB TFBS Yale TFBS SUNY RBP

Factor occupancy and DNase hypersen- sitivity ENCODE Tracks: Broad histone modifications, Open chromatin, UW DHS, Yale TFBSs Locus control region HS

Collated sets of published regulatory regions Noncoding DNA segments with high regulatory potential PRPs: Intersection of the High RP segments and the PReMods (clusters of conserved transcription factor binding site motifs) Most constrained DNA segments, phastCons DNase hypersensitive sites in CD4+ T cells DNA segments occupied by CTCF in primary fibroblasts Preinitiation complexes (TAF1) in IMR90 cells Predicted erythroid cis-regulatory modules

GeneTrack Genomic data analysis and integration –Istvan Albert, Frank Pugh, et al., PSU – Install on your system Gallery of data for visualization –Yeast H2AZ nucleosome predictions, 454 sequencing –Drosophila H2AZ nucleosome predictions, 454 sequencing

Yeast nucleosome map

HIS3: nucleosome- free region

modENCODE Worm and Fly Gene annotations Expression Chromatin modifications TFBs in vivo, etc.

Experimental Tests in the Genomics of Gene Regulation

G1E-ER4 cells GATA-1 is required for erythroid maturation Aria Rad, MEP Hematopoietic stem cell Common myeloid progenitor Myeloblast Basophil Common lymphoid progenitor Neutrophil Eosinophil Monocyte, macrophage GATA-1 G1E cells

GATA1-induced changes in gene expression and occupancy genome-wide Genes induced or repressed after restoration of GATA1 Occupancy by TFs and histone modifications along a 60 Mb region

High sensitivity and specificity of high throughput occupancy data

High throughput occupancy matches known CRMs at Hbb locus

Confirmed and novel regulatory regions for Gypa Known CRMs Gypa gene Response DHSs GATA1 TAL1 Trx: H3K4me1 Trx: H3K4me3 PcG: H3K27me3 Input DNA

Induced genes have GATA1 occupied segments close to their TSS

DNA segments occupied by GATA-1 were tested for enhancer activity on transfected plasmids Occupied segments

Some of the DNA segments occupied by GATA-1 are active as enhancers Cheng et al. (2008) Genome Research 18:

Binding site motifs in occupied DNA segments can be deeply preserved during evolution Consensus binding site motif for GATA-1: WGATAR or YTATCW 5997 constrained 7308 not constrained 2055 no motif

All GATA1-occupied segments active as enhancers are also occupied by SCL and LDB1

Genetic Determinants of Variation in Gene Expression

Variation of gene expression among individuals Levels of expression of many genes vary in humans (and other species) Variation in expression is heritable Determinants of variability map to discrete genomic intervals Often multiple determinants This variation indicates an abundance of cis-regulatory variation in the human genome For example: –Microarray expression analyses of 3554 genes in 14 families Morley M … Cheung VG (2004) Nature 430: Expression analysis of about 16 HapMap individuals Storey et al. (2007) AJHG 80: –Expression analysis of all 270 individuals genotypes in HapMap Stranger BE … Dermitzakis E (2007) Nature Genetics 39:

Variation in expression between populations Figure 5.Allele-specific qPCR analysis of SH2B3. a, Log2-fold change of SH2B3 expression for all CEU and YRI individuals, relative to the average expression level in the YRI sample obtained from allele-specific qPCR. The distribution of SH2B3 expression is significantly different between samples (t-test, P=.0157), which confirms the microarray results. b, Allele-specific qPCR of a coding polymorphism (rs ), which demonstrates that the log2-fold change of the G allele relative to the A allele is significantly different between heterozygous DNA (Het DNA) and heterozygous cDNA (Het cDNA) samples (t-test, P=.00118). Storey et al., 2007, AJHG 80:

Mapping determinants of expression variation Stranger et al., 2007, Nature Genetics 39: Expression analysis of EBV-transformed lymphoblastoid cells from all 270 individuals genotypes in HapMap –30 Caucasian trios (90) of European descent in Utah (CEU) –30 Yoruba trios (90) from Ibadan, Nigeria (YRI) –45 unrelated Chinese individuals from Beijing Univ (CHB) –45 unrelated Japanese individuals from Tokyo (JPT) Measure levels of expression of 47,294 probes (about 24,000 genes) in each individual –Focus on 13,643 genes “selected on criteria of variance and population differentiation” Already know genotypes at about 2.2 million SNPs for each individual (HapMap) Test for significant association of variation at each SNP with variation in expression of each gene –Linear regression model –Spearman rank correlation test Evaluate significance of regression P values by 10,000 permutations of the data, focus on those associations above the permutation threshold

Association of SNPs with expression Stranger et al., 2007, Nature Genetics 39: Significant association between expression and cis- SNPs (within 1 Mb) 831 genes in at least one population 310 genes in at least 2 populations 62 genes in all 4 populations Also find associated SNPs in trans: perhaps regulatory proteins

Location of expression-associated SNPs Most are “close” to transcription start site (TSS) Symmetrical arrangement (similar to biochemical features of promoters) Three of the SNPs have been shown to affect promoter activity in transfection assays (Hoogendoorn et al. (2004) Human Mutation 24: Figure 4 Properties of significant cis associations as a function of SNP distance from the transcription start site. Stranger et al., 2007, Nature Genetics 39:

Relevance to human health "We predict that variants in regulatory regions make a greater contribution to complex disease than do variants that affect protein sequence” – Manolis Dermitzakis, ScienceDaily

Risk loci in noncoding regions (2007) Science 316:

Biochemical features of DNA in CRMs Pol IIa Pol II Coactivators Accessible to cleavage: DNase hypersensitive site Bound by specific transcription factors Associated with RNA polymerase and general transcription factors Nucleosomes with histone modifications: Acetylation of H3 and H4 Methylation of H3K4 Clusters of binding site motifs

Candidate functions in T2D SNP intervals Overlap of SNP rs with DHS suggests a role in transcriptional regulation, but overlap with an exon of a noncoding RNA suggests a role in post-transcriptional regulation. Different hypotheses to test in future work.