A genomic code for nucleosome positioning

Slides:



Advertisements
Similar presentations
Methods to read out regulatory functions
Advertisements

Periodic clusters. Non periodic clusters That was only the beginning…
DNA packaging summary 1.Problem is packaging 2.Levels of chromatin structure (nucleosomes, 30-nm fiber, loops, bands) 3.Histone code marks active and.
Chromosomes, Chromatin, and the Nucleosome
Systematic Evolution of Ligands by Exponential Enrichment: RNA Ligands to Bacteriophage T4 DNA Polymerase CRAIG TUERK AND LARRY GOLD.
Transcriptional regulation and promoter analysis
A Genomic Code for Nucleosome Positioning Authors: Segal E., Fondufe-Mittendorfe Y., Chen L., Thastrom A., Field Y., Moore I. K., Wang J.-P. Z., Widom.
Nucleosome Positioning Histones and DNA Bending. DNA packaging 3 X 10 9 base pairs in human genome ~1 m if unraveled Compacted into nucleus –100  m in.
Chap. 6 Problem 2 Protein coding genes are grouped into the classes known as solitary (single) genes, and duplicated or diverged genes in gene families.
Nucleosomes: what, why and where? Rob Brewster. Outline What is a nucleosome? - how is DNA packaged/organized in Eukaryotes? Why do nucleosomes form?
The Necessity For DNA Condensation The human genome (haploid) is 3  10 9 base pairs. The helical rise of dsDNA per base pair is 0.3  10 –9 meters. If.
Thermodynamic Models of Gene Regulation Xin He CS598SS 04/30/2009.
Epigenetics 12/05/07 Statisticians like data.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Computational Approaches in Epigenomics Guo-Cheng Yuan Department of Biostatistics and Computational Biology Dana-Farber Cancer Institute Harvard School.
Protein and DNA interaction How to detect nucleosome position Xuefeng Zhu Department of Medical Biochemistry and Cell Biology
A System Approach to Measuring the Binding Energy Landscapes of Transcription Factors Authors: Sebastian J. et. al Presenter: Hongliang Fei.
Transcription Transcription- synthesis of RNA from only one strand of a double stranded DNA helix DNA  RNA(  Protein) Why is RNA an intermediate????
MicroRNA Targets Prediction and Analysis. Small RNAs play important roles The Nobel Prize in Physiology or Medicine for 2006 Andrew Z. Fire and Craig.
MBV2010/BIO2140 Colloquium, February 19. RULES Multiple choice Only one correct answer seconds to answer No textbook, computer, mobile phone, please.
Chapter 19 Organization and Control of Eukaryotic Genomes …Or How To Fit All of the Junk In the Trunk.
DNA double helix Nucleosomes Chromosome Felsenfeld & Groudine, Nature (2003) Chromatin structure and cancer epigenetics 30 nm fiber.
Vidyadhar Karmarkar Genomics and Bioinformatics 414 Life Sciences Building, Huck Institute of Life Sciences.
The genomic code for nucleosome positioning DNA double helix Nucleosomes Chromosome Felsenfeld & Groudine, Nature (2003)
Genome Organization & Evolution. Chromosomes Genes are always in genomic structures (chromosomes) – never ‘free floating’ Bacterial genomes are circular.
Dynamics of assembly and disassembly of nucleosomes along a stretched DNA University of Illinois at Chicago/ Northwestern University Ranjith Padinhateeri.
Recombinant DNA Technology and Genomics A.Overview: B.Creating a DNA Library C.Recover the clone of interest D.Analyzing/characterizing the DNA - create.
Outline Group Reading Quiz #2 on Thursday (covers week 5 & 6 readings Chromosome Territories Chromatin Organization –Histone H1 Mechanism of Transcription.
Conservation and Evolution of Cis-Regulatory Systems Tal El-Hay Computational Biology Seminar חנוכה תשס"ו December 2005.
Gene expression. The information encoded in a gene is converted into a protein  The genetic information is made available to the cell Phases of gene.
Comparative genomics Haixu Tang School of Informatics.
Data Mining the Yeast Genome Expression and Sequence Data Alvis Brazma European Bioinformatics Institute.
A biophysical approach to predicting intrinsic and extrinsic nucleosome positioning signals Alexandre V. Morozov Department of Physics & Astronomy and.
Genes and Genomes. Genome On Line Database (GOLD) 243 Published complete genomes 536 Prokaryotic ongoing genomes 434 Eukaryotic ongoing genomes December.
Molecular Genetics Introduction to
A genomic code for nucleosome positioning DNA double helix Nucleosomes Chromosome Felsenfeld & Groudine, Nature (2003)
Differences in DNA Heterochromatin vs. Euchromatin
Motif Search and RNA Structure Prediction Lesson 9.
The Contribution of Ploidy to Functional Genomic Comparisons in Yeasts Eric Delgado Regev Group Summer Research Program in Genomics.
Transcription factor binding motifs (part II) 10/22/07.
DNA-Protein Interactions & Complexes. Prokaryotic promoter Consensus sequence is not present in majority of prokaryotic promoters. Sequence motifs.
Javad Jamshidi Fasa University of Medical Sciences, November 2015 Genes, Genomes and Chromatin Organization.
Gene Regulation, Part 2 Lecture 15 (cont.) Fall 2008.
Topics to be covers Basic features present on plasmids
From Gene to Protein pp Discover Biology: C15 From Gene to Protein pp
Context Cell nucleus chromosome gene double helix.
Differences in DNA Heterochromatin vs. Euchromatin
DNA TRANSCRIPTION Making mRNA.
Topic DNA.
Regulation of Gene Expression by Eukaryotes
Control of eukaryotic gene expression
Eukaryotic Comparative Genomics
Controlling Chromatin Structure
Dynamics of nucleosomal DNA
Introduction to Bioinformatics II
Volume 38, Issue 4, Pages (May 2010)
Chromosome Organization
Volume 94, Issue 12, Pages (June 2008)
Nucleic Acid.
Phylogenetic footprinting and shadowing
Chromatin Domains: The Unit of Chromosome Organization
Presented by, Jeremy Logue.
Nucleosomes Nucleosomes consist of DNA tightly wrapped around proteins called histones 75-90% of DNA is believed to be present in nucleosomes From faculty.
Volume 128, Issue 6, Pages (March 2007)
Volume 47, Issue 2, Pages (July 2012)
Ariel Afek, Itamar Sela, Noa Musa-Lempel, David B. Lukatsky 
Presented by, Jeremy Logue.
Volume 22, Issue 2, Pages (April 2006)
Major Determinants of Nucleosome Positioning
Tips for writing papers
Presentation transcript:

A genomic code for nucleosome positioning

Hierarchical DNA folding in eukaryotic chromosomes DNA double helix Nucleosomes Felsenfeld & Groudine,, Nature 421: 448-453 (2003)

Most eukaryotic DNA is sharply looped Luger et al., 1997

Physical selection for stable nucleosome formation on chemically synthetic random DNAs Random 220-mer pool (1 each of 5 x 1012 individuals) PCR amplify, HPLC purify Select best 10% (dialysis from 2.0M NaCl) 2x Sucrose gradient purification Extract DNA Population diversity analysis Free energy analysis Clone, sequence, analyze individuals Lowary & Widom, 1998

Differing DNA sequences exhibit a > 5,000-fold range of affinities for nucleosome formation Lowary & Widom, 1998 Thåström et al., 1999 Widom, 2001 Thåström et al., 2004

Basepair steps as fundamental units of DNA mechanics Zhurkin Olson

DNA sequence motifs that stabilize nucleosomes and facilitate spontaneous sharp looping AA TT TA GC Segal et al., 2006

Why DNA some sequences have especially high affinity for histone octamer More or better bonds Appropriately bent More easily bendable Appropriate twist More easily twistable Widom, 2001 Cloutier & Widom, 2004

Felsenfeld, G. & Groudine, M. (2003), Nature 421: 448-453

~10.2 bp periodicity of AA/TT steps in the C. elegans genome Widom, 1996

Understanding and predicting the genome’s nucleosome-forming potential Collect nucleosome-bound sequences Associate intrinsic encoding with biological function Predict intrinsically encoded nucleosome organization Construct nucleosome-DNA interaction model Compare with in vivo positions Validate model So we used a combined experimental and computational approach to study this problem and we started by experimentally collecting sequences that are wrapped in nucleosomes. We then construct a model that captures the commonalities that we find in these sequences so that we describe in this model the DNA preferences of nucleosomes, validate the model and then use it to computationally predict the organization of the entire genome into nucleosomes, and then compare this intrinsic organization with in vivo positions and study its biological implications. Segal et al., 2006

Isolation of natural nucleosome core DNA Alberts et al., 4th ed., Fig. 4–24 (2002)

Center alignment of yeast nucleosome DNAs ACGTAGCTGTAGTGTACTGACGTACGTCGTC ACTAGCTGATACGGAGACCCGCGCGATTTTGCGGTC ACTGTTCGTCGTGTGTGTGTGCTGCTGTAGACTTGTGTG TACTGTTTTTATTTTGCGGGCATGCTTGT Center align ~10bp periodicity of AA/TT/TA Same period for GC, out of phase with AA/TT/TA Signals important for DNA bending NO signal from alignment of randomly chosen genomic regions Yeast (in vivo nucleosomes) Segal et al., 2006

Location mixture model alignment vs center alignment Wang & Widom, 2005

Center alignment of chicken nucleosome DNAs Chicken + Yeast merge Chicken (in vivo) (Satchwell et al., 1986) Yeast (in vivo) Segal et al., 2006

Alignments of nucleosomes selected in vitro Chicken (in vivo) (Satchwell et al., 1986) Yeast (in vivo) Yeast (in vitro) Mouse (in vitro) (Widlund et al., 1997) Random DNA (in vitro) (Lowary & Widom, 1998) Segal et al., 2006

In vitro experimental validation of histone-DNA interaction model Adding key motifs increases nucleosome affinity Deleting motifs or disrupting their spacing decreases affinity Segal et al., 2006

In vitro experimental validation of histone-DNA interaction model Adding key motifs increases nucleosome affinity Deleting motifs or disrupting their spacing decreases affinity dyad 8 18 28 38 48 58 68 78 88 98 108 118 128 138 To make sure that these really are the rules of the game we did in vitro experiments, and what we found is that indeed if you start from a sequence and add key motifs at the right positions you can get sequences that have better binding affinities, whereas deleting these motifs or changing their spacing decreases their affinity. Segal et al., 2006

Summary Differing DNA sequences exhibit a > 5,000-fold range of affinities for nucleosome formation We have a predictive understanding of the DNA sequence motifs that are responsible

Nucleosome–DNA model Differences from TF model Signal is weak Information in dinucleotides Two-fold symmetry axis AA TT TA GC Position specific dinucleotide distributions Add reverse complement of each sequence Local (3 bp) averaging Segal et al., 2006

Summary Differing DNA sequences exhibit a > 5,000-fold range of affinities for nucleosome formation We have a predictive understanding of the DNA sequence motifs that are responsible Sequences matching these motifs are abundant in eukaryotic genomes, and are occupied by nucleosomes in vivo

Placing nucleosomes on the genome A free energy landscape, not just scores and a threshold !! Log likelihood Genomic Location (bp) Nucleosomes occupy 147 bp and exclude 157 bp Segal et al., 2006

Equilibrium configurations of nucleosomes on the genome One of very many possible configurations P(S) PB(S) P(S) PB(S) P(S) PB(S) P(S) PB(S) Chemical potential – apparent concentration Probability of placing a nucleosome starting at each allowed basepair i of S Probability of any nucleosome covering position i ( average occupancy) Locations i with high probability for starting a nucleosome ( stable nucleosomes) Segal et al., 2006

Cross-correlation of average occupancies predicted using yeast and chicken models Segal et al., 2006

Nucleosome coding potential at the GAL1–10 locus: predicted distribution compared to experimental Segal et al., 2006

Nucleosome coding potential at the CHA1 locus Segal et al., 2006

Reliable classification of nucleosome occupancies on depleted vs occupied regions Yeast ORFs depleted of nucleosomes (Lee et al.) (0.82, 0.68) Yeast ORFs occupied by nucleosomes (Lee et al.) (0.88, 0.76) (0.88, 0.55) p<10–6 p<10–9 (0.82, 0.32) Yeast intergenics occupied by nucleosomes (Bernstein et al.) Yeast intergenics depleted of nucleosomes (Bernstein et al.) Segal et al., 2006

In vivo nucleosome occupancies at predicted high-occupancy locations + Controls – Controls High Predictions Segal et al., 2006

In vivo nucleosome occupancies at predicted low-occupancy locations Segal et al., 2006

Summary Differing DNA sequences exhibit a > 5,000-fold range of affinities for nucleosome formation We have a predictive understanding of the DNA sequence motifs that are responsible Sequences matching these motifs are abundant in eukaryotic genomes, and are occupied by nucleosomes in vivo A model based only on these DNA sequence motifs and nucleosome-nucleosome exclusion explains ~50% of in vivo nucleosome positions

Understanding and predicting the genome’s nucleosome-forming potential Collect nucleosome-bound sequences Construct nucleosome-DNA interaction model Validate model So we used a combined experimental and computational approach to study this problem and we started by experimentally collecting sequences that are wrapped in nucleosomes. We then construct a model that captures the commonalities that we find in these sequences so that we describe in this model the DNA preferences of nucleosomes, validate the model and then use it to computationally predict the organization of the entire genome into nucleosomes, and then compare this intrinsic organization with in vivo positions and study its biological implications. Predict intrinsically encoded nucleosome organization Compare with in vivo positions Associate intrinsic encoding with biological function Segal et al., 2006

Nucleosome occupancy varies with chromosome region type Centromere Convergent intergenic regions Autonomously replicating sequences Unique intergenic regions Ribosomal proteins Divergent intergenic regions Protein coding regions Telomeres Conserved DNA binding sites Ribosomal RNAs tRNAs Average nucleosome occupancy Segal et al., 2006

Does the genome’s intrinsic nucleosome organization facilitate occupancy of functional binding sites? Segal et al., 2006

The yeast genome encodes low nucleosome occupancy over functional binding sites Segal et al., 2006

Semi-stable nucleosomes Distinctive nucleosome occupancy adjacent to TATA elements at yeast promoters Stable nucleosome Semi-stable nucleosomes Permuted Model TATA Box Segal et al., 2006

S. cerevisiae S. Paradoxus S. mikatae S. kudriavzevii S. bayanus S. castelli C. glabrata S. kluyveri K. walti K. lactis A. gossypii D. hansenii C. albicans Y. lipolytica A. nidulans S. pombe -36 -41 -22 -42 -23 -53 -21 -14 -26 -3 -16 -9 -56 -2 Nucleosome organization near 5’ ends of genes is conserved through evolution Segal et al., 2006

Predicted nucleosome organization near 5’ ends of genes – comparison to experiment Segal et al., 2006

Summary Differing DNA sequences exhibit a > 5,000-fold range of affinities for nucleosome formation We have a predictive understanding of the DNA sequence motifs that are responsible Sequences matching these motifs are abundant in eukaryotic genomes, and are occupied by nucleosomes in vivo A model based only on these DNA sequence motifs and nucleosome-nucleosome exclusion explains ~50% of in vivo nucleosome positions These intrinsically encoded nucleosome positions are correlated with, and may facilitate, essential aspects of chromosome structure and function

Factors present in the nucleus Genome sequence influences the competition between nucleosomes and DNA binding proteins Factors present in the nucleus None + 200 100 Promoter region 1 20 -20 1 Nucleosome score landscape Nucleosome occupancy Stable nucleosome Unstable nucleosome DNA binding protein DNA binding site / Promoter region 2 I II III Segal et al., 2006

Toward a proper free energy model for the sequence-dependent cost of DNA wrapping AA TT TA GC Morozov, Segal, Widom, & Siggia

Acknowledgements Nucleosome positioning in vivo Eran Segal (Weizmann Inst.) Yvonne Fondufe-Mittendorf Lingyi Chen Annchristine Thåström Yair Field (Weizmann Inst.) Irene Moore Jiping Wang (Northwestern U. Statistics) Alexandre Morozov (Rockefeller U.) Eric Siggia (Rockefeller U.)