The Genetics of Transcription Factor DNA Binding Variation

Slides:



Advertisements
Similar presentations
Figure 1. Annotation and characterization of genomic target of p63 in mouse keratinocytes (MK) based on ChIP-Seq. (A) Scatterplot representing high degree.
Advertisements

Volume 43, Issue 1, Pages (July 2011)
Alternative Computational Analysis Shows No Evidence for Nucleosome Enrichment at Repetitive Sequences in Mammalian Spermatozoa  Hélène Royo, Michael Beda.
Volume 22, Issue 2, Pages (April 2006)
Genetic-Variation-Driven Gene-Expression Changes Highlight Genes with Important Functions for Kidney Disease  Yi-An Ko, Huiguang Yi, Chengxiang Qiu, Shizheng.
Volume 38, Issue 4, Pages (May 2010)
Quantitative Genetic Interactions Reveal Biological Modularity
High-Resolution Profiling of Histone Methylations in the Human Genome
Volume 54, Issue 5, Pages (June 2014)
A Phase Separation Model for Transcriptional Control
Volume 57, Issue 6, Pages (March 2015)
Protein Occupancy Landscape of a Bacterial Genome
Lu Bai, Andrej Ondracka, Frederick R. Cross  Molecular Cell 
Volume 9, Issue 3, Pages (September 2017)
Modification of Enhancer Chromatin: What, How, and Why?
Transcriptional Addiction in Cancer
A CRISPR Connection between Chromatin Topology and Genetic Disorders
Revealing Global Regulatory Perturbations across Human Cancers
A twin approach to unraveling epigenetics
Impulse Control: Temporal Dynamics in Gene Transcription
Chromatin Domains: The Unit of Chromosome Organization
Adrien Le Thomas, Georgi K. Marinov, Alexei A. Aravin  Cell Reports 
Volume 54, Issue 5, Pages (June 2014)
Volume 67, Issue 6, Pages e6 (September 2017)
Volume 20, Issue 6, Pages (August 2017)
Mapping Global Histone Acetylation Patterns to Gene Expression
Hannah K. Long, Sara L. Prescott, Joanna Wysocka  Cell 
Volume 149, Issue 7, Pages (June 2012)
Transcription Initiation at Its Most Basic Level
Mechanisms and Consequences of Alternative Polyadenylation
High-Resolution Profiling of Histone Methylations in the Human Genome
Alex M. Plocik, Brenton R. Graveley  Molecular Cell 
Volume 32, Issue 6, Pages (March 2015)
Volume 23, Issue 1, Pages 9-22 (January 2013)
Volume 11, Issue 3, Pages (April 2015)
Control of the Embryonic Stem Cell State
Revealing Global Regulatory Perturbations across Human Cancers
Volume 49, Issue 2, Pages (January 2013)
The Human Transcription Factors
Volume 67, Issue 6, Pages e6 (September 2017)
Isabelle S. Peter, Eric H. Davidson  Cell 
Human Promoters Are Intrinsically Directional
Volume 5, Issue 3, Pages e7 (September 2017)
Volume 132, Issue 2, Pages (January 2008)
Volume 10, Issue 10, Pages (October 2017)
Volume 47, Issue 2, Pages (July 2012)
Volume 132, Issue 6, Pages (March 2008)
Volume 50, Issue 2, Pages (April 2013)
Volume 122, Issue 6, Pages (September 2005)
Volume 66, Issue 4, Pages e4 (May 2017)
Genome Regulation by Polycomb and Trithorax Proteins
Polycomb Protein Ezh1 Promotes RNA Polymerase II Elongation
Alternative Splicing: New Insights from Global Analyses
Volume 42, Issue 6, Pages (June 2011)
Adam C. Wilkinson, Hiromitsu Nakauchi, Berthold Göttgens  Cell Systems 
DNA Looping Facilitates Targeting of a Chromatin Remodeling Enzyme
Volume 3, Issue 4, Pages (April 2013)
Functional and Mechanistic Diversity of Distal Transcription Enhancers
Volume 36, Issue 6, Pages (December 2009)
Enhancer Malfunction in Cancer
Volume 52, Issue 1, Pages (October 2013)
Major Determinants of Nucleosome Positioning
Genome-wide “Re”-Modeling of Nucleosome Positions
The 3D Genome in Transcriptional Regulation and Pluripotency
IMPACT: Genomic Annotation of Cell-State-Specific Regulatory Elements Inferred from the Epigenome of Bound Transcription Factors  Tiffany Amariuta, Yang.
A New Cohesive Team to Mediate DNA Looping
The Structure of T. aquaticus DNA Polymerase III Is Distinct from Eukaryotic Replicative DNA Polymerases  Scott Bailey, Richard A. Wing, Thomas A. Steitz 
Enhancer Control of Transcriptional Bursting
Volume 33, Issue 1, Pages (July 2010)
A Splicing-Independent Function of SF2/ASF in MicroRNA Processing
Presentation transcript:

The Genetics of Transcription Factor DNA Binding Variation Bart Deplancke, Daniel Alpern, Vincent Gardeux  Cell  Volume 166, Issue 3, Pages 538-554 (July 2016) DOI: 10.1016/j.cell.2016.07.012 Copyright © 2016 Elsevier Inc. Terms and Conditions

Figure 1 A Methodological Workflow for Identifying Regulatory Variants Sequence-based, computational methodologies that evaluate the impact of potential regulatory variants on TF-DNA binding and downstream regulatory processes are schematically presented. For every putative variant (SNV, as in this example, or indel), a reference and alternate (containing the variant) sequence of pre-defined length (illustrated by the distinct shades of gray) is extracted. The chosen length defines the “sequence environment” and varies according to the type of model that is used. The middle yellow panel shows the common workflow, where both sequences are scored (SALT and SREF) according to a specific model representation to obtain a differential score (ΔS) that may indicate a change in DNA binding or more generally in chromatin state. As shown, ΔS supports a model in which the variant impacts a gene regulatory process. The bottom part of the figure illustrates the two main strategies that are employed for modeling the regulatory effect of a variant. The choice of the strategy depends on the posed question: does the variant impact (1) the binding of a TF (left) or (2) the local chromatin landscape (right)? In the first scenario, computational methods are used that depend on the availability of a comprehensive catalog of TF binding sequences or motifs (Box 1). The “de novo motif discovery” part schematizes the procedure that is required to obtain such a catalog, illustrating the use of sequence over-representation strategies that are applied on both in vivo (ChIP-seq, DHS-seq, etc.) and in vitro (e.g., PBM, HT-SELEX, or B1H) derived datasets. These strategies then produce TF motifs that can be represented either in regular expression format or using PWM- or HMM-based models. In this example, the linear HMM model is a generic representation of the PWM motif, with each node (state) of the HMM representing the position of a base in the motif. Additionally, a second HMM model is depicted, which inherently takes a variable space within the motif into account, for accurate representation of more complex binding scenarios (e.g., TF dimers). To answer the second question (lower-right), computational methods mainly rely on machine learning models that are trained on a wide variety of features such as a k-mer vocabulary built on regulatory versus background sequences or additional (epi)genomic datasets. These more elaborate models can also be used to score the two input sequences. The pipeline then evaluates the regulatory nature of the variant by directly assessing the differential score ΔS or by calculating a p value based on the distribution of the scores. Of note, this pipeline can be applied multiple times on different variants, after which the results can be aggregated and compared to prioritize variants. Cell 2016 166, 538-554DOI: (10.1016/j.cell.2016.07.012) Copyright © 2016 Elsevier Inc. Terms and Conditions

Figure 2 Distinct Modes of Genetic Variation-Mediated Changes in TF-DNA Binding (A) Only a minority of variable TF-DNA binding events are caused by DNA variants disrupting the cognate TF recognition motif. (B–D) The majority of variably binding events are motif variation independent, signifying that a variant located either proximally (<200 bp, B and C) or distally (D) to the focal motif affects the binding of the respective TF. Proximal variants can affect local cooperative DNA binding (B), which involves physical protein-protein interactions that require overlapping or very closely located (a few bp) motifs, or collaborative DNA binding (C), which reflects TF interdependencies needed, for example, to compete with nucleosomes and thus to access DNA. In contrast, distal variants (D) may alter chromatin state or conformation (e.g., DNA loops), which could affect the stability of interactions with DNA and between TFs. Cell 2016 166, 538-554DOI: (10.1016/j.cell.2016.07.012) Copyright © 2016 Elsevier Inc. Terms and Conditions

Figure 3 Variable Chromatin Modules (A) Correlated TF (e.g., PU.1 or RNA polymerase II [PolII]) binding and chromatin mark (e.g., H3K27Ac, H3K4me1, H3K4me3) enrichment analyses across individuals allows the mapping of “variable chromatin modules” (VCMs) (shown in light green in the upper panel and in network format in the panel below). VCMs thus embody variable regions with highly coordinated, molecular phenotypes. (B) The majority of VCMs have a “totem” structure of stacked molecular phenotypes that do not correlate with other neighboring molecular phenotypes and, as such, reflect local chromatin state variation. Multi-VCMs encompass sub-Mb regions involving distinct regulatory elements whose activity is highly coordinated and driven by a single or a few highly penetrating variants (“vcmQTL”) with enrichment in TF-bound regions. (C) VCMs constitute functional entities of higher-order chromatin organization embedded within topologically associating domains (TADs) and provide a molecular rationale as to how TF-DNA binding can be affected by distal genetic variation. Cell 2016 166, 538-554DOI: (10.1016/j.cell.2016.07.012) Copyright © 2016 Elsevier Inc. Terms and Conditions