Dennis Shasha, Courant Institute, New York University With

Slides:



Advertisements
Similar presentations
Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome ECS289A.
Advertisements

Periodic clusters. Non periodic clusters That was only the beginning…
Computational discovery of gene modules and regulatory networks Ziv Bar-Joseph et al (2003) Presented By: Dan Baluta.
Outline Questions from last lecture? P. 40 questions on Pax6 gene Mechanism of Transcription Activation –Transcription Regulatory elements Comparison between.
Promoter and Module Analysis Statistics for Systems Biology.
Transcriptional Networks BINDING SITES: TRANSCRIPTION FACTORS BIND TO THESE. MAY INDUCE AND MAY REPRESS.
20,000 GENES IN HUMAN GENOME; WHAT WOULD HAPPEN IF ALL THESE GENES WERE EXPRESSED IN EVERY CELL IN YOUR BODY? WHAT WOULD HAPPEN IF THEY WERE EXPRESSED.
A Probabilistic Dynamical Model for Quantitative Inference of the Regulatory Mechanism of Transcription Guido Sanguinetti, Magnus Rattray and Neil D. Lawrence.
From Sequence to Expression: A Probabilistic Framework Eran Segal (Stanford) Joint work with: Yoseph Barash (Hebrew U.) Itamar Simon (Whitehead Inst.)
STRATEGY FOR GENE REGULATION 1.INFORMATION IN NUCLEIC ACID – CIS ELEMENT CIS = NEXT TO; ACTS ONLY ON THAT MOLECULE 2.TRANS FACTOR (USUALLY A PROTEIN) BINDS.
TRANSFAC Project Roadmap Discussion.  Structure DNA-binding domain (DBD)  The portion (domain) of the transcription factor that binds DNA Trans-activating.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
BACKGROUND E. coli is a free living, gram negative bacterium which colonizes the lower gut of animals. Since it is a model organism, a lot of experimental.
1 Is Gene Position Adaptively Favored?. 2 Why do we care? Genomic clusters of genes Yeast 98% of genes in metabolic pathways cluster (Lee & Sonnhammer)
Fuzzy K means.
The Hardwiring of development: organization and function of genomic regulatory systems Maria I. Arnone and Eric H. Davidson.
Regulatory element detection using correlation with expression (REDUCE) Literature search WANG Chao Sept 14, 2004.
Promoter Analysis using Bioinformatics, Putting the Predictions to the Test Amy Creekmore Ansci 490M November 19, 2002.
Protein-Protein Interaction Screens. Bacterial Two-Hybrid System selectable marker RNA polymerase DNA binding protein bait target sequence target.
Computational Molecular Biology Biochem 218 – BioMedical Informatics Gene Regulatory.
Motif finding: Lecture 1 CS 498 CXZ. From DNA to Protein: In words 1.DNA = nucleotide sequence Alphabet size = 4 (A,C,G,T) 2.DNA  mRNA (single stranded)
Searching for TFBSs with TRANSFAC - Hot topics in Bioinformatics.
Reconstructing gene networks Analysing the properties of gene networks Gene Networks Using gene expression data to reconstruct gene networks.
Chapter 11 Phage strategies.
10/17/05 D Dobbs ISU - BCB 444/544X: Genes & Genomes1 10/17/05 Genes & Genomes (formerly Gene Prediction - 1)
Unraveling condition specific gene transcriptional regulatory networks in Saccharomyces cerevisiae Speaker: Chunhui Cai.
Identification of cell cycle-related regulatory motifs using a kernel canonical correlation analysis Presented by Rhee, Je-Keun Graduate Program in Bioinformatics.
The Lac Operon An operon is a length of DNA, made up of structural genes and control sites. The structural genes code for proteins, such as enzymes.
The Operon 操縱元 a functioning unit of genomic material containing a cluster of genes under the control of a single regulatory signal or promoter.
Data Mining the Yeast Genome Expression and Sequence Data Alvis Brazma European Bioinformatics Institute.
Regulation of Gene Expression in Bacteria and Their Viruses
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
Knowledge prior to this research? Questions addressed? Experimental approaches used? and what outcome? Impact of these findings? Future experiments? Research.
Enhancers and 3D genomics Noam Bar RESEARCH METHODS IN COMPUTATIONAL BIOLOGY.
Identifying submodules of cellular regulatory networks Guido Sanguinetti Joint work with N.D. Lawrence and M. Rattray.
Regulation of Gene Expression
A Tale of Three Inferences
Reverse-engineering transcription control networks timothy s
Control of Gene Expression
Control of Gene Expression
Motifs BCH364C/394P - Systems Biology / Bioinformatics
Regulation of Gene Expression
System Structures Identification
Structure of proximal and distant regulatory elements in the human genome Ivan Ovcharenko Computational Biology Branch National Center for Biotechnology.
Prediction of Regulatory Elements for Non-Model Organisms Rachita Sharma, Patricia.
Genetic Regulatory Networks
Control of Gene Expression
Mechanisms of lncRNA function.
Chapter 12.5 Gene Regulation.
Effect of polymorphisms on transcriptional regulation in mice
1 Department of Engineering, 2 Department of Mathematics,
Cell Signaling.
1 Department of Engineering, 2 Department of Mathematics,
Relationship between Genotype and Phenotype
A Zero-Knowledge Based Introduction to Biology
1 Department of Engineering, 2 Department of Mathematics,
Relationship between Genotype and Phenotype
Patterns of control of gene expression
From Prescription to Transcription: Genome Sequence as Drug Target
Control of Gene Expression in Eukaryotic cells
Even-skipped gene regulation
Revealing Global Regulatory Perturbations across Human Cancers
Principle of Epistasis Analysis
Revealing Global Regulatory Perturbations across Human Cancers
Genome-wide analysis of p53 occupancy.
Schematic model of effector pathways that mediate tumor suppression by p53. Schematic model of effector pathways that mediate tumor suppression by p53.
MexT-associated downregulation of oprD expression.
Prokaryotic (Bacterial) Gene Regulation
Motifs BCH339N Systems Biology / Bioinformatics – Spring 2016
Relationship between Genotype and Phenotype
Presentation transcript:

What is the Best Way to Find the Binding Site for a Transcription Factor? Dennis Shasha, Courant Institute, New York University With Philip Benfey and Ken Birnbaum Biology Department, New York University.

Transcriptional Networks Induction Specificity Modularity AAAA AAAA AAAA e.g. cis-element AAAA AAAA Repression AAAA AAAA AAAA AAAA Time 1 Time 2 Time 3

Genomic and Expression Data Cis-regulatory regions of expressed genes Expres-sion Level Time

Clusters of Co-Expressed Genes Expres-sion level Time a a a a a a a a a Over-represented motifs

Low Correlation Expression Level Time Time Transcription Factor Dowstream Genes

Modularity From: Arnone & Davidson, Development, 1997 later expression module early specification module From: Arnone & Davidson, Development, 1997

Transcription Factor X Expres-sion Time NAAAAAA.. ..TTTTTTN

Time Expres-sion Gene Cis - regulatory region A Transcription Factor X Expression levels of genes with ACA?GTC in their promoters Time Gene Cis - regulatory region ACA?GTC A

Time Expres-sion Gene Cis - regulatory region A B Transcription Factor X Expres-sion Expression levels of genes with ACA?GTC in their promoters Time Gene Cis - regulatory region ACA?GTC A ACA?GTC B

Time Expres-sion Gene Cis - regulatory region A B C Transcription Factor X Expres-sion Expression levels of genes with ACA?GTC in their promoters Time Gene Cis - regulatory region ACA?GTC A ACA?GTC B ACA?GTC C

Composite expression of ACAGTC at Time 5 Transcription Factor X Expres-sion Expression levels of genes with ACA?GTC in their promoters Time Gene Cis - Expression regulatory Level Example: Time 5 region ACA?GTC A 2 ACA?GTC B 4 ACA?GTC C 2 = 8 Composite expression of ACAGTC at Time 5

Composite expression of ACAGTC at Time 5 Transcription Factor X Composite expression of genes with ACA?GTC in promoter Expres-sion Time Expression levels of genes with ACA?GTC in their promoters Gene Cis - Expression regulatory Level Example: Time 5 region ACA?GTC A 2 ACA?GTC B 4 ACA?GTC C 2 = 8 Composite expression of ACAGTC at Time 5

Cooperative Binding Model TFs AND Binding Sites TF Expression Cell Type A Cell Type B Cell Type C Z Z Expression Level A B C Z

X Cooperative Binding Model TFs AND Binding Sites TF Expression Cell Type A Cell Type B Cell Type C X Z Expression Level A B C Target Gene Expression Expression Level A B C

X X X X Cooperative Binding Model TFs AND Binding Sites TF Expression Cell Type A Cell Type B Cell Type C Z Z Expression Level A B C Z Z X X Target Gene Expression Z Z Z Z Z X Expression Level X A B C

X X X X Cooperative Binding Model TFs AND Binding Sites TF Expression Cell Type A Cell Type B Cell Type C Z Z Expression Level A B C Z Z X X Target Gene Expression Z Z Z Z Z X Expression Level X A B C

Cooperative Binding Model TFs OR Binding Sites

Independent Binding Model: well handled by Bussemaker et al.

Assumptions: TF RNA Expression = TF protein (protein movement) TFs are active where they are expressed (co-factors) Binding sites are within 2 kb of initiation site

Testing the Method Yeast Fully sequenced genome Genome-wide mRNA expression profiles 300 knockout lines from Rosetta (Hughes et al. 2000) 2 datasets on yeast progressing through the cell cycle (Spellman et al. 1998; Cho et al. 1998)

Results for STE12 *From SCPD (Zhang, Cold Spring Harbor) Dennis, the point of the lower *From SCPD (Zhang, Cold Spring Harbor)

Results for STE12 *From SCPD (Zhang, Cold Spring Harbor) Dennis, the point of the lower *From SCPD (Zhang, Cold Spring Harbor)

Results for STE12 *From SCPD (Zhang, Cold Spring Harbor) Dennis, the point of the lower *From SCPD (Zhang, Cold Spring Harbor)

Overall Results

Conclusions Technique: correlate transcription factor expression with cis-element expression. Can capture information that would be missed by gene expression correlation. Can handle cooperative (AND) and independent (OR) cases. Does less well for complex circuits. Future effort: manipulate promoters to eliminate false positives. (Information theory + experiments)