Mapping protein-DNA interactions by ChIP-seq Zsolt Szilagyi Institute of Biomedicine.

Slides:



Advertisements
Similar presentations
Transcriptional regulatory code of a eukaryotic genome Harbison CT et al. Nature, 2004, Vol. 431, pp Milica Volar 3 March 2005.
Advertisements

MEDIP, MAP AND MIRA Biological Affinity-Based Methods of DNA Methylation Detecton: Genome Wide.
Transcriptional regulation and promoter analysis
Computational discovery of gene modules and regulatory networks Ziv Bar-Joseph et al (2003) Presented By: Dan Baluta.
Finding Transcription Factor Binding Sites BNFO 602/691 Biological Sequence Analysis Mark Reimers, VIPBG.
Bioinformatics Lectures at Rice
Detecting DNA-protein Interactions Xinghua Lu Dept Biomedical Informatics BIOST 2055.
Canadian Bioinformatics Workshops
Next-generation sequencing and PBRC. Next Generation Sequencer Applications DeNovo Sequencing Resequencing, Comparative Genomics Global SNP Analysis Gene.
Greg Phillips Veterinary Microbiology
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
Gene expression analysis summary Where are we now?
CISC667, F05, Lec24, Liao1 CISC 667 Intro to Bioinformatics (Fall 2005) DNA Microarray, 2d gel, MSMS, yeast 2-hybrid.
Bryan Heck Tong Ihn Lee et al Transcriptional Regulatory Networks in Saccharomyces cerevisiae.
Genomic DNA purification
High Throughput Sequencing
January 2013 Training ChIP-Seq & Stem Cell Dye. Agenda 2 Active Motif is introducing 2 new products in our next issue of Motifvations which will reach.
Diabetes and Endocrinology Research Center The BCM Microarray Core Facility: Closing the Next Generation Gap Alina Raza 1, Mylinh Hoang 1, Gayan De Silva.
The impact of next-generation sequencing technology of genetics Elaine R. Mardis – 11 February Washington School of Medicine, Genome Sequencing Center.
ChIP-chip Data, Model and Analysis Ying Nian Wu Dept. Of Statistics UCLA Joint with Ming Zheng, Leah Barrera, Bing Ren.
Location analysis of transcription factor binding sites Guy Naamati Andrei Grodzovky.
es/by-sa/2.0/. Large Scale Approaches to the Study of Gene Expression Prof:Rui Alves Dept.
The virochip (UCSF) is a spotted microarray. Hybridization of a clinical RNA (cDNA) sample can identify specific viral expression.
The Genome is Organized in Chromatin. Nucleosome Breathing, Opening, and Gaping.
Chapter 13. The Impact of Genomics on Antimicrobial Drug Discovery and Toxicology CBBL - Young-sik Sohn-
Genomics – Next-Gen sequencing and Microarrays
Kristen Horstmann, Tessa Morris, and Lucia Ramirez Loyola Marymount University March 24, 2015 BIOL398-04: Biomathematical Modeling Lee, T. I., Rinaldi,
Massive Parallel Sequencing
* only 17% of SNPs implicated in freshwater adaptation map to coding sequences Many, many mapping studies find prevalent noncoding QTLs.
Finish up array applications Move on to proteomics Protein microarrays.
Next Generation Sequencing and its data analysis challenges Background Alignment and Assembly Applications Genome Epigenome Transcriptome.
ChIP-on-Chip and Differential Location Analysis Junguk Hur School of Informatics October 4, 2005.
Reconstruction of Transcriptional Regulatory Networks
Chromatin Immunoprecipitation DNA Sequencing (ChIP-seq)
Vidyadhar Karmarkar Genomics and Bioinformatics 414 Life Sciences Building, Huck Institute of Life Sciences.
Microarrays and Their Uses Brad Windle, Ph.D
Discovering Macromolecular Interactions. An experimental strategy for identifying new molecular actors in a process candidate approach general screen.
Proteome and interactome Bioinformatics.
I519 Introduction to Bioinformatics, Fall, 2012
Different approaches: Site degeneracy data from libraries with randomized target sites (in vitro, complexity of the initial library is crucial, involves.
Molecular Testing and Clinical Diagnosis
Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Day 5-2 What bioinformatics.
Starting Monday M Oct 29 –Back to BLAST and Orthology (readings posted) will focus on the BLAST algorithm, different types and applications of BLAST; in.
Algorithms in Bioinformatics: A Practical Introduction
Analysis of protein-DNA interactions with tiling microarrays
Proteomics, the next step What does each protein do? Where is each protein located? What does each protein interact with, if anything? What role does it.
Proteome and Gene Expression Analysis Chapter 15 & 16.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Two powerful transgenic techniques Addition of genes by nuclear injection Addition of genes by nuclear injection Foreign DNA injected into pronucleus of.
Trends Biomedical Science
Proteome and Gene Expression Analysis Chapter 15 & 16.
Microarrays and Other High-Throughput Methods BMI/CS 576 Colin Dewey Fall 2010.
Lecture-5 ChIP-chip and ChIP-seq
Analysis of ChIP-Seq Data Biological Sequence Analysis BNFO 691/602 Spring 2014 Mark Reimers.
MCB 317 Genetics and Genomics MCB 317 Topic 10, part 6 A Story of Transcription.
Biol 456/656 Molecular Epigenetics Lecture #5 Wed. Sept 2, 2015.
March 6, 2016 EpiQ Chromatin Analysis Kit A New Tool for Epigenetic Research Gábor Kohut PhD Field Application Specialist Central and Eastern Europe.
Higher Human Biology Unit 1 Human Cells KEY AREA 5: Human Genomics.
Introduction to Next Generation Sequencing. Strategies For Interrogating the Transcriptome Known genes Predicted genes Surrogate strategy Exon verification.
Library QA & QC Day 1, Video 3
Special Topics in Genomics ChIP-chip and Tiling Arrays.
Introduction The stem cell derived transcription factors SOX4, POU2F2 and BACH2 are known to be important in B-cell differentiation and B-cell malignancies.
Additional high-throughput sequencing techniques (finding all functional elements of genome) June 15, 2017.
Il principio della ChIP: arricchimento selettivo della frazione di cromatina contenente una specifica proteina La ChIP può anche esser considerata.
Sungkyunkwan University, School of Medicine.
Molecular techniques for the study of the interaction between……..
Chao He
Claudia Sala, David C. Grainger, Stewart T. Cole  Cell Host & Microbe 
Material for Quiz 5 from Chapter 8
Transcriptional Regulation by p53 through Intrinsic DNA/Chromatin Binding and Site- Directed Cofactor Recruitment  Joaquin M Espinosa, Beverly M Emerson 
Presentation transcript:

Mapping protein-DNA interactions by ChIP-seq Zsolt Szilagyi Institute of Biomedicine

Genome analysis -A need to understand genome function on a global scale -Gene regulation shapes cellular function. Transcription factors (TFs) are important determinants of overall transcription output of cells. Finding transcription binding sites (TFBS) Chromatin/TFBS landscape of the normal and diseased cell. Understanding ES cell growth/differentiation/reprogramming.

Traditional approaches to the problem -Experimental determination of transcription factors binding sites. EMSA, DNAse footprinting – in vitro - Computational prediction of binding motifs. Is a motif a real binding site? - DNA microarray analysis of target genes Is the target direct or indirect? The need for a more direct in vivo method.

Chromatin immunoprecipitation (ChIP) An approach to detect specific DNA regions/sequences associated with a protein of interest, in vivo. Became a powerful tool to analyze protein/DNA interactions, furthermore to detect any signal/modification associated with DNA/Chromatin. Made a huge impact on chromatin biology, epigenetics transcription research, etc.

1. Cross linking with FA Optimization is crucial. Chromatin immunoprecipitation (ChIP)

2. Cell lysis and sonication

Epitope tagging of protein of interest (HA, myc) -no need for raising antibody, -utilize commercial antibodies better sensitivity decreased noise Anything associated with chromatin can be ChIPed, if an antibody can be raised. Chromatin immunoprecipitation (ChIP) 3. Immunoprecipitation (IP) The protein of interest is immunoprecipitated together with the crosslinked DNA

4.Decrosslinking and purification of the DNA Chromatin immunoprecipitation (ChIP) Reverse the FA crosslinking

Chromatin immunoprecipitation (ChIP) 5. Analysis of ChIP DNA Identification of DNA regions associated with the protein/modification of interest -real-time PCR -DNA microarray (ChIP-chip) -Sequencing (ChIP-seq)

Controls for ChIP (ChIP-seq) -Input DNA the Chromatin sample processed parallel to the other samples but lacks the immunoprecipitation step. -No antibody control (IgG) the Chromatin sample processed parallel to the other samples and immunoprecipitated without specific antibody -No tag control chromatin processed in the same way as samples but from a strain not having a tag on the protein to be analysed. Input DNA Chromatin immunoprecipitation (ChIP) no tag IgG

-Problems with real-time PCR - Not genome wide, making the identification of unknown, potentially interesting binding sites unlikely. -By the development of microarray technologies ChIP on chip ChIP DNA is amplified, labeled and hybridized to microarray Genome wide landscape of binding, good but a little laborious. -By the development of high throughput sequencing methods ChIP-seq Analysis of ChIP DNA

Chip-seq technology Instead of hybridizing the ChIP DNA to a microarray, each sample is processed directly into a DNA library for sequencing and subsequent bioinformatics analysis. ChIP-Seq has improved sensitivity and reduced background over ChIP-chip. 2-4 times more TFBS are determined, compared with ChiP-chip Development of novel high-throughput sequencing methods - revolutionized genomic studies - enabled large scale sequence analysis through the generation of millions of short sequencing reads ChIP-seq technology

ChIP DNA quality for ChIP-seq Quality is checked by gelelectrophoresis (Bioanalyzer) - Should be between bp (ideally) - Quantity should be at least 5ng DNA Often the most common cause of failure in library preparation. - Scaling up the ChIP is necessary.

DNA library preparation Must be done before sequencing Represents the population of short DNA fragments in the sample. Different ways of library preparation depending on the sequencing platforms (e.g. Solexa/Illumina, SOLID/ABI) A large number of short reads (~36bp) are good enough for ChIP-seq approaches – advantage for Illumina and SOLID – most popular platforms today

1.Oligonucleotid adapters are introduced to the ends of small DNA fragments DNA library preparation (Illumina)

DNA library preparation and sequencing (Illumina) 2. Adapter ligation, PCR amplification (16-17 cycles) Barcode sequencing – multiplex illumina sequencing

DNA library preparation and sequencing (Illumina) 4. Library sequencing. ~ reads for yeast genome 13 M reads for human genome One flowcell generates ~8 M reads.

- Determination of binding sites from the sequence data is a challenge. Conceptually, genomic regions with an increased number of sequencing reads (tags) compared to control is considered to be a TFBS - Statistical filtering criteria is used to determine if these putative sites represent true binding sites. Sequence data analysis - After statistical analysis of binding sites a further analysis of data is required. These may include analysis of location of binding, relative to transcription factor binding sites or potential nearby target genes.