 CHANGE!! MGL Users Group meetings will now be on the 1 st Monday of each month 3:00-4:00 Room 3-2550 Note the change of time and room.

Slides:



Advertisements
Similar presentations
Methods to read out regulatory functions
Advertisements

ChIP-seq Data Analysis
V Improvements to 3kb Long Insert Size Paired-End Library Preparation Naomi Park, Lesley Shirley, Michael Quail, Harold Swerdlow Wellcome Trust Sanger.
Processing of miRNA samples and primary data analysis
ChIP-seq analysis Ecole de bioinformatique AVIESAN – Roscoff, Jan 2013.
SOLiD Sequencing & Data
Detecting DNA-protein Interactions Xinghua Lu Dept Biomedical Informatics BIOST 2055.
Analysis of ChIP-Seq Data
Ribosomal Profiling Data Handling and Analysis
NHGRI/NCBI Short-Read Archive: Data Retrieval Gabor T. Marth Boston College Biology Department NCBI/NHGRI Short-Read.
Promoter Analysis using Bioinformatics, Putting the Predictions to the Test Amy Creekmore Ansci 490M November 19, 2002.
High Throughput Sequencing
Mapping protein-DNA interactions by ChIP-seq Zsolt Szilagyi Institute of Biomedicine.
Massive Parallel Sequencing
Next Generation DNA Sequencing
RNA-seq workshop ALIGNMENT
Chromatin Immunoprecipitation DNA Sequencing (ChIP-seq)
High-resolution computational models of genome binding events Yuan (Alan) Qi Joint work with Gifford and Young labs Dana-Farber Cancer Institute Jan 2007.
Advancing Science with DNA Sequence Metagenome definitions: a refresher course Natalia Ivanova MGM Workshop September 12, 2012.
ChIP-seq hands-on Iros Barozzi, Campus IFOM-IEO (Milan) Saverio Minucci, Gioacchino Natoli Labs.
ChIP-chip Data. DNA-binding proteins Constitutive proteins (mostly histones) –Organize DNA –Regulate access to DNA –Have many modifications Acetylation,
I519 Introduction to Bioinformatics, Fall, 2012
RNA surveillance and degradation: the Yin Yang of RNA RNA Pol II AAAAAAAAAAA AAA production destruction RNA Ribosome.
Chip – Seq Peak Calling in Galaxy Lisa Stubbs Chip-Seq Peak Calling in Galaxy | Lisa Stubbs | PowerPoint by Casey Hanson.
Next Generation Sequencing
Runx1-VE+ Runx1+VE+CD41-Runx1+VE+CD41+Runx1+VE-CD41+ Supplementary Figure 1 Supplementary Figure 1: Validation of cell populations for gene expression.
Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Day 5-2 What bioinformatics.
Starting Monday M Oct 29 –Back to BLAST and Orthology (readings posted) will focus on the BLAST algorithm, different types and applications of BLAST; in.
Do not reproduce without permission 1 Gerstein.info/talks (c) (c) Mark Gerstein, 2002, Yale, bioinfo.mbb.yale.edu Gerstein Lab Aims in ModENCODE.
MBD-Chip Analysis Bioinformatics - Fish 546 Winter 2014.
Lecture-5 ChIP-chip and ChIP-seq
Analysis of ChIP-Seq Data Biological Sequence Analysis BNFO 691/602 Spring 2014 Mark Reimers.
No reference available
Motif Search and RNA Structure Prediction Lesson 9.
Supplemental Figure 1. Bias-corrected NGS bioinformatics strategies. Paired-end DNA sequencing reveals the sequence of the genomic clone, the sample ID.
Accessing and visualizing genomics data
Introduction of the ChIP-seq pipeline Shigeki Nakagome November 16 th, 2015 Di Rienzo lab meeting.
Chip – Seq Peak Calling in Galaxy Lisa Stubbs Lisa Stubbs | Chip-Seq Peak Calling in Galaxy1.
A brief guide to sequencing Dr Gavin Band Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015 Africa Centre for Health.
HOMER – a one stop shop for ChIP-Seq analysis
Reliable Identification of Genomic Variants from RNA-seq Data Robert Piskol, Gokul Ramaswami, Jin Billy Li PRESENTED BY GAYATHRI RAJAN VINEELA GANGALAPUDI.
Genome-wide characteristics of sequence coverage by next-generation sequencing: how does this impact interpretation? Jen Taylor Bioinformatics Team CSIRO.
Flow cytometry data analysis: SPADE for cell population identification and sample clustering Narahara.
Metagenomic Species Diversity.
Figure 1. Annotation and characterization of genomic target of p63 in mouse keratinocytes (MK) based on ChIP-Seq. (A) Scatterplot representing high degree.
Gene expression from RNA-Seq
MBD-Chip.
Invest. Ophthalmol. Vis. Sci ;57(10): doi: /iovs Figure Legend:
Sequence based searches:
S1 Supporting information Bioinformatic workflow and quality of the metrics Number of slides: 10.
Day 5 Session 29: Questions and follow-up…. James C. Fleet, PhD
TSS Annotation Workflow
Discovery tools for human genetic variations
Volume 6, Issue 3, Pages (September 2000)
Exploring and Understanding ChIP-Seq data
Volume 10, Issue 6, Pages (December 2002)
Taichi Umeyama, Takashi Ito  Cell Reports 
Volume 54, Issue 1, Pages (April 2014)
Epigenetics System Biology Workshop: Introduction
Volume 32, Issue 6, Pages (March 2015)
Fine-Resolution Mapping of TF Binding and Chromatin Interactions
Fine-Resolution Mapping of TF Binding and Chromatin Interactions
Volume 63, Issue 6, Pages (September 2016)
(Top) Construction of synthetic long read clouds with 10× Genomics technology. (Top) Construction of synthetic long read clouds with 10× Genomics technology.
Genomewide profiling of chromatin accessibility in prostate cancer specimens Genomewide profiling of chromatin accessibility in prostate cancer specimens.
Volume 41, Issue 2, Pages (January 2011)
Taichi Umeyama, Takashi Ito  Cell Reports 
Identification of chromatin modifying complex recruiting H3K9 methyltransferases. a, A MEME-ChIP analysis was performed to identify the transcription factor.
Chromatin basics & ChIP-seq analysis
Presentation transcript:

 CHANGE!! MGL Users Group meetings will now be on the 1 st Monday of each month 3:00-4:00 Room Note the change of time and room

 Notes on ChIP-seq library preparation and initial handling MGL Users Group 1/21/15

Library Preparation  What is the desired resolution of the experiment?  Is it desired to be able to identify a specific recognition motif?  Do we want to resolve binding to bp resolution?  Do we just need general areas?  Target resolution will influence both sample preparation and initial library construction.  It may also influence a decision to do either paired-end or single-end sequencing.  Paired end will result in more reliable mapping, generally, by disambiguation of highly similar sequences with another proximal read.  It can also potentially resolve the ‘ends’ and sizes of protected fragments

HiSeq / SOLiD reads are short  Unless the ChIP targets have a small footprint, even paired end reads may not span a full pulldown target in some cases.  Only ends will be read, with intervening sequence ‘invisible’ to overall read density.  In the case of digested nucleosome fragments, for example, it may be possible to have more than one tandem ligand/target on a single resulting fragment.  Again, if resolution is key, it’s often desirable to ensure efficient cleavage/digestion of DNA prior to pulldown.  Additional steps may be added to size-select for fragments matching most closely in size to the expected single-ligand protected fragment.

Always run an Input control  ChIP-seq identifies regions of enrichment in the IP pool which may be affected by relative abundance of those in the input.  To control for artificial enrichment, the input control serves as a normalization for apparent peaks of enriched signal in the IP.

Additional treatments pre- library construction  Large pulldown fragments were subjected to additional Covaris shearing prior to library construction. Input Immuno-precipitated Mono-nucleosomes ~151 nt Poly-nucleosomes ~1,000-8,000 nt  >1kb necessitate mate-pair library construction otherwise.

Final library sizes were similar  After Covaris shearing, construction of standard fragment libraries proceeded as normal. Input Immuno-precipitated Library ~285 nt Library ~285 nt

The common workflow for ChIPseq following sequencing Bardet, et al. Nature Protocols (2012)

Data Alignment  Alignment strategies may differ, particularly if redundant or highly similar sequences are expected.  Alignment modes:  Unique alignment – Only uniquely best alignments are used  Random assignment – If equal best alignments are found, randomly assign the read to one of them  Multiple assignment – If equal best alignments are found, assign the same read to all of them  Depending on the aligner used, often a random/multi aligned read will be reported as mapping quality of 0.  Visualization tools may mask mapping quality below a certain threshold

Peak identification & Mining Software  Peak calling/comparisons  HOMER, MACS, ODIN, R (DBChIP, others), MAnorm  Peak annotation  HOMER, Bedtools, GREAT, Cistrome, R (ChIPpeakAnno)  Vizualization  IGV, UCSC Genome Browser, NGS-Plot, R (base & other packages)  Motif Identification  Interpro (known), Pfam (known), MEME (discovery/known)  Enrichment Analysis  DAVID, gProfiler, BiNGO, AmiGO, R (GOstats, others)