Introduction to epigenetics: chromatin modifications, DNA methylation and the CpG Island landscape (part 2) Héctor Corrada Bravo CMSC858P Spring 2012 (many.

Slides:



Advertisements
Similar presentations
Epigenetics Xiaole Shirley Liu STAT115, STAT215, BIO298, BIST520.
Advertisements

Visualising and Exploring BS-Seq Data
Special Topics in Modern Genetics: Epigenetics
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
Defective de novo methylation of viral and cellular DNA sequences in ICF syndrome cells Robertson K. et al. Human Molecular Genetics, 2002 Gergana Ugrinova.
Microarray Type Analyses using Second Generation Sequencing
Comparative Genomic Hybridization (CGH). Outline Introduction to gene copy numbers and CGH technology DNA copy number alterations in breast cancer (Pollack.
Estrogen and its receptors play an important role in breast carcinogenesis. In humans, there are two subtypes of estrogen receptors (ER), ER  and ER ,
Why microarrays in a bioinformatics class? Design of chips Quantitation of signals Integration of the data Extraction of groups of genes with linked expression.
Committee Meeting April 24 th 2014 Characterizing epigenetic variation in the Pacific oyster (Crassostrea gigas) Claire Olson School of Aquatic and Fishery.
DNA Methylation Assays High Throughput Data Analysis BIOS , VCU Winter 2010 Mark Reimers, PhD.
Epigenetics Lab December 1, 2008 Goals of today’s lab: 1.Understand the basic molecular techniques used in the lab to study epigenetic silencing in cancer.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
The virochip (UCSF) is a spotted microarray. Hybridization of a clinical RNA (cDNA) sample can identify specific viral expression.
An Introduction to ENCODE Mark Reimers, VIPBG (borrowing heavily from John Stamatoyannopoulos and the ENCODE papers)
The Genome is Organized in Chromatin. Nucleosome Breathing, Opening, and Gaping.
Amandine Bemmo 1,2, David Benovoy 2, Jacek Majewski 2 1 Universite de Montreal, 2 McGill university and Genome Quebec innovation centre Analyses of Affymetrix.
Application of New DNA Sequencing Technologies for the Study of Epigenetic Abnormalities in Breast Cancer John R. Edwards Columbia Genome.
The Center for Medical Genomics facilitates cutting-edge research with state-of-the-art genomic technologies for studying gene expression and genetics,
Genomica Funcional Dr. Víctor Treviño A7-421
Vidyadhar Karmarkar Genomics and Bioinformatics 414 Life Sciences Building, Huck Institute of Life Sciences.
Microarrays and Their Uses Brad Windle, Ph.D
Epigenetic Analysis BIOS Statistics for Systems Biology Spring 2008.
Epigenetics Heritable characteristics of the genome other than the DNA sequence Heritable during cell-division (mitosis) To a lesser extent also over generations.
ChIP-chip Data. DNA-binding proteins Constitutive proteins (mostly histones) –Organize DNA –Regulate access to DNA –Have many modifications Acetylation,
Lawrence Hunter, Ph.D. Director, Computational Bioscience Program University of Colorado School of Medicine
Summarization of Oligonucleotide Expression Arrays BIOS Winter 2010.
Contribution of Epigenetic Variation to Expression Changes Among Tissues and Genotypes Steve Eichten – Springer Lab PAG iPlant Workshop 1/17/12.
Other genomic arrays: Methylation, chIP on chip… UBio Training Courses.
Analysis of protein-DNA interactions with tiling microarrays
Companion PowerPoint slide set DNA Methylation & Cadmium Exposure in utero An Epigenetic Analysis Activity for Students This teacher slide set was created.
Microarray analysis Quantitation of Gene Expression Expression Data to Networks BIO520 BioinformaticsJim Lund Reading: Ch 16.
Introduction to epigenetics: chromatin modifications, DNA methylation and the CpG Island landscape Héctor Corrada Bravo CMSC702 Spring 2013 (many slides.
Trends Biomedical Science
Supplemental Figure 1. False trans association due to probe cross-hybridization and genetic polymorphism at single base extension site. (A) The Infinium.
Gene Expression Profiling Brad Windle, Ph.D
ChIP-seq Downstream Analysis Xiaole Shirley Liu STAT115, STAT215, BIO298, BIST520.
Differential Methylation Analysis
DNA Methylation Regulates Gene Expression in Intracranial Aneurysms
Integrated veterinary unit research (IVRU)
Distribution of CpG dinucleotide in the human genome and differences in methylation patterns between normal and tumor cells. In the majority of the mammalian.
Discovery of Multiple Differentially Methylated Regions
ppmi EPIgenetics Andy Singleton and Dena Hernandez
Companion PowerPoint slide set DNA Methylation & Cadmium Exposure in utero An Epigenetic Analysis Activity for Students This teacher slide set was created.
Companion PowerPoint slide set DNA Methylation & Cadmium Exposure in utero An Epigenetic Analysis Activity for Students This teacher slide set was created.
Companion PowerPoint slide set DNA Methylation & Cadmium Exposure in utero An Epigenetic Analysis Activity for Students This teacher slide set was created.
Identification of imprinted genes and imprinted DMRs
Linking Genetic Variation to Important Phenotypes
Exploring and Understanding ChIP-Seq data
Volume 6, Issue 6, Pages (June 2010)
Addition of H19 ‘Loss of Methylation Testing’ for Beckwith-Wiedemann Syndrome (BWS) Increases the Diagnostic Yield  Jochen K. Lennerz, Robert J. Timmerman,
Robust Detection of DNA Hypermethylation of ZNF154 as a Pan-Cancer Locus with in Silico Modeling for Blood-Based Diagnostic Development  Gennady Margolin,
A twin approach to unraveling epigenetics
Volume 26, Issue 4, Pages (October 2014)
Epigenetic regulation of miR-193b in liposarcomagenesis.
Volume 2, Issue 2, Pages (February 2008)
Volume 5, Issue 6, Pages (December 2013)
Volume 23, Issue 1, Pages 9-22 (January 2013)
Allelic Skewing of DNA Methylation Is Widespread across the Genome
Core promoter methylation in mediators of adipogenesis.
The human GPR109A promoter is methylated and GPR109A expression is silenced in human colon carcinoma cells. The human GPR109A promoter is methylated and.
Other genomic arrays: Methylation, chIP on chip…
Volume 78, Issue 6, Pages (September 2010)
Integrative analysis of 111 reference human epigenomes
Peiyong Jiang, K.C. Allen Chan, Y.M. Dennis Lo
Symmetrical Dose-Dependent DNA-Methylation Profiles in Children with Deletion or Duplication of 7q11.23  Emma Strong, Darci T. Butcher, Rajat Singhania,
Methylation status of IGFBPL1 in human breast cancer.
Epigenetic regulation of p16INK4a in human gastric cancer.
Genome-wide DNA hypomethylation associated with DNMT3A mutation in murine and human FLT3ITD AML. Human: A–C, volcano plot (A) representation of mean methylation.
Presentation transcript:

Introduction to epigenetics: chromatin modifications, DNA methylation and the CpG Island landscape (part 2) Héctor Corrada Bravo CMSC858P Spring 2012 (many slides courtesy of Rafael Irizarry)

How do we measure DNA methylation?

Microarray Data

One question… Where do we measure? At least 7 arrays are needed to measure entire genome CpG are depleated Remaining CpGs cluster

CpG Islands

But variation seen outside

McRBC No Methylation Cuts at A m CG or G m CG Input

McRBC Methylation

McRBC after GEL Methylation

McRBC after GEL Methylation

Now unmethylated No Methylation

McRBC after Gel No Methylation

Gene Expression Normalization does not work well here

We use control probes

There are also waves

Smoothing

McRBC on tiling two channel array We smooth

Proportion of neighboring CpG also methylated/not methylated

True signal (simulated)

Observed data

Observed data and true signal

What is methylated (above 50%)?

Naïve approach

Many false positives (FP)

Smooth

No FP, but one false negative

Smooth less? No FN, lots of FP

We prefer this!

CHARM DMR for three tissues (five replicates) Irizarry et al, Nature Genetics 2009

Some findings [Irizarry et al., 2009, Nat. Genetics]

Tissue easily distinguished

Cancer DMR

Many Regions like this Note: hypo and hyper methylation

Both hyper and hypo methylated

Cancer and Tissue DMRs coincide

DMR enriched in Shores

Still affects expression T-DMRs

Still affects expression C-DMRs

USING SEQUENCING (BS-SEQ)

TTCGATTACGATTCGATTACGA AAGCTAATGCTAAGCTAATGCT CH 3 TTCGATTACGATTCGATTACGA AAGCTAATGCTAAGCTAATGCT LiverBrain

TTCGATTACGATTCGATTACGA AAGCTAATGCTAAGCTAATGCT CH 3 TTCGATTACGATTCGATTACGA AAGCTAATGCTAAGCTAATGCT TTCGATTACGATTCGATTACGA AAGCTAATGCTAAGCTAATGCT TTCGATTACGATTCGATTACGA AAGCTAATGCTAAGCTAATGCT TTCGATTACGATTCGATTACGA AAGCTAATGCTAAGCTAATGCT TTCGATTACGATTCGATTACGA AAGCTAATGCTAAGCTAATGCT TTCGATTACGATTCGATTACGA AAGCTAATGCTAAGCTAATGCT 85% Methylation chr3:44,031,616-44,031,626

Bisulfite Treatment

GGGGAGCAGCATGGAGGAGCCTTCGGCTGACT GGGGAGCAGTATGGAGGAGTTTTCGGTTGATT

BS-seq GTCGTAGTATTTGTCT GTCGTAGTATTTGTNN TGTCGTAGTATCTGTC TATGTCGTAGTATTTG TATATCGTAGTATTTT TATATCGTAGTATTTG NATATCGTAGTATNTG TTTTATATCGCAGTAT ATATTTTATGTCGTA ATATTTTATCTCGTA ATATTTTATGTCGTA GA-TATTTTATGTCGT GATCACAGGTCTATCACCCTATTAACCACTCACGGGAGCTCTCCATGCATTTGGTATTTTCGTCTGGGGGGTATGCACGCGATAGCATTGCGAGACGCTGGAGCCGGAGCACCCTATGTCGCAGTATCTGTCTTTGATTCCTGCCTCATCCTATTATTTATCGCACCTACG TTCAATATT Coverage: 13 Methylation Evidence: 13 Methylation Percentage: 100%

BS-seq GTCGTAGTATTTGTCT GTCGTAGTATTTGTNN TGTCGTAGTATCTGTC TATGTCGTAGTATTTG TATATTGTAGTATTTT TATATCGTAGTATTTG NATATTGTAGTATNTG TTTTATATTGCAGTAT ATATTTTATGTCGTA ATATTTTATCTTGTA ATATTTTATGTCGTA GA-TATTTTATGTCGT GATCACAGGTCTATCACCCTATTAACCACTCACGGGAGCTCTCCATGCATTTGGTATTTTCGTCTGGGGGGTATGCACGCGATAGCATTGCGAGACGCTGGAGCCGGAGCACCCTATGTCGCAGTATCTGTCTTTGATTCCTGCCTCATCCTATTATTTATCGCACCTACG TTCAATATT Coverage: 13 Methylation Evidence: 9 Methylation Percentage: 69%

BS-seq GTCGTAGTATTTGTCT GTCGTAGTATTTGTNN TGTTGTAGTATCTGTC TATGTTGTAGTATTTG TATATTGTAGTATTTT TATATTGTAGTATTTG NATATTGTAGTATNTG TTTTATATTGCAGTAT ATATTTTATGTCGTA ATATTTTATCTTGTA ATATTTTATGTTGTA GA-TATTTTATGTCGT GATCACAGGTCTATCACCCTATTAACCACTCACGGGAGCTCTCCATGCATTTGGTATTTTCGTCTGGGGGGTATGCACGCGATAGCATTGCGAGACGCTGGAGCCGGAGCACCCTATGTCGCAGTATCTGTCTTTGATTCCTGCCTCATCCTATTATTTATCGCACCTACG TTCAATATT Coverage: 13 Methylation Evidence: 4 Methylation Percentage: 31%

BS-seq Alignment is much trickier: – Naïve strategy: do nothing, hope not many CpG in a single read – Smarter strategy: “bisulfite convert” reference: turn all Cs to Ts Also needs to be done on reverse complement reference and reads – Smartest strategy: be unbiased and try all combinations of methylated/un-methylated CpGs in each read Computationally expensive (see Hansen et al, 2011, for a strategy)

BS-seq There are similarities to SNP calling (we’ll see this in a couple of weeks) EXCEPT: we want to measure percentages – Use a binomial model to estimate p, percentage of methylation – Allow for sequencing errors, coverage differences, etc.

Measuring DNA Methylation Estimating percentages Use “local-likelihood” method – Based on loess (Plot courtesy of Kasper Hansen)

BS-seq Lister et al. 2009, Nature

Gene Expression Regulation: DNA methylation in promoter regions Lister et al. 2009, Nature

DNA methylation patterns within genomic regions Lister et al. 2009

Putting it together

What were we after? The epigenetic progenitor origin of human cancer [Feinberg, et al., Nature Reviews Genetics, 2006] Stochastic epigenetic variation as driving force of disease [Feinberg & Irizarry, PNAS, 2009] Phenotypic variation, perhaps epigenetically mediated, increases disease susceptibility Increased epigenetic and gene expression variability of specific genes/regions is a defining characteristic of cancer

What did we do? Custom Illumina methylation microarray Confirmed increased epigenetic variability in specific regions across five cancer types

What did we do? Custom Illumina methylation microarray Confirmed increased epigenetic variability in specific regions across five cancer types

What did we do? Custom Illumina methylation microarray Confirmed increased epigenetic variability in specific regions across five cancer types Confirmed same sites are involved in tissue differentiation

What did we do? Custom Illumina methylation microarray Whole genome sequencing of bisulfite treated DNA – Found large blocks of hypo-methylation (sometimes Mbps long) in colon cancer

What did we do? Custom Illumina methylation microarray Whole genome sequencing of bisulfite treated DNA – Found large blocks of hypo-methylation (sometimes Mbps long) in colon cancer – These regions coincide with hyper-variable regions across cancer types

What did we do? Custom Illumina methylation microarray Whole genome sequencing of bisulfite treated DNA Gene Expression Analysis

Gene Expression Data

When using multiple microarray experiments, proper normalization is key [McCall, et al., Biostatistics 2010]

Normalization is key fRMA: a single-chip normalization procedure GNUSE: a single-chip quality metric Barcode: a single-chip common-scale measurement

What did we do? Custom Illumina methylation microarray Whole genome sequencing of bisulfite treated DNA Gene Expression Analysis – Genes with hyper-variable gene expression in colon cancer are enriched in hypo-methylation blocks [Corrada Bravo, et al., under review]

What are we doing next? Custom Illumina methylation microarray Whole genome sequencing of bisulfite treated DNA Gene Expression Analysis – Genes with hyper-variable gene expression in colon cancer are enriched in hypo-methylation blocks

Bigger gene expression study 7,741 HGU133plus2 samples 598 normal tissue samples, 4,886 tumor samples 176 different tissue types 175 different GEO studies

Bigger gene expression study [Corrada Bravo, et al., under review]

What are we doing next? Custom Illumina methylation microarray Whole genome sequencing of bisulfite treated DNA Gene Expression Analysis – Genes with hyper-variable gene expression in colon cancer are enriched in hypo-methylation blocks – Tissue-specific genes have hyper-variable gene expression across cancer types [Corrada Bravo, et al., under review]