Genetics Journal Club Sumeet A. Khetarpal 10 December 2015.

Slides:



Advertisements
Similar presentations
Methods to read out regulatory functions
Advertisements

Interpreting Variation in Human Non-Coding Genomic Regions Using Computational Approaches with Experimental Support Lisa Brooks, Ph.D., Mike Pazin, Ph.D.
What is Li-Fraumeni syndrome?
Speaker: HU Xue-Jia Supervisor: WU Yun-Dong Date: 19/12/2013.
CS 374: Relating the Genetic Code to Gene Expression Sandeep Chinchali.
Office hours Wednesday 3-4pm 304A Stanley Hall Review session 5pm Thursday, Dec. 11 GPB100.
Identification of obesity-associated intergenic long noncoding RNAs
Presented by Karen Xu. Introduction Cancer is commonly referred to as the “disease of the genes” Cancer may be favored by genetic predisposition, but.
Comparative Genomics II: Functional comparisons Caterino and Hayes, 2007.
ENCODE enhancers 12/13/2013 Yao Fu Gerstein lab. ‘Supervised’ enhancer prediction Yip et al., Genome Biology (2012) Get enhancer list away to genes DNase.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
GWAS Hits and Functional Implications Peter Castaldi February 1, 2013.
Differential Principal Component Analysis (dPCA) for ChIP-seq
Recombination breakpoints Family Inheritance Me vs. my brother My dad (my Y)Mom’s dad (uncle’s Y) Human ancestry Disease risk Genomics: Regions  mechanisms.
Thoughts on ENCODE Annotations Mark Gerstein. Simplified Comprehensive (published annotation, mostly in '12 & '14 rollouts)
Biol 456/656 Molecular Epigenetics Lecture #5 Wed. Sept 2, 2015.
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
Transcriptional Enhancers Looking out for the genes and each other Sridhar Hannenhalli Department of Cell Biology and Molecular Genetics Center for Bioinformatics.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
The regulation of Caspase 8 chIP-seq motifs mRNA expression DNA methylation.
Additional high-throughput sequencing techniques (finding all functional elements of genome) June 15, 2017.
CRISPRi-based genome-scale identification of functional long noncoding RNA loci in human cells Presented by Nur Ata Bruss and Xinyi Ma.
Genetics Journal Club Robert C. Bauer October 22nd, 2015.
Genomic Analysis: GWAS
Common variation, GWAS & PLINK
Epigenetics Continued
Monica Britton, Ph.D. Sr. Bioinformatics Analyst June 2016 Workshop
Complex disease and long-range regulation: Interpreting the GWAS using a Dual Colour Transgenesis Strategy in Zebrafish.
Epigenetics 04/04/16.
Functional Mapping and Annotation of GWAS: FUMA
breast cancer 2, early onsetpro What does this protein make up or do?
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
Gene Hunting: Design and statistics
What makes a mutant?.
Figure 3 Example of how a noncoding regulatory rheumatoid
Volume 145, Issue 2, Pages (August 2013)
Dynamic epigenetic enhancer signatures reveal key transcription factors associated with monocytic differentiation states by Thu-Hang Pham, Christopher.
Polymorphisms GWAS traits.
Beyond GWAS Erik Fransen.
Volume 152, Issue 3, Pages (January 2013)
Type 2 Diabetes With type 2 diabetes, your body either resists the effects of insulin — a hormone that regulates the movement of sugar into your cells.
Genetic-Variation-Driven Gene-Expression Changes Highlight Genes with Important Functions for Kidney Disease  Yi-An Ko, Huiguang Yi, Chengxiang Qiu, Shizheng.
Relationship between Genotype and Phenotype
In these studies, expression levels are viewed as quantitative traits, and gene expression phenotypes are mapped to particular genomic loci by combining.
High-Resolution Genetic Maps Identify Multiple Type 2 Diabetes Loci at Regulatory Hotspots in African Americans and Europeans  Winston Lau, Toby Andrew,
Polymorphisms GWAS traits.
by Denes Hnisz, Abraham S. Weintraub, Daniel S
A Comprehensive cis-eQTL Analysis Revealed Target Genes in Breast Cancer Susceptibility Loci Identified in Genome-wide Association Studies  Xingyi Guo,
Transcriptional Addiction in Cancer
Volume 58, Issue 4, Pages (May 2015)
Enhancer Connectome Nominates Target Genes of Inherited Risk Variants from Inflammatory Skin Disorders  Mark Y. Jeng, Maxwell R. Mumbach, Jeffrey M. Granja,
One SNP at a Time: Moving beyond GWAS in Psoriasis
Genetics of Human Cardiovascular Disease
Systematic mapping of functional enhancer-promoter connections with CRISPR interference by Charles P. Fulco, Mathias Munschauer, Rockwell Anyoha, Glen.
A systems view of genetics in chronic kidney disease
Volume 167, Issue 2, Pages e10 (October 2016)
Presentation by: Hannah Mays UCF - BSC 4434 Professor Xiaoman Li
Systematic mapping of functional enhancer–promoter connections with CRISPR interference by Charles P. Fulco, Mathias Munschauer, Rockwell Anyoha, Glen.
Adam C. Wilkinson, Hiromitsu Nakauchi, Berthold Göttgens  Cell Systems 
Anh Pham Conserved epigenomic signals in mice and humans reveal immune basis of Alzheimer’s disease.
Hunting for Celiac Disease Genes
Genetic mapping and epigenetic landscape of RUNX3 locus overlapping rs
CaQTL analysis identifies genetic variants affecting human islet cis-RE use. caQTL analysis identifies genetic variants affecting human islet cis-RE use.
Pan-cancer genome and transcriptome analyses of 1,699 paediatric leukaemias and solid tumours By: Anh Pham.
IMPACT: Genomic Annotation of Cell-State-Specific Regulatory Elements Inferred from the Epigenome of Bound Transcription Factors  Tiffany Amariuta, Yang.
The Genetics of Transcription Factor DNA Binding Variation
Beyond GWASs: Illuminating the Dark Road from Association to Function
Presentation transcript:

Genetics Journal Club Sumeet A. Khetarpal 10 December 2015

A common overall problem…

A common overall problem… Phenotype/Disease Unbiased genomic discovery Causal variant? Mechanism? Variants Gene Pathway Actionable deliverable

Published Genome-Wide Associations through 2013 ~12,000 SNP-trait associations NHGRI GWA Catalog

Maris lab at CHOP: Heritability of Neuroblastoma Bosse & Maris. Cancer. 2015

Neuroblastoma is a Pediatric Cancer Tumor of sympathetic nervous system most frequently presents in adrenal gland 650 cases/year in USA 10% of all pediatric cancers and 15% of all pediatric cancer deaths <50% of high-risk cancers contain recurrent somatic mutations (Pugh et al. 2013) Germline genetic variation may substantially impact heritability Germline mutations in the ALK/PHOX2B genes are implicated in rare familial neuroblastoma cases J Nucl Med, 2004

Neuroblastoma susceptibility loci from GWAS DUSP12 PLoS Genet 2011 HSD17B12 DDX4/IL31RA Low-Risk BARD1 Nat Genet 2009 LINC00340 NEJM 2008 LMO1 Nature 2011 High-Risk HACE1/LIN28B Nat Genet, 2012 TP53 JNCI, 2014

LIM Domain Only 1 (LMO1) Transcriptional co-regulator Part of a family of related proteins (LMO1-4) with oncogenic activity in T-ALL, breast cancer, neuroblastoma Matthews et al. Nat Rev Cancer. 2013

(Protective Genotype) (Protective Genotype) LMO1 is a neuroblastoma oncogene implicated in both tumor initiation and maintenance LMO1 Knockdown (Risk Genotype) LMO1 Knockdown (Protective Genotype) LMO1 Overexpression (Protective Genotype) Wang et al. Nature, 2011.

Hypothesis GWAS associated noncoding SNPs at LMO1 perturb LMO1 gene expression Goal Determine the causal variant and mechanism underlying association of LMO1 noncoding SNPs with neuroblastoma risk

Approach Fine mapping of the LMO1 locus Characterization of SNPs using publicly available datasets Impact of lead SNP on gene expression Impact of lead SNP on gene transcription

Fine Mapping of the LMO1 Locus Neuroblastoma GWAS ~2800 cases vs. 7500 controls Germline SNP Imputation from 1000 Genomes 2100 4200

Fine Mapping of the LMO1 Locus 27 SNPs with MAF>0.01 and P<10-5 Fig. 1a

Direct genotyping and meta-analysis of rs2168101 association 3200 Cases vs. 8500 Controls Common in Europeans + East Asians, rare/absent in Africans Direct genotyping of 146 / 2101 cases from GWAS  86% imputation accuracy Table 1

Conditional analysis of other LMO1 SNPs after adjusting for rs2168101 Ext. Fig. 1a

Conditional analysis of rs2168101 after adjusting for other LMO1 SNPs rs2168101 is the lead SNP of the lone signal at the LMO1 locus Ext. Fig. 1b

ENCODE ChIP Seq overlap of LMO1 SNPs in SKNSH neuroblastoma cell line Fig. 1b

Does the LMO1 lead SNP affect transcription factor binding? JASPAR Database Consensus TF binding motifs based on experimental ChIP-Seq data 202 nonredundant profiles in vertebrates http://jaspar.genereg.net

Does the LMO1 lead SNP affect transcription factor binding? Lead LMO1 SNP falls in conserved enhancer and GATA binding motif Fig. 1b

Impact of lead SNP on LMO1 expression LMO1 RNA reads in 127 high-risk tumors separated by genotype No T/T genotypes found in high-risk neuroblastoma tumors  risk alleles ‘predispose’ to the high-risk subset Fig. 2a

Impact of lead SNP on LMO1 expression Allelic imbalance of LMO1 expression in tumors from hets Fig. 2b – 12 tumors were heterozygous. Fig. 2a-b

Impact of lead SNP on LMO1 expression Allelic imbalance of nascent LMO1 expression in heterozygous cell lines Ancestral GATA allele of lead SNP  increased LMO1 gene expression Cis regulation? Fig. 2a-c

G(O)TA find the GATA GATA2 and GATA3 are highly expressed GATA factors in neuroblastoma tumors Neuroblastoma cell lines overexpressing GATA3 show increased LMO1 expression if G allele present Ext. Fig. 5a

GATA3 binding at LMO1 by ChIP GATA3 ChIP-Seq in neuroblastoma cell lines with differing rs2168101 genotype Fig. 4a

GATA3 binding at LMO1 by ChIP Allele-specific GATA3 ChIP-Seq at LMO1 lead SNP in cell lines ~Zero GATA3 binding to rs2168101 T allele! Fig. 4b

LMO1 lead SNP lies in active enhancer Fig. 4c, e

Summary Fine mapping validates LMO1 locus-neuroblastoma association and identifies single signal Lead SNP lies in conserved GATA TF binding motif Protective allele disrupts GATA3 binding, enhancer activity, gene expression Protective allele associated with reduced cell growth & increased survival in high-risk disease

Lingering Questions Busted enhancer, broken TF binding, both, or one-in-the-same? Molecular mechanism of disruption? Why is this particular GATA binding site so important for this region? Other long-range interactions disrupted? (Is it really all just LMO1?) Is locus really a ‘super-enhancer’?

Can this locus be defined as a super-enhancer? Proposed model for impact of lead SNP on GATA3-mediated LMO1 transcription Risk “G” allele of rs2168101: Exon 1 GATA3 AGATAA Exon 1 Exon 2 Exon 3 Exon 4 LMO1 Gene Can this locus be defined as a super-enhancer? Protective “T” allele of rs2168101: GATA3 Exon 2 Exon 3 Exon 4 Exon 1 ATATAA LMO1 Gene

Enhancers and Super-enhancers DNA element that activates transcription from a distance Enrichment of H3K4me1, H3K27ac, DNase I hypersensitivity Few 100 bp Super-enhancers Unusually strong enrichment for coactivators (e.g. Med1) Enrichment of Pol II, eRNA, p300, CBP, H3K27ac, H3K4me1/2, DNase I hypersensitivity Few 1000 bp High controversy over distinction from enhancers Poor agreement over definition Pott & Lieb. Nat Genet. 2015

‘Super-enhancers’ are controversial

Back to our common problem…

Back to our common problem… Phenotype/Disease How can we find causal variants and their mechanisms? Unbiased genomic discovery Variants Gene Pathway Actionable deliverable

There are some tools to help… Spain & Barrett. Hum Mol Genet. 2015

…but it’s still a hot mess Spain & Barrett. Hum Mol Genet. 2015

...and only a few have ‘succeeded’ so far Spain & Barrett. Hum Mol Genet. 2015

Germline genetic variation and pediatric cancer risk Whole-genome or whole-exome sequencing in ~1100 pediatric participants with cancer vs. ~960 cancer-free participants from 1000 genomes 8.5% of cancer patients had deleterious germline mutation Underestimate of total germline heritability

Thanks for coming!