Interpreting Variation in Human Non-Coding Genomic Regions Using Computational Approaches with Experimental Support Lisa Brooks, Ph.D., Mike Pazin, Ph.D.

Slides:



Advertisements
Similar presentations
Ch 17 Gene Expression I: Transcription
Advertisements

Genetic Analysis in Human Disease
Ch 11 – Gene Expression The control of a gene at transcription, translation for even the polypeptide.
Speaker: HU Xue-Jia Supervisor: WU Yun-Dong Date: 19/12/2013.
Four of the many different types of human cells: They all share the same genome. What makes them different?
DNA and Gene Expression. DNA Deoxyribonucleic Acid Deoxyribonucleic Acid Double helix Double helix Carries genetic information Carries genetic information.
Identifying promoters and regulatory elements for DNA variation
© 2006 W.W. Norton & Company, Inc. DISCOVER BIOLOGY 3/e
CS 374: Relating the Genetic Code to Gene Expression Sandeep Chinchali.
“An integrated encyclopedia of DNA elements in the human genome” ENCODE Project Consortium. Nature 2012 Sep 6; 489: Michael M. Hoffman University.
Presented by Karen Xu. Introduction Cancer is commonly referred to as the “disease of the genes” Cancer may be favored by genetic predisposition, but.
Paola CASTAGNOLI Maria FOTI Microarrays. Applicazioni nella genomica funzionale e nel genotyping DIPARTIMENTO DI BIOTECNOLOGIE E BIOSCIENZE.
Manolis Kellis Broad Institute of MIT and Harvard
Genetic Analysis in Human Disease. Learning Objectives Describe the differences between a linkage analysis and an association analysis Identify potentially.
ENCODE The Human Genome project sequenced “the human genome” “the human genome” that we have labeled as such doesn’t actually exist What we call.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Current Topics in Genomics and Epigenomics – Lecture 2.
An Introduction to ENCODE Mark Reimers, VIPBG (borrowing heavily from John Stamatoyannopoulos and the ENCODE papers)
The Center for Medical Genomics facilitates cutting-edge research with state-of-the-art genomic technologies for studying gene expression and genetics,
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
GWAS Hits and Functional Implications Peter Castaldi February 1, 2013.
Michael Cummings David Reisman University of South Carolina Gene Regulation Part 2 Chapter 9.
Understanding Gene Regulation Through Integrated Analysis of Genomic Data Guo-Cheng Yuan Department of Biostatistics and Computational Biology Dana-Farber.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
The same gene can have many versions.
Complexities of Gene Expression Cells have regulated, complex systems –Not all genes are expressed in every cell –Many genes are not expressed all of.
Recombination breakpoints Family Inheritance Me vs. my brother My dad (my Y)Mom’s dad (uncle’s Y) Human ancestry Disease risk Genomics: Regions  mechanisms.
Bioinformatics and Computational Biology
The Future of Genetics Research Lesson 7. Human Genome Project 13 year project to sequence human genome and other species (fruit fly, mice yeast, nematodes,
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
Starter What do you know about DNA and gene expression?
BIOINFORMATICS Ayesha M. Khan Spring 2013 Lec-8.
Chapter 11 Review. Explain the difference between each of the following 1. Operator, promoter -Operator: DNA segment where an inhibitor protein binds.
Molecules and mechanisms of epigenetics. Adult stem cells know their fate! For example: myoblasts can form muscle cells only. Hematopoetic cells only.
Genetics of Gene Expression BIOS Statistics for Systems Biology Spring 2008.
Epigenetics Abira Khan. What is Epigenetics?  Histone code: Modifications associated with transcriptional activation- primarily methylation and acetylation-would.
KEY CONCEPT 8.5 Translation converts an mRNA message into a polypeptide, or protein.
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
TRANSCRIPTION AND TRANSLATION Vocabulary. GENE EXPRESSION the appearance in a phenotype characteristic or effect attributed to a particular gene.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Integrative Genomics. Double-helix DNA strands are separated in the gene coding region Which enzyme detects the beginning of a gene ? RNA Polymerase (multi-subunit.
YOUR FUTURE STARTS WITH HOPE YOUR FUTURE STARTS WITH HOPE Genome Biology & Applied Bioinformatics Human Genome Mehmet Tevfik DORAK, MD PhD.
Looking Within Human Genome King abdulaziz university Dr. Nisreen R Tashkandy GENOMICS ; THE PIG PICTURE.
Genetics Journal Club Sumeet A. Khetarpal 10 December 2015.
Genomic Analysis: GWAS
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
Complex disease and long-range regulation: Interpreting the GWAS using a Dual Colour Transgenesis Strategy in Zebrafish.
Distribution of CpG dinucleotide in the human genome and differences in methylation patterns between normal and tumor cells. In the majority of the mammalian.
Functional Mapping and Annotation of GWAS: FUMA
Sunday, Tuesday & Thursday 2-3
Regulation of Gene Expression by Eukaryotes
Content and Labeling of Tests Marketed as Clinical “Whole-Exome Sequencing” Perspectives from a cancer genetics clinician and clinical lab director Allen.
Albert Xue, Binbin Huang, Jianrong Wang
The same gene can have many versions.
Concept 18.2: Eukaryotic gene expression can be regulated at any stage
The same gene can have many versions.
The same gene can have many versions.
From Prescription to Transcription: Genome Sequence as Drug Target
The same gene can have many versions.
The same gene can have many versions.
The same gene can have many versions.
The same gene can have many versions.
One SNP at a Time: Moving beyond GWAS in Psoriasis
The same gene can have many versions.
The same gene can have many versions.
The same gene can have many versions.
Deep Learning in Bioinformatics
The same gene can have many versions.
Presentation transcript:

Interpreting Variation in Human Non-Coding Genomic Regions Using Computational Approaches with Experimental Support Lisa Brooks, Ph.D., Mike Pazin, Ph.D. NHGRI Advisory Council May 20, 2013

“We do not know how to interpret variation in non-coding regions.” Whole-Genome Sequencing..ATTCGCAATTGTAAGGATA CATGCTGC....ACTCAC - ATTCTTAGGGTACATCATGCAGG..

Many Genes and Variants are Associated with a Disease Which are Causal?

Coding Regions Have the Genetic Code Variants: Synonymous Non-synonymous Stop Even this is complex, but a start.

1.5% of genome Looking under the lamppost for your keys Exon Regions

Non-coding DNA Variants are Known to Cause Human Diseases Prostate cancer Thyroid carcinoma Hereditary mixed polyposis syndrome Thalassemias Fragile X syndrome Alzheimer’s disease 6

Non-Coding DNA is Important for Therapeutics Non-coding variants are known to cause differences in drug metabolism Chemotherapy Anticoagulants Mood stabilizers 7 Blood May 6;115(18): Pharmacogenomics Apr;11(4): J Neurosci Mar 14;27(11):

NHGRI GWA Catalog dies Most GWAS Hits are Not in Exons 88-93% not in exons

Non-Coding DNA is Important for Disease and Adaptation About 90% of GWAS findings lie outside of protein-coding regions. More than 80% of recent adaptation signatures in three recent studies lie outside of protein- coding regions. 9 Stamatoyannopoulos, Science , 2012 Kingsley, Nature , 2012; Sabeti, Cell , 2013 Fraser, Genome Research 2013, PMID:

Non-Coding Regions Many types of functional elements

Concept for RFA Interpreting Variation in Human Non-Coding Genomic Regions Using Computational Approaches with Experimental Support Looking for highly innovative computational approaches to combine data sets to identify or narrow the set of potential variants that affect organismal function leading to disease risk or other traits.

Interpret the Variation Which variants potentially affect organismal function?Which variants potentially affect organismal function? May address how the effect is brought about.May address how the effect is brought about. May show the genetic architecture of a disease, including gene-gene and gene- environment interactions.May show the genetic architecture of a disease, including gene-gene and gene- environment interactions.

RFA Applications Include development of computational approaches to identify which variants potentially affect organismal functionalInclude development of computational approaches to identify which variants potentially affect organismal functional Include experimental data to support and assess these predictions.Include experimental data to support and assess these predictions. Not include production of functional dataNot include production of functional data (aside from data to support predictions) (aside from data to support predictions) Not include databases or simple aggregation of informationNot include databases or simple aggregation of information

Examples of Data Sets to Use GWAS, sequence, genotype, phenotype, ENCODE, expression, methylation, transcription factor binding sites, other functional studies, patterns of variation, genetic architecture of disease, gene x gene, gene x environment, animal models, …

Initiative Focus Genome-wide interpretation, not particular variants, genes, or regions. Focus on non-coding variants (OK if some are coding). May focus on particular classes of variants, such as CNVs, transcription factor binding sites, or CpG islands. The approaches must generalize beyond the specific datasets and diseases studied.

Narrow the Set of Variants (Example) whole genome GWAS region 1 region 2 region 3 Transcription in cell types of disease variants in region 2 genes ENCODE and regulation data small set of variants

Other Examples Variation data - genomic regions with selection Chromatin structure - indels affect regulation Promoter binding - promoter variants affect transcription factor binding, gene expression Epigenomic variability - organismal function

Experimental Support Each study must include experimental data (done by them or others). Provide support for the computational predictions.Provide support for the computational predictions. Allow assessment of prediction accuracy.Allow assessment of prediction accuracy. Range of types - more than plausibility, not necessarily gold-standard validation. May use model organism data.

Related Initiatives NHGRI: ENCODE tech development - exp NHGRI: Church CEGS - exp NIGMS: Function of variants – comp, exp NHLBI: Functional analyses - exp

Timeline Concept May 2013 Release RFAAug 2013 ReceiptJan 2014Jan 2015 Review June 2014 June 2015 Council Sept 2014Sept 2015 Funding Jan 2015Jan 2016 First round Second round

Each award up to $500,000 direct costs/yr. Make ~ 5-6 in each round. Round FY 15 FY 16 FY 17 FY 18 ($M) First $4.5 $4.5 $4.5 Second $4.5 $4.5 $4.5 Total $4.5 $9 $9 $4.5 Funding

“We can start to interpret variation in non-coding regions.” Whole-Genome Sequencing..ATTCGCAATTGTAAGGATA CATGCTGC....ACTCAC - ATTCTTAGGGTACATCATGCAGG..