Functional Mapping and Annotation of GWAS: FUMA

Slides:



Advertisements
Similar presentations
Geuvadis RNAseq UNIGE Genetic regulatory variants
Advertisements

Control of Gene Expression
Gene Regulation in Eukaryotic Cells. Gene regulation is complex Regulation, and therefore, expression of a gene is complex. Regulation of these genes.
Speaker: HU Xue-Jia Supervisor: WU Yun-Dong Date: 19/12/2013.
Introduction to BioInformatics GCB/CIS535
Bioinformatics Alternative splicing Multiple isoforms Exonic Splicing Enhancers (ESE) and Silencers (ESS) SpliceNest Lecture 13.
CS 374: Relating the Genetic Code to Gene Expression Sandeep Chinchali.
“An integrated encyclopedia of DNA elements in the human genome” ENCODE Project Consortium. Nature 2012 Sep 6; 489: Michael M. Hoffman University.
Regulation of eukaryotic gene sequence expression
Day 2! Chapter 15 Eukaryotic Gene Regulation Almost all the cells in an organism are genetically identical. Differences between cell types result from.
ENCODE enhancers 12/13/2013 Yao Fu Gerstein lab. ‘Supervised’ enhancer prediction Yip et al., Genome Biology (2012) Get enhancer list away to genes DNase.
Control of Gene Expression Eukaryotes. Eukaryotic Gene Expression Some genes are expressed in all cells all the time. These so-called housekeeping genes.
Geuvadis RNAseq analysis at UNIGE Analysis plans
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
The Genome is Organized in Chromatin. Nucleosome Breathing, Opening, and Gaping.
Gary Stormo by Andrew Bardee. History Born 1950 in South Dakota Undergraduate in Biology from Caltech PhD in Molecular Biology from University of Colorado.
Regulation of Gene Expression Eukaryotes
GWAS Hits and Functional Implications Peter Castaldi February 1, 2013.
From Genomes to Genes Rui Alves.
Complexities of Gene Expression Cells have regulated, complex systems –Not all genes are expressed in every cell –Many genes are not expressed all of.
Recombination breakpoints Family Inheritance Me vs. my brother My dad (my Y)Mom’s dad (uncle’s Y) Human ancestry Disease risk Genomics: Regions  mechanisms.
Thoughts on ENCODE Annotations Mark Gerstein. Simplified Comprehensive (published annotation, mostly in '12 & '14 rollouts)
Genetics of Gene Expression BIOS Statistics for Systems Biology Spring 2008.
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
Transcriptional Enhancers Looking out for the genes and each other Sridhar Hannenhalli Department of Cell Biology and Molecular Genetics Center for Bioinformatics.
Genetics Journal Club Sumeet A. Khetarpal 10 December 2015.
Fig Prokaryotes and Eukaryotes
Lesson Four Structure of a Gene.
Online BIOS QTL atlases
Lesson Four Structure of a Gene.
Functional Annotations of
Complex disease and long-range regulation: Interpreting the GWAS using a Dual Colour Transgenesis Strategy in Zebrafish.
Control of Gene Expression
Bioinformatic Tools for Epigenetic Research
Gene-set analysis Danielle Posthuma & Christiaan de Leeuw
Gene Hunting: Design and statistics
Genome Biology & Applied Bioinformatics Mehmet Tevfik DORAK, MD PhD
LncRNAs exert their effects by diverse mechanisms. LncRNAs exert their effects by diverse mechanisms. (A) lncRNAs can.
Regulation of Gene Expression by Eukaryotes
Post-GWAS and Mechanistic Analyses
Interpreting noncoding variants
Effect of polymorphisms on transcriptional regulation in mice
Regulation of Gene Expression
Gene Regulation.
Volume 152, Issue 3, Pages (January 2013)
Genetic-Variation-Driven Gene-Expression Changes Highlight Genes with Important Functions for Kidney Disease  Yi-An Ko, Huiguang Yi, Chengxiang Qiu, Shizheng.
Coordinately Controlled Genes in Eukaryotes
Disentangling the Effects of Colocalizing Genomic Annotations to Functionally Prioritize Non-coding Variants within Complex-Trait Loci  Gosia Trynka,
Genome Biology & Applied Bioinformatics Mehmet Tevfik DORAK, MD PhD
In these studies, expression levels are viewed as quantitative traits, and gene expression phenotypes are mapped to particular genomic loci by combining.
mRNA Degradation and Translation Control
Genetics and genomics of psychiatric disease
Genetic variation in DREs could be a causative factor in dysregulation of distal target gene expression. Genetic variation in DREs could be a causative.
Tuuli Lappalainen, Stephen B. Montgomery, Alexandra C
Working in the Post-Genomic C. elegans World
Genetic Regulatory Mechanisms of Smooth Muscle Cells Map to Coronary Artery Disease Risk Loci  Boxiang Liu, Milos Pjanic, Ting Wang, Trieu Nguyen, Michael.
One SNP at a Time: Moving beyond GWAS in Psoriasis
ChIP-seq Robert J. Trumbly
Sherlock: Detecting Gene-Disease Associations by Matching Patterns of Expression QTL and GWAS  Xin He, Chris K. Fuller, Yi Song, Qingying Meng, Bin Zhang,
A systems view of genetics in chronic kidney disease
Are Interactions between cis-Regulatory Variants Evidence for Biological Epistasis or Statistical Artifacts?  Alexandra E. Fish, John A. Capra, William.
An Expanded View of Complex Traits: From Polygenic to Omnigenic
GWAS-eQTL signal colocalisation methods
From gene to protein.
Adam C. Wilkinson, Hiromitsu Nakauchi, Berthold Göttgens  Cell Systems 
Part I. Introduction and Genetic Engineering
Eukaryotic Gene Regulation
Deep Learning in Bioinformatics
A Role for Epigenetics in Psoriasis: Methylated Cytosine–Guanine Sites Differentiate Lesional from Nonlesional Skin and from Normal Skin  Johann E. Gudjonsson,
IMPACT: Genomic Annotation of Cell-State-Specific Regulatory Elements Inferred from the Epigenome of Bound Transcription Factors  Tiffany Amariuta, Yang.
Presentation transcript:

Functional Mapping and Annotation of GWAS: FUMA Danielle Posthuma Dept. Complex Trait Genetics, VU University Amsterdam //danielle/2017/FUMA_dp.ppt Boulder, TC31, March 8 2017

S Ripke et al. Nature (2014)

Physical & chemical environment Regulation & control Protein Translation RNA Transcription DNA

Interpreting the effect of SNPs on gene-products SNPs can act to alter the protein or RNA structure alter the protein or RNA level SNPs can be located in genes outside genes

Functional categories of SNPs Protein Coding SNPs in exonic regions may alter protein structure and/or function e.g nonsense SNPs or missense SNPs Splicing Regulation SNPs in splice sites may disrupt splicing regulation, resulting in exon skipping or intron retention They can also interfere with alternative splicing regulation by changing exonic splicing enhancers or silencers. Transcriptional Regulation SNPs in transcription regulatory regions (e.g. transcription factor binding sites, CpG islands, microRNAs, etc.) can alter binding sites, and thus disrupt proper gene regulation. Post-Translational Modification SNPs in protein-coding regions may alter post-translational modification sites, interfering with proper posttranslational modification.

Example SNP effect G healthy T disease This is a ‘stop-gained’ SNP

CADD scores Combined Annotation Dependent Depletion Score CADD is a tool for scoring the deleteriousness of SNPs as well as insertion/deletions variants in the human genome. A scaled CADD score >=10 indicates that these SNPs are predicted to be the 10% most deleterious substitutions; a score >=20 indicates the 1% most deleterious

Expression QTLs Alexandra C. Nica, and Emmanouil T. Dermitzakis Phil. Trans. R. Soc. B 2013;368:20120362 © 2013 The Author(s) Published by the Royal Society. All rights reserved.

The same regulatory regions and variant could be an eQTL for gene 2 in (a) tissue 1 and for gene 1 in (b) tissue 2, suggesting that limited interrogation of tissues would be misleading for the biological signal underlying disease. Alexandra C. Nica, and Emmanouil T. Dermitzakis Phil. Trans. R. Soc. B 2013;368:20120362 © 2013 The Author(s) Published by the Royal Society. All rights reserved.

RegulomeDB score Score Supporting data 1a eQTL + TF binding + matched TF motif + matched DNase Footprint + DNase peak 1b eQTL + TF binding + any motif + DNase Footprint + DNase peak 1c eQTL + TF binding + matched TF motif + DNase peak 1d eQTL + TF binding + any motif + DNase peak 1e eQTL + TF binding + matched TF motif 1f eQTL + TF binding / DNase peak 2a TF binding + matched TF motif + matched DNase Footprint + DNase peak 2b TF binding + any motif + DNase Footprint + DNase peak 2c TF binding + matched TF motif + DNase peak 3a TF binding + any motif + DNase peak 3b TF binding + matched TF motif 4 TF binding + DNase peak 5 TF binding or DNase peak 6 other

SNP annotation implicates genes Explore gene function Explore pathway enrichment of implicated genes Explore in which tissue gene is expressed

How to pinpoint causal genes based on GWAS? Are there functional variants in the GWAS risk loci? Are there regulatory variants or eQTLs in the GWAS risk loci? Are there SNPS with high CADD scores or low RegulomeDB scores? Where are the genes expressed that lie in the risk loci? What are the implicated pathways? Answers to these questions will generate hypotheses for functional follow-up experiments to investigate causality

Combine functional annotation information from different resources Many different repositories Need knowledge of how to normalize/interpret data Output can be huge, need visualizations for interpretation

FUMA developed by Kyoko Watanabe fuma.ctglab.nl

Demo fuma.ctglab.nl Watanabe K, Taskesen E, van Bochoven A, Posthuma D. FUMA: Functional mapping and annotation. doi: https://doi.org/10.1101/110023 BiorXiv

No exonic SNPs eQTLs Third implicated gene

Exonic SNPs, high CADD scores

And also several eQTLS

In sum Upload GWAS summary statistics Adds unmeasured SNPs to genomic risk loci, with known LD to measured SNPs Annotates all SNPs in genomic risk loci Functional annotations CADD scores RegulomeDB scores Chromatin states eQTL information Prioritizes genes based on user-defined filters Conducts gene-based and pathway analyses (MAGMA & enrichment) Provides interactive plots to visualize results All generated results can be downloaded