Genome Biology & Applied Bioinformatics Mehmet Tevfik DORAK, MD PhD

Slides:



Advertisements
Similar presentations
Jump to Contents Instructor Tutorial essignments.com Paperless assignment submission system.
Advertisements

Short introduction to the use of PEARL General properties First tier assessments Higher tier assessments Before looking at first and higher tier assessments,
The Maize Inflorescence Project Website Tutorial Nov 7, 2014.
NGS Analysis Using Galaxy
Prepare Your Academy for 2013
Moodle (Course Management Systems). Assignments 1 Assignments are a refreshingly simple method for collecting student work. They are a simple and flexible.
Using the FOCUS Web Site Teacher’s Desk. Topics Covered in this Presentation n Accessing the FOCUS Web site n Importing and Creating Classes n Adding.
Desoto County Schools
Website Development with Dreamweaver
SAGExplore web server tutorial for Module II: Genome Mapping.
Fission Yeast Computing Workshop -1- Searching, querying, browsing downloading and analysing data using PomBase Basic PomBase Features Gene Page Overview.
Analyzing and Interpreting Quantitative Data
Instructors begin using McGraw-Hill’s Homework Manager by creating a unique class Web site in the system. The Class Homepage becomes the entry point for.
D-1 Extended Learning Module D Decision Analysis With Spreadsheet Software.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics Lab v1 | Saurabh Sinha1 Powerpoint by Casey Hanson.
Copyright OpenHelix. No use or reproduction without express written consent1.
RiskMeter Batch Training. Batch Tool The Riskmeter batch tool allows users to submit multiple locations all at once. The Riskmeter batch tool allows users.
WINKS 7 Tutorial 7 – Advanced Topic: Labels and Formats Permission granted for use for instruction and for personal use. © Alan C. Elliott,
1 ADVANCED MICROSOFT POWERPOINT Lesson 9 – Importing and Exporting Information Microsoft Office 2003: Advanced.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics | Saurabh Sinha | PowerPoint by Casey Hanson.
Copyright OpenHelix. No use or reproduction without express written consent1.
GVS: Genome Variation Server Materials prepared by: Warren C. Lathe, PhD Updated: Q Version 2.
Chapter 6: Analyzing and Interpreting Quantitative Data
SAGExplore web server tutorial. The SAGExplore server has three different modules …
Learning Aim C.  In this section we will look at how text, tables, forms and frames can be used in web pages.
1 Bioinformatics Tools for Genotyping Frances Tong Dr. Garry Larson, Ph.D City of Hope Department of Molecular Medicine Southern California Bioinformatics.
PowerTeacher Gradebook PTG and PowerTeacher Pro PT Pro A Comparison The following slides will give you an overview of the changes that will occur moving.
Online Submission and Management Information -- Authors
SNPs and complex traits: where is the hidden heritability?
Module 4: Building Reports
A Complete guide to Summer 2016 Registration Once you’re logged into WyoRecords Click on Registration *Slides will forward after 6 seconds or by mouse.
Functional Annotations of
Regulatory Genomics Lab
SAGExplore web server tutorial for Module III:
Mehmet Tevfik DORAK, MD PhD 2nd Practical Bioinformatics Course
Functional Mapping and Annotation of GWAS: FUMA
Genome Biology & Applied Bioinformatics Mehmet Tevfik DORAK, MD PhD
Bioinformatic Tools for Epigenetic Research
Genome Biology & Applied Bioinformatics Mehmet Tevfik DORAK, MD PhD
Introduction to WRDS data platform
Analyzing and Interpreting Quantitative Data
Microsoft Office Illustrated
A Complete Guide to Summer 2016 Registration Once you’re logged into WyoRecords, click on Registration *Slides will forward after 6 seconds or by mouse.
Tutorial for using Case It for bioinformatics analyses
Creating assignments: Best Practices
Genome Biology & Applied Bioinformatics Mehmet Tevfik DORAK, MD PhD
Finding Answers through Data Collection
Yonglan Zheng Galaxy Hands-on Demo Step-by-step Yonglan Zheng
Focus on Reports Session 3 LD_Pay & Distribution Adj History To print:
Major Databases/Portals
Beyond GWAS Erik Fransen.
Benchmark Series Microsoft Word 2016 Level 2
Bradford VTS Web Pages for General Practice ONLINE RESOURCES
High-Resolution Genetic Maps Identify Multiple Type 2 Diabetes Loci at Regulatory Hotspots in African Americans and Europeans  Winston Lau, Toby Andrew,
Combined Analysis of Genome-wide Association Studies for Crohn Disease and Psoriasis Identifies Seven Shared Susceptibility Loci  David Ellinghaus, Eva.
Genetics and genomics of psychiatric disease
Genome Biology & Applied Bioinformatics Mehmet Tevfik DORAK, MD PhD
A Tutorial Lincoln Stein, Cold Spring Harbor Laboratory
Virtual Association Studies
Genome Biology & Applied Bioinformatics Mehmet Tevfik DORAK, MD PhD
Patient Safety Reporting System Course Contents
A Flexible Bayesian Framework for Modeling Haplotype Association with Disease, Allowing for Dominance Effects of the Underlying Causative Variants  Andrew.
One SNP at a Time: Moving beyond GWAS in Psoriasis
State of New Jersey Department of Health Patient Safety Reporting System Module 3 – Root Cause Analysis.
Regulatory Genomics Lab
Exploring Microsoft® Office 2016 Series Editor Mary Anne Poatsy
GWAS-eQTL signal colocalisation methods
 .
Regulatory Genomics Lab
Leslie Chavez and Will Bardé
Presentation transcript:

Genome Biology & Applied Bioinformatics Mehmet Tevfik DORAK, MD PhD SNP Sets Mehmet Tevfik DORAK, MD PhD

Schedule

Outline I. Proxy SNP (ssSNP) sets II. Independent SNP Sets (HaploReg, VADE) II. Independent SNP Sets RegulomeDB, CADD Variant Set Enrichment Analysis (GRAIL) III. wANNOVAR, AVIA ssSNP = Statistically similar (correlated)

Proxy SNP Set Generation HaploReg generates a statistically similar SNP (ssSNP) set for each SNP together with their functional annotations. The Set Options tab allows setting the desired r2 value (0.80 by default). eQTL results are included. For best outcome, choose the text file output option and examine the results in a spreadsheet.

Proxy SNP Set Once an ssSNP set is generated, it can be annotated using the tools listed in next few slides to assess their functionality to choose the most likely causal SNP. The ssSNP list (and HaploReg results) can also be used to make a list of target genes to be used in a gene set enrichment analysis (for combinatorial effects of multiple variants in linkage disequilibrium). Most tools available to generate a list of ssSNPs only include SNPs within 500kb of the lead SSNP. To go beyond that may be necessary, for example, for HLA region SNPs, and this can be achieved by downloading selected SNP genotypes from 1KG (Ferret is helpful) and running the LD analysis on Haploview.

Independent SNP Sets Unlike ssSNP sets (which include SNPs correlated with one another), we may want to analyse a set of independent SNPs too. This may be GWAS results, like SNPs that have shown associations. These SNPs can be assessed for causality using a variety of tools. One of them is RegulomeDB (http://regulomedb.org), which provides a functionality score for regulatory effects of non-coding region SNPs.

Independent SNP Sets CADD (http://cadd.gs.washington.edu/home) combines more than 80 features to provide a single summary score. However, it requires uploading of a list of SNPs in VCF format. Results are provided in a file together with all 80+ features for each SNP (including PolyPhen and SIFT scores). CADD has been superseded by DANN (https://cbcl.ics.uci.edu/public_data/DANN/data) and EIGEN (http://www.columbia.edu/~ii2135/eigen.html), but these two cannot be reached via web like CADD.

Independent SNP Sets To obtain CADD scores of up to 100 thousand variants, prepare a VCF file (explanation is given in this page) and upload your file. Make sure you check the box at the bottom of this screen to get details of all features for all variants (rather than just CADD scores). The results can be downloaded from this site. The scaled CADD scores will be given in the last column of the results file (called PHREG). Link: http://cadd.gs.washington.edu/score

VSE Analysis for SNPs (GRAIL) VSE = Variant set enrichment Links: GRAIL: http://www.broadinstitute.org/mpg/grail PAPER: http://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1000534

VSE Analysis for SNPs (GRAIL) Figure 1. Gene Relationships Among Implicated Loci (GRAIL) method consists of four steps. (A) Identifying genes in disease regions. For each independent associated SNP or CNV from a GWAS, GRAIL defines a disease region; then GRAIL identifies genes overlapping the region. In this region there are three genes. We use gene 1 (pink arrow) as an example. (B) Assess relatedness to other human genes. GRAIL scores each gene contained in a disease region for relatedness to all other human genes. GRAIL determines gene relatedness by looking at words in gene references; related genes are defined as those whose abstract references use similar words. Here gene 1 has word counts that are highly similar to gene A but not to gene B. All human genes are ranked according to text-based similarity (green bar), and the most similar genes are considered related. (C) Counting regions with similar genes. For each gene in a disease region, GRAIL assesses whether other independent disease regions contain highly significant genes. GRAIL assigns a significance score to the count. In this illustration gene 1 is similar to genes in three of the regions (green arrows), including gene A. (D) Assigning a significance score to a disease region. After all of the genes within a region are scored, GRAIL identifies the most significant gene as the likely candidate. GRAIL corrects its significance score for multiple hypothesis testing, to assign a significance score to the region. Links: GRAIL: http://www.broadinstitute.org/mpg/grail PAPER: http://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1000534

More Advanced Tools AVIA and wANNOVAR are advanced tools which provide much more detail than other web-based tools. They both use the ANNOVAR database, and provide a web interface to access that database. Prepare the required input files as described (samples provided), submit and get your results. AVIA: https://avia-abcc.ncifcrf.gov/apps/site/sub_analysis/?id=3 wANNOVAR: http://wannovar.usc.edu

… Looking forward ….. http://www.dorak.info

Course material: http://www.dorak.info/genbiol/course.html