High level GWAS analysis

Slides:



Advertisements
Similar presentations
What is an association study? Define linkage disequilibrium
Advertisements

Genome-wide Association Study Focus on association between SNPs and traits Tendency – Larger and larger sample size – Use of more narrowly defined phenotypes(blood.
Understanding GWAS Chip Design – Linkage Disequilibrium and HapMap Peter Castaldi January 29, 2013.
MALD Mapping by Admixture Linkage Disequilibrium.
Ingredients for a successful genome-wide association studies: A statistical view Scott Weiss and Christoph Lange Channing Laboratory Pulmonary and Critical.
Something related to genetics? Dr. Lars Eijssen. Bioinformatics to understand studies in genomics – São Paulo – June Image:
Haplotype Discovery and Modeling. Identification of genes Identify the Phenotype MapClone.
Understanding Genetics of Schizophrenia
Standardization of Pedigree Collection. Genetics of Alzheimer’s Disease Alzheimer’s Disease Gene 1 Gene 2 Environmental Factor 1 Environmental Factor.
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College
SNPs Daniel Fernandez Alejandro Quiroz Zárate. A SNP is defined as a single base change in a DNA sequence that occurs in a significant proportion (more.
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen,
©Edited by Mingrui Zhang, CS Department, Winona State University, 2008 Identifying Lung Cancer Risks.
CS177 Lecture 10 SNPs and Human Genetic Variation
From Genome-Wide Association Studies to Medicine Florian Schmitzberger - CS 374 – 4/28/2009 Stanford University Biomedical Informatics
Finnish Genome Center Monday, 16 November Genotyping & Haplotyping.
Association analysis Genetics for Computer Scientists Biomedicum & Department of Computer Science, Helsinki Päivi Onkamo.
Genome wide association studies (A Brief Start)
The International Consortium. The International HapMap Project.
Increasing Power in Association Studies by using Linkage Disequilibrium Structure and Molecular Function as Prior Information Eleazar Eskin UCLA.
OMICS Journals are welcoming Submissions
Power Calculations for GWAS
Single Nucleotide Polymorphisms (SNPs
SNPs and complex traits: where is the hidden heritability?
EQTLs.
Genomic Analysis: GWAS
Common variation, GWAS & PLINK
Multiple sclerosis genetics
Gil McVean Department of Statistics
Complex disease and long-range regulation: Interpreting the GWAS using a Dual Colour Transgenesis Strategy in Zebrafish.
Copyright © 2001 American Medical Association. All rights reserved.
Genetics of common complex diseases: a view from Iceland
Consideration for Planning a Candidate Gene Association Study With TagSNPs Shehnaz K. Hussain, PhD, ScM Epidemiology 243: Molecular.
Genetic Testing for the Clinician
upstream vs. ORF binding and gene expression?
Global Variation in Copy Number in the Human Genome
Pharmacogenetics: Implications of race and ethnicity on defining genetic profiles for personalized medicine  Victor E. Ortega, MD, Deborah A. Meyers,
Genome Wide Association Studies using SNP
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
Gene Hunting: Design and statistics
POLYMORPHISMS & ASSOCIATION TESTS
Epidemiology 101 Epidemiology is the study of the distribution and determinants of health-related states in populations Study design is a key component.
Power to detect QTL Association
Mapping Quantitative Trait Loci
Genome-wide Associations
Genome-wide Association Studies
Beyond GWAS Erik Fransen.
DNA Chip Data Interpretation Tools: Genmapp & Dragon View
Correlation for a pair of relatives
Genetic Drift, followed by selection can cause linkage disequilibrium
Genetic variations associated with diabetic nephropathy and type II diabetes in a Japanese population  S. Maeda, N. Osawa, T. Hayashi, S. Tsukada, M.
Medical genomics BI420 Department of Biology, Boston College
Genome-Wide Association Studies: Present Status and Future Directions
One SNP at a Time: Moving beyond GWAS in Psoriasis
Genetic Investigations of Kidney Disease: Core Curriculum 2013
Genetics of Human Cardiovascular Disease
Figure 2 LocusZoom plots
Identification of a genetic variant associated with abdominal aortic aneurysms on chromosome 3p12.3 by genome wide association  James R. Elmore, MD, Melissa.
BF528 - Genomic Variation and SNP Analysis
Medical genomics BI420 Department of Biology, Boston College
Evan G. Williams, Johan Auwerx  Cell 
An Expanded View of Complex Traits: From Polygenic to Omnigenic
Fig. 2 Genotype-induced differential gene expression is different in MDMi cells compared to monocytes. Genotype-induced differential gene expression is.
GWAS-eQTL signal colocalisation methods
Hunting for Celiac Disease Genes
The principles of genetic association
Genome wide association study for T2D-related loci
科学抑或哲学?—— 从“遗传率消失之谜”说起
Presentation transcript:

High level GWAS analysis Tzu L Phang Associate Professor of Bioinformatics Colorado Center for Personalized Medicine

Agenda What is GWAS? What does GWAS tell us? What should we consider when designing or interpreting GWAS?

What is a GWAS? A genome-wide association study is an approach that involves rapidly scanning markers across genome (~0.5M to 1M) of many people (~2K) to find genetic variations associated with a particular disease A large number of subjects are needed because Associations between SNPs and causal variants are expected to show low odds rations, typically below 1.5 In order to obtain a reliable signal, given the very large number of tests that are required, associations must show a high level of significance to survice the multiple testing correstion. Such studies are particularly useful in finding genetic variations that contribute to common, complex disease

What is the goal of GWAS? Goal: uncover the genetic basis of a given disease Basic Idea: a rather vague idea of a study design that involves genotyping cases and controls at a large number (104 to 106) of SNP markers spread (in some unspecified way) throughout the genome. Look for associations between the genotypes at each locus and disease statues

Why are such studies possible now? Computing resources to perform GWAS are now commonly available The completion of the Human Genome Project in 2003, and the International HapMap Project in 2005, researchers now have a set of research tools that make it possible to find the genetic contributions to common disease

General design and workflow

Case-control GWAS Obtain DNA from people with disease of interest (cases) and unaffected controls Run each DNA sample on a SNP chip to measure genotypes at 300,000-1,000,000 SNPs in cases and controls Identify SNPs where one allele is significantly more common in cases than controls The SNP is associated with disease

Association does not imply causation Suppose that genotypes at a particular SNP are significantly associated with disease

Association does not imply causation This may be because the SNP is associated with some other factor (a confounder), which is associated with disease but is not in the same causal pathway

Association ≠ Causation Possible confounders of genetic associations: Ethnic ancestry Genotyping batch, genotyping centre DNA quality Environmental exposures in the same causal pathway Nicotine receptors --> smoking --> lung cancer Hung et al, Nature 452: 633 (2008) + other articles in same issue Alcohol dehydrogenase genes --> alcohol consumption --> throat cancer Hashibe et al, Nature Genetics 40: 707 (2008)

Helpful confounding: linkage disequilibrium Linkage disequilibrium (LD) is the non- independence of alleles at nearby markers in a population because of a lack of recombination between the markers LD is helpful, because not all SNPs have to be genotyped

Direct and indirect association Functional SNP is genotyped and an association is found Functional SNP (blue) is not genotyped, but a number of other SNPs (red) in LD with the functional SNP are genotyped, and an association is found for those SNPS

Odds ratio – measure of effect size

Interpretation of odds ratio

Multiple testing considerations Suppose you test 500,000 SNPs for association with disease Expect around 500,000 x 0.05 = 25,000 to have p-value less than 0.05 More appropriate significance threshold p = 0.05 / 500,000 = 10-7 Genome-wide significance In our MS GWAS we considered SNPs for follow- up if they had p- values less than 0.001 To detect a smaller p-value need a larger study