Biomedical Master Introduction to genome-wide association studies Metabolic diseases (B. Thorens) Biomedical Master: Metabolic diseases Lausanne, November.

Slides:



Advertisements
Similar presentations
Analysis of imputed rare variants
Advertisements

What is an association study? Define linkage disequilibrium
SHI Meng. Abstract The genetic basis of gene expression variation has long been studied with the aim to understand the landscape of regulatory variants,
1 Harvard Medical School Mapping Transcription Mechanisms from Multimodal Genomic Data Hsun-Hsien Chang, Michael McGeachie, and Marco F. Ramoni Children.
Meta-analysis for GWAS BST775 Fall DEMO Replication Criteria for a successful GWAS P
Genetic Analysis in Human Disease
Perspectives from Human Studies and Low Density Chip Jeffrey R. O’Connell University of Maryland School of Medicine October 28, 2008.
Objectives Cover some of the essential concepts for GWAS that have not yet been covered Hardy-Weinberg equilibrium Meta-analysis SNP Imputation Review.
Ferdinand van ’t Hooft Cardiovascular Genetics and Genomics Group Karolinska Institutet, Stockholm, Sweden Genome-Wide Association Study GWAS
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL FastANOVA: an Efficient Algorithm for Genome-Wide Association Study Xiang Zhang Fei Zou Wei Wang University.
Modeling genetic and phenotypic data with the use of statistics Discovery of phenotypes influenced by the season of birth Can environment modify genetic.
Lab 13: Association Genetics. Goals Use a Mixed Model to determine genetic associations. Understand the effect of population structure and kinship on.
Dr. Almut Nebel Dept. of Human Genetics University of the Witwatersrand Johannesburg South Africa Significance of SNPs for human disease.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
Teresa Przytycka NIH / NLM / NCBI RECOMB 2010 Bridging the genotype and phenotype.
1 FSTL4 and SEMA5A are associated with alcohol dependence: meta- analysis of two genome-wide association studies Kesheng Wang, PhD Department of Biostatistics.
More Powerful Genome-wide Association Methods for Case-control Data Robert C. Elston, PhD Case Western Reserve University Cleveland Ohio.
Quantitative Genetics
Biomedical Master Introduction to genome-wide association studies Metabolic diseases (B. Thorens) Biomedical Master: Metabolic diseases Lausanne, October.
MSc GBE Course: Genes: from sequence to function Genome-wide Association Studies Sven Bergmann Department of Medical Genetics University of Lausanne Rue.
Computational Complexity The complexity of the MG model for a single SNP is determined by the complexity of the matrix operations in formulas used to iteratively.
Using biological networks to search for interacting loci in genome-wide association studies Mathieu Emily et. al. European journal of human genetics, e-pub.
BSc Course: "Experimental design“ Genome-wide Association Studies Sven Bergmann Department of Medical Genetics University of Lausanne Rue de Bugnon 27.
Genome-Wide Association Studies
Give me your DNA and I tell you where you come from - and maybe more! Lausanne, Genopode 21 April 2010 Sven Bergmann University of Lausanne & Swiss Institute.
Computational analysis of biological systems: Past, present and future Sven Bergmann UNIL tenure track commission 5 January 2010.
Quantitative Genetics
Review Session Monday, November 8 Shantz 242 E (the usual place) 5:00-7:00 PM I’ll answer questions on my material, then Chad will answer questions on.
Correlation & Regression
Manolis Kellis Broad Institute of MIT and Harvard
Genome Variations & GWAS
Genetic Analysis in Human Disease. Learning Objectives Describe the differences between a linkage analysis and an association analysis Identify potentially.
Rare and common variants: twenty arguments G.Gibson Homework 3 Mylène Champs Marine Flechet Mathieu Stifkens 1 Bioinformatics - GBIO K.Van Steen.
Modes of selection on quantitative traits. Directional selection The population responds to selection when the mean value changes in one direction Here,
IUMSP Institut universitaire de médecine sociale et préventive, Lausanne Exploring the association of the CYP1A1- CYP1A2 locus with blood pressure in CoLaus.
IAP workshop, Ghent, Sept. 18 th, 2008 Mixed model analysis to discover cis- regulatory haplotypes in A. Thaliana Fanghong Zhang*, Stijn Vansteelandt*,
The Complexities of Data Analysis in Human Genetics Marylyn DeRiggi Ritchie, Ph.D. Center for Human Genetics Research Vanderbilt University Nashville,
What host factors are at play? Paul de Bakker Division of Genetics, Brigham and Women’s Hospital Broad Institute of MIT and Harvard
From Genome-Wide Association Studies to Medicine Florian Schmitzberger - CS 374 – 4/28/2009 Stanford University Biomedical Informatics
Online Mendelian Inheritance in Man (OMIM): What it is & What it can do for you Knowledge Management & Eskind Biomedical Library January 27, 2012 helen.
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Quantitative Genetics. Continuous phenotypic variation within populations- not discrete characters Phenotypic variation due to both genetic and environmental.
Quantitative Genetics
Jianfeng Xu, M.D., Dr.PH Professor of Public Health and Cancer Biology Director, Program for Genetic and Molecular Epidemiology of Cancer Associate Director,
Methods in genome wide association studies. Norú Moreno
Lab 13: Association Genetics December 5, Goals Use Mixed Models and General Linear Models to determine genetic associations. Understand the effect.
POLYMORPHISM AND VARIANT ANALYSIS Saurabh Sinha, University of Illinois.
Future Directions Pak Sham, HKU Boulder Genetics of Complex Traits Quantitative GeneticsGene Mapping Functional Genomics.
Lecture 24: Quantitative Traits IV Date: 11/14/02  Sources of genetic variation additive dominance epistatic.
Lecture 21: Quantitative Traits I Date: 11/05/02  Review: covariance, regression, etc  Introduction to quantitative genetics.
An quick overview of human genetic linkage analysis
Shankar Subramaniam University of California at San Diego Data to Biology.
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
An atlas of genetic influences on human blood metabolites Nature Genetics 2014 Jun;46(6)
Increasing Power in Association Studies by using Linkage Disequilibrium Structure and Molecular Function as Prior Information Eleazar Eskin UCLA.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Power and Meta-Analysis Dr Geraldine M. Clarke Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015 Africa Centre for.
Date of download: 7/2/2016 Copyright © 2016 American Medical Association. All rights reserved. From: How to Interpret a Genome-wide Association Study JAMA.
Chapter 13 Simple Linear Regression
Genomic Analysis: GWAS
David Daniel Rico Sarvenaz Zóltan.
Genome Wide Association Studies using SNP
Gene-set analysis Danielle Posthuma & Christiaan de Leeuw
Gene Hunting: Design and statistics
Beyond GWAS Erik Fransen.
Linking Genetic Variation to Important Phenotypes
GENOME WIDE ASSOCIATION STUDIES (GWAS)
The Population Reference Sample, POPRES: A Resource for Population, Disease, and Pharmacological Genetics Research  Matthew R. Nelson, Katarzyna Bryc,
Evan G. Williams, Johan Auwerx  Cell 
Presentation transcript:

Biomedical Master Introduction to genome-wide association studies Metabolic diseases (B. Thorens) Biomedical Master: Metabolic diseases Lausanne, November 8, 2010 Sven Bergmann University of Lausanne & Swiss Institute of Bioinformatics

A Systems Biology approach Large (genomic) systems many uncharacterized elements relationships unknown computational analysis should:  improve annotation  reveal relations  reduce complexity Small systems elements well-known many relationships established quantitative modeling of systems properties like:  Dynamics  Robustness  Logics

Overview Population stratification Our whole genome associations New Methods and Approaches

ATTGCAATCCGTGG...ATCGAGCCA…TACGATTGCACGCCG… ATTGCAAGCCGTGG...ATCTAGCCA…TACGATTGCAAGCCG… ATTGCAATCCGTGG...ATCGAGCCA…TACGATTGCACGCCG…ATTGCAAGCCGTGG...ATCTAGCCA…TACGATTGCAAGCCG… Genetic variation in SNPs (Single Nucleotide Polymorphisms)

6’189 individuals Phenotypes 159 measurement 144 questions Genotypes SNPs CoLaus = Cohort Lausanne Collaboration with: Vincent Mooser (GSK), Peter Vollenweider & Gerard Waeber (CHUV)

Analysis of Genotypes only Principle Component Analysis reveals SNP-vectors explaining largest variation in the data

Ethnic groups cluster according to geographic distances PC1 PC2

PCA of POPRES cohort

Predicting location according to SNP-profile...

… is pretty accurate!

The Swiss segregate according to language

PC-Analysis of genotypic profile Is surprisingly accurate! Is useful for forensic purposes or for individuals interested in their ancestry Is useful for population stratification in Genome-wide Association studies

Phenotypic variation:

What is association? chromosomeSNPstrait variant Genetic variation yields phenotypic variation Population with ‘ ’ allele Distributions of “trait”

Association using regression genotypeCoded genotype phenotype

Regression formalism (monotonic) transformation phenotype (response variable) of individual i effect size (regression coefficient) coded genotype (feature) of individual i p(β=0) error (residual) Goal: Find effect size that explains best all (potentially transformed) phenotypes as a linear function of the genotypes and estimate the probability (p-value) for the data being consistent with the null hypothesis (i.e. no effect)

Whole Genome Association

Current microarrays probe ~1M SNPs! Standard approach: Evaluate significance for association of each SNP independently: significance

Whole Genome Association significance Manhattan plot observed significance Expected significance Quantile-quantile plot Chromosome & position GWA screens include large number of statistical tests! Huge burden of correcting for multiple testing! Can detect only highly significant associations ( p < α / #(tests) ~ )

Genome-wide meta-analysis for serum calcium identifies significantly associated SNPs near the calcium-sensing receptor (CASR) gene Karen Kapur, Toby Johnson, Noam D. Beckmann, Joban Sehmi, Toshiko Tanaka, Zolt á n Kutalik, Unnur Styrkarsdottir, Weihua Zhang, Diana Marek, Daniel F. Gudbjartsson, Yuri Milaneschi, Hilma Holm, Angelo DiIorio, Dawn Waterworth, Andrew Singleton, Unnur Steina Bjornsdottir, Gunnar Sigurdsson, Dena Hernandez, Ranil DeSilva, Paul Elliott, Gudmundur Eyjolfsson, Jack M Guralnik, James Scott, Unnur Thorsteinsdotti, Stefania Bandinelli, John Chambers, Kari Stefansson, G é rard Waeber, Luigi Ferrucci, Jaspal S Kooner, Vincent Mooser, Peter Vollenweider, Jacques S. Beckmann, Murielle Bochud, Sven Bergmann

Current insights from GWAS: Well-powered (meta-)studies with (ten-)thousands of samples have identified a few (dozen) candidate loci with highly significant associations Many of these associations have been replicated in independent studies

Current insights from GWAS: Each locus explains but a tiny (<1%) fraction of the phenotypic variance All significant loci together explain only a small (<10%) of the variance

The “Missing variance” (Non-)Problem Why should a simplistic (additive) model using incomplete or approximate features possibly explain anything close to the genetic variance of a complex trait? … and it doesn ’ t have to as long as Genome-wide Association Studies are meant to as an undirected approach to elucidate new candidate loci that impact the trait!

1.Improve measurements: - measure more variants (e.g. by UHS) - measure other variants (e.g. CNVs) - measure “molecular phenotypes” 2.Improve models: - proper integration of uncertainties - include interactions - multi-layer models How could our models become more predictive?

Towards a layered Systems Model We need intermediate (molecular) phenotypes to better understand organismal phenotypes

Network Approaches for Integrative Association Analysis Using knowledge on physical gene-interactions or pathways to prioritize the search for functional interactions

Transcription Modules reduce Complexity SB, J Ihmels & N Barkai Physical Review E (2003) /ExpressionView

Association of (average) module expression is often stronger than for any of its constituent genes

Analysis of genome-wide SNP data reveals that population structure mirrors geography Genome-wide association studies elucidate candidate loci for a multitude of traits, but have little predictive power so far Future improvement will require –better genotyping (CGH, UHS, …) –New analysis approaches (interactions, networks, data integration) Take-home Messages: