Genome-wide Association Studies

Slides:



Advertisements
Similar presentations
The genetic dissection of complex traits
Advertisements

Conifer Translational Genomics Network Coordinated Agricultural Project Genomics in Tree Breeding and Forest Ecosystem Management.
Planning breeding programs for impact
Frary et al. Advanced Backcross QTL analysis of a Lycopersicon esculentum x L. pennellii cross and identification of possible orthologs in the Solanaceae.
Association Mapping as a Breeding Strategy
Genetic Architecture of Kernel Composition in the Nested Association Mapping (NAM) Population Sherry Flint-Garcia USDA-ARS Columbia, MO.
Experimental crosses. Inbred Strain Cross Backcross.
Qualitative and Quantitative traits
Selective mapping and simulation study. high-density genome maps Are used for: Comparative mapping Map-based cloning Genome sequencing But genotyping.
ASSOCIATION MAPPING WITH TASSEL Presenter: VG SHOBHANA PhD Student CPMB.
Genomic Tools for Oat Improvement
Whole genome association mapping of beta-glucan content ir barley Ieva Mežaka, Nils Rostoks Advances in Plant Biotechnology in Baltic Sea region1.
Genome-wide association mapping Introduction to theory and methodology
QTL Mapping R. M. Sundaram.
MALD Mapping by Admixture Linkage Disequilibrium.
Plant of the day! Pebble plants, Lithops, dwarf xerophytes Aizoaceae
Signatures of Selection
Admixture Mapping Qunyuan Zhang Division of Statistical Genomics GEMS Course M Computational Statistical Genetics Computational Statistical Genetics.
Genetic Traits Quantitative (height, weight) Dichotomous (affected/unaffected) Factorial (blood group) Mendelian - controlled by single gene (cystic fibrosis)
MSc GBE Course: Genes: from sequence to function Genome-wide Association Studies Sven Bergmann Department of Medical Genetics University of Lausanne Rue.
Mapping Basics MUPGRET Workshop June 18, Randomly Intermated P1 x P2  F1  SELF F …… One seed from each used for next generation.
Module 7: Estimating Genetic Variances – Why estimate genetic variances? – Single factor mating designs PBG 650 Advanced Plant Breeding.
Methods of Genome Mapping linkage maps, physical maps, QTL analysis The focus of the course should be on analytical (bioinformatic) tools for genome mapping,
Multifactorial Traits
Genetic Mapping Oregon Wolfe Barley Map (Szucs et al., The Plant Genome 2, )
Fine mapping QTLs using Recombinant-Inbred HS and In-Vitro HS William Valdar Jonathan Flint, Richard Mott Wellcome Trust Centre for Human Genetics.
PBG 650 Advanced Plant Breeding Module 1: Introduction Population Genetics – Hardy Weinberg Equilibrium – Linkage Disequilibrium.
Genetic Linkage. Two pops may have the same allele frequencies but different chromosome frequencies.
Experimental Design and Data Structure Supplement to Lecture 8 Fall
QTL Mapping in Heterogeneous Stocks Talbot et al, Nature Genetics (1999) 21: Mott et at, PNAS (2000) 97:
INTRODUCTION TO ASSOCIATION MAPPING
February 20, 2002 UD, Newark, DE SNPs, Haplotypes, Alleles.
The International Consortium. The International HapMap Project.
Genomics of Adaptation
Moukoumbi, Y. D1. , R. Yunus2, N. Yao3, M. Gedil1, L. Omoigui1 and O
Common variation, GWAS & PLINK
Genetic Linkage.
Comparative mapping of the Oregon Wolfe Barley using doubled haploid lines derived from female and male gametes L. Cistue, A. Cuesta-Marcos, S. Chao, B.
MULTIPLE GENES AND QUANTITATIVE TRAITS
Population Genetics As we all have an interest in genomic epidemiology we are likely all either in the process of sampling and ananlysising genetic data.
Signatures of Selection
upstream vs. ORF binding and gene expression?
From: Will genomic selection be a practical method for plant breeding?
Genome Wide Association Studies using SNP
High-resolution haplotype structure in the human genome
Genetic Linkage.
Quantitative Traits in Populations
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS)
Washington State University
Patterns of Linkage Disequilibrium in the Human Genome
Power to detect QTL Association
Mapping Quantitative Trait Loci
MULTIPLE GENES AND QUANTITATIVE TRAITS
Genome-wide Associations
The ‘V’ in the Tajima D equation is:
What are BLUP? and why they are useful?
Detecting variance-controlling QTL
Lecture 10: QTL Mapping II: Outbred Populations
Genetic Drift, followed by selection can cause linkage disequilibrium
Genetic Linkage.
Linkage analysis and genetic mapping
Washington State University
Linkage Genes that are physically located on the same chromosome are said to be “linked”. Linked genes are said to be “mapped” to the same chromosome.
Barley (Hordeum vulgare subsp. vulgare)
Medical genomics BI420 Department of Biology, Boston College
Lecture 9: QTL Mapping II: Outbred Populations
Gene mapping March 3, 2017.
Medical genomics BI420 Department of Biology, Boston College
Cancer as a Complex Genetic Trait
Presentation transcript:

Genome-wide Association Studies A population-based survey to identify non-random associations between phenotypes and genetic markers across the genome Does not rely on linkage analysis or trace the inheritance of traits and markers from a cross Relies on historic linkage disequilibrium between genetic markers and QTL Also called Association mapping Linkage disequilibrium mapping

Advantages of GWAS approach More opportunity for recombination than in a biparental mapping population Fine mapping of QTL Validate candidate genes Determine which polymorphisms within a gene determine different phenotypes Surveys a broader gene pool More than two individuals are represented Identify multiple alleles for QTL Evaluate effects of QTL in diverse genetic backgrounds No need to create mapping populations for linkage analysis New possibilities for QTL analysis in species with a long generation time where controlled crossing is difficult

Higher resolution maps with GWAS Source: Conifer Genomics Learning Modules (Modified from Rafalski (2002), COPB 5: 94-100)

Disadvantages of GWAS approach Not a controlled experiment! Risk of false positives due to population structure Results will be confounded by any background LD in the population that is not due to close linkage It is critical to either confirm that the background LD is negligible or use statistical approaches to adjust for it Need to know extent and structure of LD in order to identify best association mapping strategy Power to detect QTL is unpredictable Ideally… LD has decayed to a large extent in the population as a whole and over fairly small map distances Adequate LD still exists between marker loci and closely linked QTL

Steps in association mapping Select an association panel Measure phenotypes Genotype the panel Quantify extent of linkage disequilibrium Assess population structure Estimate kinship Apply appropriate statistical model to detect associations between markers and traits

LD decay Extent of LD “decays” as the distance between markers increases Can also think of “decay” as distance along a chromosome D is the covariance between alleles at different loci Can consider r2 to be the square of the correlation coefficient r2 r2 0.2 10 kb 100 kb distance between markers

Low LD requires high marker density High LD Low density Low LD Low density Low LD High density High power to detect QTL High resolution of QTL

Extent of LD in barley Wild barley: LD decays within a gene Landraces: ~ 90 kb European germplasm - significant LD: mean 3.9 cM, median 1.16 cM, maximum >60 cM Modern European barley Landraces (ICARDA) Wild barley

Population Structure Population structure may arise from various causes geographic isolation selection breeding history Population structure may cause false positive associations between genotypes and phenotypes Methods to account for populations structure Genomic control (GC) Structured Association (SA) Software: Structure 2.3.4 Principle Component Analysis (PCA)

Population Structure Many individuals will not belong uniquely to one subpopulation, but will be the descendents of crosses between two or more ancestral populations Estimates the proportion of ancestry attributable to each population for each individual

Slide courtesy of Alfonso Cuesta-Marcos Marker Distance Line 1 Line 2 Line 3 Line 4 Line 5 Line 6 Line 7 Line 8 Line 9 Line 10 Line 11 Line 12 Line 13 Line 14 Line 15 Line 16 _3_0363_ A B _1_1061_ 0.8 _3_0703_ 1.5 _1_1505_ _1_0498_ _2_1005_ 3.8 _1_1054_ _2_0674_ 6 _1_0297_ 8.8 _1_0638_ 10.7 _1_1302_ 11.4 _1_0422_ _2_0929_ 15.3 _3_1474_ 15.4 _1_1522_ 17.3 _2_1388_ _3_0259_ 18.1 _1_0325_ _2_0602_ 20.8 _1_0733_ 23.9 _2_0729 _1_1272_ _2_0891_ 26.1 _2_0748_ 26.6 _3_0251_ 27.4 _1_0997_ 35.5 _1_1133_ 41.8 _2_0500_ 42.5 _3_0634_ 43.3 10 Desease severity 5 Slide courtesy of Alfonso Cuesta-Marcos

Q + K model Y = Xß + S + Qv + Zu + e random effects Mixed Model – includes fixed and random effects random effects Y = Xß + S + Qv + Zu + e Y is the individual observations of the phenotype Xß includes fixed effects: population means, environments S includes marker allele effects (fixed) Q is a subpopulation incidence matrix (adjusts for structure) v is a matrix of estimates of subpopulation mean effects (fixed) Zu represents degree of relatedness not captured by population structure (adjusts for kinship) u is the polygenic effect generated by other loci that are unlinked to the one being tested Yu et al. (2006) Nature Genetics 38: 203-208

Linkage analysis + association mapping Can we combine the benefits of both approaches? Nested Association Mapping (NAM) Method Crossed 25 diverse inbreds to a common inbred B73 Derived recombinant inbred lines from each cross Pros and Cons Diverse and representative High power to detect QTL High resolution of QTL A lot of work!!! Yu, et al. (2008) Genetics McMullen, et al. (2009) Science

Linkage analysis + association mapping Multi-parent Advanced Generation Intercrosses (MAGIC) (A) Select founders (C) Intercross individuals across funnels (B) Make defined crosses (funnels) (D) Self or create double haploids Huang et al. (2015) Theor Appl Genet 128:999-1017