Identifying QTLs in experimental crosses

Slides:



Advertisements
Similar presentations
The genetic dissection of complex traits
Advertisements

Chapter 9: Simple Regression Continued
Managerial Economics in a Global Economy
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Introduction Materials and methods SUBJECTS : Balb/cJ and C57BL/6J inbred mouse strains, and inbred fruit fly strains number 11 and 70 from the recombinant.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Multiple Comparisons Measures of LD Jess Paulus, ScD January 29, 2013.
QTL Mapping R. M. Sundaram.
1 QTL mapping in mice Lecture 10, Statistics 246 February 24, 2004.
Statistical association of genotype and phenotype.
Mapping a disease locus Fig. 11.A A1D A2d A1d d A2d A1 A2.
Lecture 9: QTL Mapping I:
Simulation/theory With modest marker spacing in a human study, LOD of 3 is 9% likely to be a false positive.
Quantitative Genetics
2050 VLSB. Dad phase unknown A1 A2 0.5 (total # meioses) Odds = 1/2[(1-r) n r k ]+ 1/2[(1-r) n r k ]odds ratio What single r value best explains the data?
Topic 3: Regression.
Karl W Broman Department of Biostatistics Johns Hopkins University Gene mapping in model organisms.
Simple Linear Regression Analysis
General Linear Model & Classical Inference
QTL mapping in animals. It works QTL mapping in animals It works It’s cheap.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Quantitative Genetics. Continuous phenotypic variation within populations- not discrete characters Phenotypic variation due to both genetic and environmental.
Complex Traits Most neurobehavioral traits are complex Multifactorial
Quantitative Genetics
Multiple Regression BPS chapter 28 © 2006 W.H. Freeman and Company.
Lecture 4: Statistics Review II Date: 9/5/02  Hypothesis tests: power  Estimation: likelihood, moment estimation, least square  Statistical properties.
ANOVA: Analysis of Variance.
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
Environmental Modeling Basic Testing Methods - Statistics III.
Lecture 24: Quantitative Traits IV Date: 11/14/02  Sources of genetic variation additive dominance epistatic.
Association between genotype and phenotype
Lecture 21: Quantitative Traits I Date: 11/05/02  Review: covariance, regression, etc  Introduction to quantitative genetics.
BayesNCSU QTL II: Yandell © Bayesian Interval Mapping 1.what is Bayes? Bayes theorem? Bayesian QTL mapping Markov chain sampling18-25.
Epistasis / Multi-locus Modelling Shaun Purcell, Pak Sham SGDP, IoP, London, UK.
Research Methods and Data Analysis in Psychology Spring 2015 Kyle Stephenson.
Lecture 22: Quantitative Traits II
Lecture 23: Quantitative Traits III Date: 11/12/02  Single locus backcross regression  Single locus backcross likelihood  F2 – regression, likelihood,
Why you should know about experimental crosses. To save you from embarrassment.
1 QTL mapping in mice, completed. Lecture 12, Statistics 246 March 2, 2004.
Genetic mapping and QTL analysis - JoinMap and QTLNetwork -
Bayesian Variable Selection in Semiparametric Regression Modeling with Applications to Genetic Mappping Fei Zou Department of Biostatistics University.
Power in QTL linkage analysis
Inference about the slope parameter and correlation
PBG 650 Advanced Plant Breeding
upstream vs. ORF binding and gene expression?
General Linear Model & Classical Inference
Model Selection for Multiple QTL
Genome Wide Association Studies using SNP
12 Inferential Analysis.
Relationship between quantitative trait inheritance and
Gene mapping in mice Karl W Broman Department of Biostatistics
I Have the Power in QTL linkage: single and multilocus analysis
Lecture 7: QTL Mapping I:
Statistical issues in QTL mapping in mice
Mapping Quantitative Trait Loci
Statistical Methods for Quantitative Trait Loci (QTL) Mapping II
Genetic architecture of behaviour
Multiple Regression Models
Lecture 4: Testing for Departures from Hardy-Weinberg Equilibrium
Simple Linear Regression
12 Inferential Analysis.
Statistical Analysis and Design of Experiments for Large Data Sets
Simple Linear Regression
Lecture 9: QTL Mapping II: Outbred Populations
Chapter 7 Beyond alleles: Quantitative Genetics
Power Calculation for QTL Association
Quantitative Trait Locus (QTL) Mapping
Presentation transcript:

Identifying QTLs in experimental crosses Karl W. Broman Department of Biostatistics Johns Hopkins School of Public Health kbroman@jhsph.edu http://kbroman.homepage.com 1

F2 intercross 20 80 20 80 A A B B 60 A B 55 40 75 35 2

Distribution of the QT P1 F1 P2 F2 3

Data n (100–1000) F2 progeny yi = phenotype for individual i gij = genotype for indiv i at marker j (AA, AB or BB) Genetic map of the markers Phenotypes of parentals and F1 4

D1M1 5

Single QTL analysis Marker D1M1 Additive effect Dominance deviation Prop'n var explained 6

D2M2 7

Single QTL analysis Marker D2M2 8

Is it real? Hypothesis testing Null hypothesis, H0: no QTL P-value = Pr(LOD > observed | no QTL) Small P (large LOD)  Reject H0 (Good) Large P (small LOD)  Fail to reject H0 (Bad) Generally want P < 0.05 or < 0.01 P  0.049  P  0.051 LOD  3.01  LOD  2.99 9

A picture Distribution of LOD given no QTL P-value = area Observed LOD 10

Multiple testing  We're doing ~ 200 tests (one at each marker; correlated due to linkage)  Imagine the tests were uncorrelated, and that H0 is true (there is no QTL) Toss 200 biased coins Heads  Reject H0 (falsely conclude that there is a QTL) Pr(Heads) = 5% H Ave no. heads in 200 tosses = 10 Pr(at least one head in 200 tosses)  100% 11

A new picture Distribution of max LOD given no QTL P  25% Observed LOD 12

Interval mapping Interpolation between markers (Lander and Botstein 1989) Interpolation between markers At each point, imagine a putative QTL and maximize Pr(data | QTL, parameters) Great for dealing with missing genotype data; important for widely spaced markers 13

Power Power = Pr(Identify a QTL | there is a QTL) Power depends on Sample size Size of QTL effect (relative to resid. var.) Marker density Level of statistical significance Consider Pr(detect a particular locus) Pr(detect at least one locus) 14

Power = 16% h2 = 20% n = 100 Power = 90% h2 = 20% n = 400 Power = 3% 15

Selection bias Dist'n of a given locus identified ^ Dist'n of a LOD < threshold LOD > threshold If the power to detect a particular locus is not super high, its estimated effect (when it is identified) will be biased Power  90%  Bias  2% Power  45%  Bias  20% Power  5%  Bias  100% 16

Multiple QTLs It is often important to consider multiple QTLs simultaneously Increase power by reducing residual variation Separate linked loci Estimate epistatic effects Analysis of single QTL: analysis of variance (ANOVA) or simple linear regression Analysis of multiple QTL: multiple linear regression, possibly with interaction terms; possibly using tree- based models A key issue: Things are more complicated than "Is there a QTL here or not?" 17

An example Full model Additive model Locus 2 Locus 1 Locus 2 Locus 1 18

A tree-based model 34.9 43.8 61.3 BB at Loc 1 BB at Loc 2 Locus 1 19

Summary LOD scores Hypothesis testing Null hypothesis P-values Significance levels Adjustment for multiple tests Power To identify a particular locus To identify at least one locus Selection bias Multiple QTLs Increase power Separate linked loci Estimate epistasis Things get complicated 20