Regression-based linkage analysis

Slides:



Advertisements
Similar presentations
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Advertisements

Genetic research designs in the real world Vishwajit L Nimgaonkar MD, PhD University of Pittsburgh
BA 275 Quantitative Business Methods
Meta-analysis for GWAS BST775 Fall DEMO Replication Criteria for a successful GWAS P
. Parametric and Non-Parametric analysis of complex diseases Lecture #6 Based on: Chapter 25 & 26 in Terwilliger and Ott’s Handbook of Human Genetic Linkage.
Linkage Analysis: An Introduction Pak Sham Twin Workshop 2001.
Linkage analysis: basic principles Manuel Ferreira & Pak Sham Boulder Advanced Course 2005.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
Estimating “Heritability” using Genetic Data David Evans University of Queensland.
Power in QTL linkage: single and multilocus analysis Shaun Purcell 1,2 & Pak Sham 1 1 SGDP, IoP, London, UK 2 Whitehead Institute, MIT, Cambridge, MA,
Quantitative Genetics
Biometrical Genetics Pak Sham & Shaun Purcell Twin Workshop, March 2002.
Statistical Power Calculations Boulder, 2007 Manuel AR Ferreira Massachusetts General Hospital Harvard Medical School Boston.
Shaun Purcell & Pak Sham Advanced Workshop Boulder, CO, 2003
Copy the folder… Faculty/Sarah/Tues_merlin to the C Drive C:/Tues_merlin.
Chuanyu Sun Paul VanRaden National Association of Animal breeders, USA Animal Improvement Programs Laboratory, USA Increasing long term response by selecting.
Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:
Introduction to QTL analysis Peter Visscher University of Edinburgh
Introduction to Linkage Analysis Pak Sham Twin Workshop 2003.
Gene Mapping Quantitative Traits using IBD sharing References: Introduction to Quantitative Genetics, by D.S. Falconer and T. F.C. Mackay (1996) Longman.
Linkage in selected samples Manuel Ferreira QIMR Boulder Advanced Course 2005.
Permutation Analysis Benjamin Neale, Michael Neale, Manuel Ferreira.
Type 1 Error and Power Calculation for Association Analysis Pak Sham & Shaun Purcell Advanced Workshop Boulder, CO, 2005.
Quantitative Genetics
Power of linkage analysis Egmond, 2006 Manuel AR Ferreira Massachusetts General Hospital Harvard Medical School Boston.
Regression-Based Linkage Analysis of General Pedigrees Pak Sham, Shaun Purcell, Stacey Cherny, Gonçalo Abecasis.
Tutorial #10 by Ma’ayan Fishelson. Classical Method of Linkage Analysis The classical method was parametric linkage analysis  the Lod-score method. This.
Future Directions Pak Sham, HKU Boulder Genetics of Complex Traits Quantitative GeneticsGene Mapping Functional Genomics.
A Statistical Method for Adjusting Covariates in Linkage Analysis With Sib Pairs Colin O. Wu, Gang Zheng, JingPing Lin, Eric Leifer and Dean Follmann Office.
Lecture 21: Quantitative Traits I Date: 11/05/02  Review: covariance, regression, etc  Introduction to quantitative genetics.
Errors in Genetic Data Gonçalo Abecasis. Errors in Genetic Data Pedigree Errors Genotyping Errors Phenotyping Errors.
Genetic Theory Pak Sham SGDP, IoP, London, UK. Theory Model Data Inference Experiment Formulation Interpretation.
Epistasis / Multi-locus Modelling Shaun Purcell, Pak Sham SGDP, IoP, London, UK.
Practical With Merlin Gonçalo Abecasis. MERLIN Website Reference FAQ Source.
Lecture 22: Quantitative Traits II
Lecture 23: Quantitative Traits III Date: 11/12/02  Single locus backcross regression  Single locus backcross likelihood  F2 – regression, likelihood,
Powerful Regression-based Quantitative Trait Linkage Analysis of General Pedigrees Pak Sham, Shaun Purcell, Stacey Cherny, Gonçalo Abecasis.
David M. Evans Multivariate QTL Linkage Analysis Queensland Institute of Medical Research Brisbane Australia Twin Workshop Boulder 2003.
Introduction to Genetic Theory
A simple method to localise pleiotropic QTL using univariate linkage analyses of correlated traits Manuel Ferreira Peter Visscher Nick Martin David Duffy.
Efficient calculation of empirical p- values for genome wide linkage through weighted mixtures Sarah E Medland, Eric J Schmitt, Bradley T Webb, Po-Hsiu.
Topics, Summer 2008 Day 1. Introduction Day 2. Samples and populations Day 3. Evaluating relationships Scatterplots and correlation Day 4. Regression and.
Lecture 17: Model-Free Linkage Analysis Date: 10/17/02  IBD and IBS  IBD and linkage  Fully Informative Sib Pair Analysis  Sib Pair Analysis with Missing.
Power in QTL linkage analysis
Regression Models for Linkage: Merlin Regress
PBG 650 Advanced Plant Breeding
upstream vs. ORF binding and gene expression?
Genome Wide Association Studies using SNP
Can resemblance (e.g. correlations) between sib pairs, or DZ twins, be modeled as a function of DNA marker sharing at a particular chromosomal location?
A few Simple Applications of Index Selection Theory
I Have the Power in QTL linkage: single and multilocus analysis
Linkage in Selected Samples
Linear Regression.
Power to detect QTL Association
Mapping Quantitative Trait Loci
ACE Problems Greg Carey BGA, 2009 Minneapolis, MN.
Correlation for a pair of relatives
Why general modeling framework?
Pak Sham & Shaun Purcell Twin Workshop, March 2002
What are BLUP? and why they are useful?
OVERVIEW OF LINEAR MODELS
Linkage analysis and genetic mapping
Lecture 9: QTL Mapping II: Outbred Populations
Univariate Linkage in Mx
Stephen C. Pratt, Mark J. Daly, Leonid Kruglyak 
William F. Forrest, Eleanor Feingold 
Power Calculation for QTL Association
David B. Allison, Bonnie Thiel, Pamela St. Jean, Robert C
Presentation transcript:

Regression-based linkage analysis Shaun Purcell, Pak Sham.

(HE-SD) (X – Y)2 = 2(1 – r) – 2Q( – 0.5) +  Penrose (1938) quantitative trait locus linkage for sib pair data Simple regression-based method squared pair trait difference proportion of alleles shared identical by descent (HE-SD) (X – Y)2 = 2(1 – r) – 2Q( – 0.5) +  ^

Haseman-Elston regression  = -2Q (X - Y)2 IBD 1 2

Expected sibpair allele sharing + - Sib 2 Sib 1

Squared differences (SD) - - + + - + + + + - - - Sib 2 Sib 1

Sums versus differences Wright (1997), Drigalenko (1998) phenotypic difference discards sib-pair QTL linkage information squared pair trait sum provides extra information for linkage independent of information from HE-SD (HE-SS) (X + Y)2 = 2(1 + r) + 2Q( – 0.5) +  ^

Squared sums (SS) + - Sib 2 Sib 1

SD and SS - - + + - + + + + - - - Sib 2 Sib 1

New dependent variable to increase power mean corrected cross-product (HE-CP) other extensions > 2 sibs in a sibship multiple trait loci and epistasis multivariate multiple markers binary traits other relative classes

SD + SS ( = CP) - - + + - + + + + - - - Sib 2 Sib 1

Xu et al With  residual sibling correlation HE-CP HE-CP  in power, HE-SD  in power HE-CP Propose a weighting scheme

Variance of SD

Variance of SS

Low sibling correlation

Increased sibling correlation

Clarify the relative efficiencies of existing HE methods Demonstrate equivalence between a new HE method and variance components methods Show application to the selection and analysis of extreme, selected samples

Haseman-Elston regressions HE-SD (X – Y)2 = 2(1 – r) – 2Q( – 0.5) +  HE-SS (X + Y)2 = 2(1 + r) + 2Q( – 0.5) +  HE-CP XY = r + Q( – 0.5) + 

NCPs for H-E regressions Variance of Dependent NCP per sibpair (X – Y)2 (X + Y)2 XY Dependent

Weighted H-E Squared-sums and squared-differences Optimal weighting orthogonal components in the population Optimal weighting inverse of their variances

Weighted H-E A function of Equivalent to variance components square of QTL variance marker informativeness complete information = 0.0125 sibling correlation Equivalent to variance components to second-order approximation Rijsdijk et al (2000)

Combining into one regression New dependent variable : a linear combination of squared-sum and squared-difference weighted by the population sibling correlation:

HE-COM - - + + - + + - - - Sib 2 Sib 1

Simulation Single QTL simulated Residual variance 10,000 sibling pairs accounts for 10% of trait variance 2 equifrequent alleles; additive gene action assume complete IBD information at QTL Residual variance shared and nonshared components residual sibling correlation : 0 to 0.5 10,000 sibling pairs 100 replicates 1000 under the null

Unselected samples

Sample selection A sib-pairs’ squared mean-corrected DV is proportional to its expected NCP Equivalent to variance-components based selection scheme Purcell et al, (2000)

Sample selection Sibship NCP 1.6 1.4 1.2 0.8 0.6 0.4 0.2 -4 -3 -2 -1 1 2 3 4 Sib 1 trait Sib 2 trait 0.2 0.4 0.6 0.8 1.2 1.4 1.6 Sibship NCP

Analysis of selected samples 500 (5%) most informative pairs selected r = 0.05 r = 0.60

Selected samples : H0

Selected samples : HA

Variance-based weighting scheme SD and SS weighted in proportion to the inverse of their variances Implemented as an iterative estimation procedure loses simple regression-based framework

Product of pair values corrected for the family mean for sibs 1 and 2 from the j th family, Adjustment for high shared residual variance For pairs, reduces to HE-SD

Conclusions Advantages Future directions Efficient Robust Easy to implement Future directions Weight by marker informativeness Extension to general pedigrees

The End