Lecture 21: Quantitative Traits I Date: 11/05/02  Review: covariance, regression, etc  Introduction to quantitative genetics.

Slides:



Advertisements
Similar presentations
Multiple Regression Analysis
Advertisements

Kin 304 Regression Linear Regression Least Sum of Squares
The Simple Regression Model
Chap 12-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 12 Simple Regression Statistics for Business and Economics 6.
Qualitative and Quantitative traits
Chapter 6: Quantitative traits, breeding value and heritability Quantitative traits Phenotypic and genotypic values Breeding value Dominance deviation.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Simple Linear Regression. Start by exploring the data Construct a scatterplot  Does a linear relationship between variables exist?  Is the relationship.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Simple Linear Regression. G. Baker, Department of Statistics University of South Carolina; Slide 2 Relationship Between Two Quantitative Variables If.
Linear regression models
Quantitative Genetics Up until now, we have dealt with characters (actually genotypes) controlled by a single locus, with only two alleles: Discrete Variation.
Quantitative Genetics Theoretical justification Estimation of heritability –Family studies –Response to selection –Inbred strain comparisons Quantitative.
The Simple Linear Regression Model: Specification and Estimation
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
Chapter 10 Simple Regression.
Chapter 12 Simple Regression
Quantitative Genetics
Chapter 4 Multiple Regression.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Linear Regression and Correlation Analysis
Chapter 11 Multiple Regression.
The Simple Regression Model
Ch. 14: The Multiple Regression Model building
Quantitative Genetics
Lecture 2: Basic Population and Quantitative Genetics.
Correlation and Regression Analysis
Linear Regression/Correlation
Lecture 5 Correlation and Regression
PBG 650 Advanced Plant Breeding
Regression and Correlation Methods Judy Zhong Ph.D.
Introduction to Linear Regression and Correlation Analysis
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
3.1 Ch. 3 Simple Linear Regression 1.To estimate relationships among economic variables, such as y = f(x) or c = f(i) 2.To test hypotheses about these.
Inference for regression - Simple linear regression
Chapter 11 Simple Regression
Lecture 5: Segregation Analysis I Date: 9/10/02  Counting number of genotypes, mating types  Segregation analysis: dominant, codominant, estimating segregation.
Quantitative Trait Loci, QTL An introduction to quantitative genetics and common methods for mapping of loci underlying continuous traits:
Introduction to Regression Analysis. Two Purposes Explanation –Explain (or account for) the variance in a variable (e.g., explain why children’s test.
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Announcements -Pick up Problem Set 3 from the front -Ave SD- 4.1 Group work -Amy is lecturing on Thursday on Methods in Quantitative Genetics -Test.
Correlation and Regression Used when we are interested in the relationship between two variables. NOT the differences between means or medians of different.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
1 Multiple Regression Analysis y =  0 +  1 x 1 +  2 x  k x k + u.
Copyright ©2011 Brooks/Cole, Cengage Learning Inference about Simple Regression Chapter 14 1.
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
The Simple Linear Regression Model: Specification and Estimation ECON 4550 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s.
Lecture 24: Quantitative Traits IV Date: 11/14/02  Sources of genetic variation additive dominance epistatic.
Lecture 1: Basic Statistical Tools. A random variable (RV) = outcome (realization) not a set value, but rather drawn from some probability distribution.
Lecture 22: Quantitative Traits II
Lecture 23: Quantitative Traits III Date: 11/12/02  Single locus backcross regression  Single locus backcross likelihood  F2 – regression, likelihood,
1 G Lect 3M Regression line review Estimating regression coefficients from moments Marginal variance Two predictors: Example 1 Multiple regression.
Quantitative Genetics as it Relates to Plant Breeding PLS 664 Spring 2011 D. Van Sanford.
The “Big Picture” (from Heath 1995). Simple Linear Regression.
Inference about the slope parameter and correlation
The simple linear regression model and parameter estimation
Simple Linear Regression
PBG 650 Advanced Plant Breeding
Genotypic value is not transferred from parent to
The Simple Regression Model
Genotypic value is not transferred from parent to
CHAPTER 29: Multiple Regression*
Lecture 2: Fisher’s Variance Decomposition
The Simple Linear Regression Model: Specification and Estimation
Simple Linear Regression
Pi = Gi + Ei Pi = pi - p Gi = gi - g Ei = ei - e _ _ _ Phenotype
Genotypic value is not transferred from parent to
Power Calculation for QTL Association
Presentation transcript:

Lecture 21: Quantitative Traits I Date: 11/05/02  Review: covariance, regression, etc  Introduction to quantitative genetics

Joint Density  Suppose you have two random variables X and Y. For each entity in the sample you take from the population you obtain a random pair (X i, Y i ).  The joint density function for two random variables is given by p(x,y) such that

Conditional Density  The conditional density function for the random variable Y conditional on a particular realization x of the random variable X is p(y|x).  The joint and conditional densities are related: p(x,y) = p(y|x)p(x), where p(x) is the marginal density

Independent Random Variables  If the random variables X and Y are independent, then p(x,y) = p(x)p(y).  As a consequence, we also have p(y|x) = p(y) and p(x|y) = p(x).  If X and Y are dependent, then the relationship between then is either linear or nonlinear. In either case, a linear relationship is a first approximation to the true relationship.

Conditional Expectation  The expectation of the product is given by  The conditional expectation is given by  If X and Y are independent, then

Covariance  Definition: The covariance of two random variables is defined as  and an covariance estimator is given by

Meaning of Covariance  The sign of the covariance implies something about how Y responds to changes in X or vice versa.

Regression and History  The use of a linear function as a first approximation of the relationship between two random variables is termed regression.  In fact, regression was first introduced in a genetics context.  Galton (1889) studied the average height of parents X and the height of offspring Y.

Least Squares Method  The least squares method finds estimates a and b for the coefficients of the assumed linear relationship by minimizing the mean squared error.  Because means, variances and covariances are available from phenotypic data, these estimates are particularly useful.

Homoscedasticity and Heteroscedasticity  Definition: If the variance in the residual error is constant regardless of the dependent variable x, then E(Y|X) is homoscedastic.  Definition: There is heteroscedasticity in the data if the residual variance depends on the value of the dependent variable x.  Transformations exist to achieve homoscedasticity.

Example: Regression  Suppose Cov(X,Y) = 10, Var(X) = 10, Var(Y) = 15 and the means E(X) = E(Y) = 0.  Regress X on Y and Y on X.  Then the intercept estimates is  and the slope estimates are

Correlation  Definition: The correlation of two random variables X and Y is defined as  with estimator  Correlations are scale independent:

 Correlations imply linear association.  r is the standardized regression coefficient that is obtained if x and y are scaled to have unit variance.  r 2 measures the proportion of the Var(Y) that is explained if E(Y|X) is linear. Correlation (cont)

Example: rats

Quantitative Traits  Definition: A quantitative trait is one with a continuous distribution. In other words, it is a trait that is measured not counted.  It is assumed that quantitative traits are controlled by many genes, each with small effect. Environmental effects are also important.  Definition: A quantitative trait locus (QTL) is locus controlling a quantitative trait.

Quantitative Trait – A Model  We start with a very simple scenario. Suppose there is one locus determining a quantitative trait. Suppose that there are only two alleles at this locus. We seek a model for this scenario. This model will have two parameters to account for the two degrees of freedom (when location is removed) among the 3 possible outcomes (genotypes).

 Let the phenotypic value of a particular genotype be z. When environment has an effect, z is a consequence of both the underlying genotype and the environment. We can write z = G + E. Here G is the genotypic value and it is the expected phenotypic value averaged over all environments.  Each of the three genotypes has an associated genotypic value. Phenotypic and Genotypic Value

Quantitative Trait – Model A 0(1+k)a2a2a

Quantitative Trait – Model B -a-ada

Model A – Parameter Meanings 0(1+k)a2a2a Value of kGenetic Interpretation 0 1 >1overdominance <-1underdominance

Example – Scaling Quantitative Trait  The Booroola (B) gene influences fecundity in Merino sheep. GenotypeMean Litter Size BB2.66 Bb2.17 bb1.48

Gene Content  Definition: The B 1 gene content of a genotype is the number of copies of the B 1 allele. The gene content for allele B 1 in genotype B 1 B 2 is 1.  At a single locus, the genotypic value is not a linear relationship on gene content, unless k=0. 0 a 2a2a (1+k)a 2a2a

Partitioning Genotypic Value  Let N 1 be the number of B 1 alleles in the genotype.  Let N 2 be the number of B 2 alleles in the genotype.  Then, multiply regress the genotypic value on independent variables N 1 and N 2.  Assume again only two alleles, then N 1 = 2 – N 2. Call N 2 = N.

Predicted Genotypic Values  The predicted genotypic values are given as

Weighted Mean of  ’s is 0

Slope of Regression Line  Recalling the formula for the slope of a regression line, we have  We will now find expressions for the covariance and variance.

Derivation of Slope

Average Effect of Allelic Substitution  The previous derivations were completed under the assumption of random mating and HWE.  The slope  is the change in genotypic value associated with the addition of one more allele. To add one more B 2 allele, one must replace another B 1 allele with B 2, so it is also called the average effect of allelic substitution.  Except under additivity (k=0), this substitution effect can only be defined in terms of the population.

Partitioning Genetic Variance  Because we now have a linear function for genotypic value G  we can write the total genetic variance as  but there is no covariance term.

Additive and Dominance Components  The first term is additive genetic variance: the amount of variance of G that is explained by regression on N.  The second term is dominance genetic variance: the residual variance for the regression.  We seek an expression for both terms.

Derivation of Slope GenotypeB1B1B1B1 B1B2B1B2 B2B2B2B2 N012 G0(1+k)a2a2a Freq.* GN0(1+k)a4a4a N2N2 014 

Derivation of Genetic Variance Components

Genetic Variance Components  Both components depend on gene frequencies (conditional on population from which they are derived).  When k=0 (purely additive effects), then additive genetic variance is maximized when heterozygosity is maximized.  With dominance k>0, additive genetic variance is maximized at higher frequencies of the recessive allele. Rare recessive alleles cause little genetic variance because they are not often expressed.

Why?  Why have we partitioned the genotypic value into additive and dominance components?  When a parent transmits alleles to the offspring, the dominance deviation in the parent is irrelevant because only one gamete is transmitted.  May think of additive genetic component as the heritable component of an individual’s genotypic value.

Average Excess  There are multiple ways to measure the effect of an allele. The effect of allelic substitution  is one. The additive effect  i is another.  Definition: The average excess of an allele is the difference between the mean genotypic value of individuals carrying at least one copy of the allele and the mean genotypic value of a random individual form the entire population.

Average Excess with Random Mating

Breeding Value  Definition: The breeding value of an individual is the sum of the additive effects. GenotypeBreeding Value B1B1B1B1 2121 B1B2B1B2 1+21+2 B2B2B2B2 22

Breeding Value and Random Mating  Consider the expected genotypic values of progeny produced by parental genotypes. GenotypeBreeding Value Progeny Expected Genotypic Value Deviation B1B1B1B1 2121 ap 2 (1+k) 11 B1B2B1B2 1+21+2 a[p 2 + (1+k)/2]  1 +  2 )/2 B2B2B2B2 22 a[2p 2 + p 1 (1+k)] 22

How to Use this Analysis  A common approach today.  Identify candidate loci that are potential contributors to the variation of the trait of interest.  Genotype a random selection of individuals are identified by molecular markers.  Determine average phenotypic values within each genotypic class.  Estimate fraction of total phenotypic variation associated with candidate locus.

An Example  Consider the Booroola gene example in two random mating populations where the gene B is present at gene frequencies 0.5. BBBbbb G ij P ij

Booroola (cont) ValueEstimate Mean genotypic value Additive Effects  B =  b = Breeding ValuesA BB = 0.59; A Bb = 0; A bb = 0.59 Dominance Deviations D BB = -0.05; D Bb = 0.05; D bb = -0.05

Booroola (cont) ValueEstimate Additive Genetic Variance Dominance Genetic Variance  Total Genetic Variance0.1765