Bivariate Normal Distribution and Regression

Slides:



Advertisements
Similar presentations
Derivations of Student’s-T and the F Distributions
Advertisements

Bivariate Normal Distribution and Regression Application to Galton’s Heights of Adult Children and Parents Sources: Galton, Francis (1889). Natural Inheritance,
Percentiles and the Normal Curve
1 The Vision Thing Power Thirteen Bivariate Normal Distribution.
The Standard Normal Curve Revisited. Can you place where you are on a normal distribution at certain percentiles? 50 th percentile? Z = 0 84 th percentile?
Calculations in the Bivariate Normal Distribution James H. Steiger.
The Law of Errors Suppose we measure the weight of a grain of rice..025 grams.033 grams We take lost of measurements of the same rice grain. What is the.
Normal distribution. An example from class HEIGHTS OF MOTHERS CLASS LIMITS(in.)FREQUENCY
Regression to Mediocrity Galton and the origins of Regression.
1 Regression Analysis Modeling Relationships. 2 Regression Analysis Regression Analysis is a study of the relationship between a set of independent variables.
Bivariate Regression CJ 526 Statistical Analysis in Criminal Justice.
Continuous Random Variables Chap. 12. COMP 5340/6340 Continuous Random Variables2 Preamble Continuous probability distribution are not related to specific.
Today Today: Chapter 5 Reading: –Chapter 5 (not 5.12) –Exam includes sections from Chapter 5 –Suggested problems: 5.1, 5.2, 5.3, 5.15, 5.25, 5.33,
Mutually Exclusive: P(not A) = 1- P(A) Complement Rule: P(A and B) = 0 P(A or B) = P(A) + P(B) - P(A and B) General Addition Rule: Conditional Probability:
1 The Vision Thing Power Thirteen Bivariate Normal Distribution.
Today Today: More Chapter 5 Reading: –Important Sections in Chapter 5: Only material covered in class Note we have not, and will not cover moment/probability.
Random Variables and Probability Distributions
Lecture II-2: Probability Review
5-1 Two Discrete Random Variables Example Two Discrete Random Variables Figure 5-1 Joint probability distribution of X and Y in Example 5-1.
Multivariate Probability Distributions. Multivariate Random Variables In many settings, we are interested in 2 or more characteristics observed in experiments.
5-1 Two Discrete Random Variables Example Two Discrete Random Variables Figure 5-1 Joint probability distribution of X and Y in Example 5-1.
Joint Probability Distributions
Intelligence. Evolution of the Concept of Intelligence Chinese tests for civil service. Chinese tests for civil service. Galton (mid 1800s)—speed of sensory.
Sampling Distributions  A statistic is random in value … it changes from sample to sample.  The probability distribution of a statistic is called a sampling.
Statistics 359a Regression Analysis. Necessary Background Knowledge - Statistics expectations of sums variances of sums distributions of sums of normal.
What factors are most responsible for height?
Probabilities Probability Distribution Predictor Variables Prior Information New Data Prior and New Data Overview.
Conditional Expectations Bivariate Probability Distributions Discrete Case - Television Sales Data Data Source: Benson (1966): “Analysis of Irregular Two-Dimensional.
Warm-Up If the variance of a set of data is 12.4, what is the standard deviation? If the standard deviation of a set of data is 5.7, what is the variance?
Understanding growth and puberty using the RCPCH UK 2-18 growth charts
1 G Lect 4a G Lecture 4a f(X) of special interest: Normal Distribution Are These Random Variables Normally Distributed? Probability Statements.
1 G Lect 2M Examples of Correlation Random variables and manipulated variables Thinking about joint distributions Thinking about marginal distributions:
Statistics for Business & Economics
STA291 Statistical Methods Lecture LINEar Association o r measures “closeness” of data to the “best” line. What line is that? And best in what terms.
The Normal Distribution Section 8.2. The Galton Board Developed in the late 19 th century by Sir Francis Galton, a cousin of Charles Darwin Theorized.
Overview of our study of the multiple linear regression model Regression models with more than one slope parameter.
Math 4030 – 6a Joint Distributions (Discrete)
Research Question What determines a person’s height?
SAARF ® Lifestages - AMPS 2014BA (July 2013 – June 2014)
1 Probability and Statistical Inference (9th Edition) Chapter 4 Bivariate Distributions November 4, 2015.
What factors are most responsible for height?. Model Specification ERROR??? measurement error model error analysis unexplained unknown unaccounted for.
Outline Research Question: What determines height? Data Input Look at One Variable Compare Two Variables Children’s Height and Parents Height Children’s.
 The heights of 16-year-old males are normally distributed with mean 68 inches and a standard deviation 2 inches. Determine the z-score for: ◦ 70.
Welcome to Applied Multiple Regression (Psych 308c)! Spring 2013 MI Professor Dale Berger ACB101 Tue Teaching.
The Nature of Econometrics Tools of Using Econometrics.
Probability Distributions
Probability Distributions
Biodiversity and Variation
U6 - DAY 2 WARM UP! Given the times required for a group of students to complete the physical fitness obstacle course result in a normal curve, and that.
AP Statistics Empirical Rule.
Bivariate Normal Distribution and Regression
Central Tendency Central Tendency – measures of location for a distribution Mode – the commonly occurring number in a data set Median – the middle score.
Understanding growth and puberty using the RCPCH UK 2-18 growth charts
OVERVIEW OF BAYESIAN INFERENCE: PART 1
Using a histogram to estimate the median
Using a histogram to estimate the median
U6 - DAY 2 WARM UP! Given the times required for a group of students to complete the physical fitness obstacle course result in a normal curve, and that.
Warmup What is the height? What proportion of data is more than 5?
Conditional Expectations
Warm Up – Tuesday The table to the left shows
Bivariate Normal Distribution and Regression
Figure 4-1 (p.104) The statistical model for defining abnormal behavior. The distribution of behavior scores for the entire population is divided into.
Normal Curve 10.3 Z-scores.
Prep your notes for Friday’s Chapter 3 Test
Random Variables and Probability Distributions
12/12/ A Binomial Random Variables.
Ch. 8.5 Variation Functions
Continuous Probability Distributions Solved Problems
Children below the second height centile: effect of adjustment for regression to the mean on difference between actual height standard deviation score.
Conditional Expectations
Presentation transcript:

Bivariate Normal Distribution and Regression Application to Galton’s Heights of Adult Children and Parents Sources: Galton, Francis (1889). Natural Inheritance, MacMillan, London. Galton, F.; J.D. Hamilton Dickson (1886). “Family Likeness in Stature”, Proceedings of the Royal Society of London, Vol. 40, pp.42-73.

Data – Heights of Adult Children and Parents Adult Children Heights are reported by inch, in a manner so that the median of the grouped values is used for each (62.2”,…,73.2” are reported by Galton). He adjusts female heights by a multiple of 1.08 We use 61.2” for his “Below” We use 74.2” for his “Above” Mid-Parents Heights are the average of the two parents’ heights (after female adjusted). Grouped values at median (64.5”,…,72.5” by Galton) We use 63.5” for “Below” We use 73.5” for “Above”

Joint Density Function m1=m2=0 s1=s2=1 r=0.4

Marginal Distribution of Y1 (P. 1)

Marginal Distribution of Y1 (P. 2)

Conditional Distribution of Y2 Given Y1=y1 (P. 1)

Conditional Distribution of Y2 Given Y1=y1 (P. 2) This is referred to as the REGRESSION of Y2 on Y1

Summary of Results

Heights of Adult Children and Parents Empirical Data Based on 924 pairs (F. Galton) Y2 = Adult Child’s Height Y2 ~ N(68.1,6.39) s2=2.53 Y1 = Mid-Parent’s Height Y1 ~ N(68.3,3.18) s1=1.78 COV(Y1,Y2) = 2.02  r = 0.45, r2 = 0.20 Y2|Y1=y1 is Normal with conditional mean and variance: y1 Unconditional 63.5 66.5 69.5 72.5 E[Y2|y1] 68.1 65.0 66.9 68.8 70.8 sY2|y1 2.53 2.26

E(Child)= Parent+constant Galton’s Finding E(Child) independent of parent

Expectations and Variances E(Y1) = 68.3 V(Y1) = 3.18 E(Y2) = 68.1 V(Y2) = 6.39 E(Y2|Y1=y1) = 24.5+0.638y1 EY1[E(Y2|Y1=y1)] = EY1[24.5+0.638Y1] = 24.5+0.638(68.3) = 68.1 = E(Y2) V(Y2|Y1=y1) = 5.11  EY1[V(Y2|Y1=y1)] = 5.11 VY1[E(Y2|Y1=y1)] = VY1[24.5+0.638Y1] = (0.638)2 V(Y1) = (0.407)3.18 = 1.29 EY1[V(Y2|Y1=y1)]+VY1[E(Y2|Y1=y1)] = 5.11+1.29=6.40 = V(Y2) (with round-off)