LSRLs: Interpreting r vs. r 2 r – “the correlation coefficient” tells you the strength and direction between two variables (x and y, for example, height.

Slides:



Advertisements
Similar presentations
LSRLs: Interpreting r vs. r2
Advertisements

2nd Day: Bear Example Length (in) Weight (lb)
Scatter Diagrams and Linear Correlation
AP Statistics Mrs Johnson
CHAPTER 3 Describing Relationships
Warm-up with 3.3 Notes on Correlation
Lesson Least-Squares Regression. Knowledge Objectives Explain what is meant by a regression line. Explain what is meant by extrapolation. Explain.
AP STATISTICS LESSON 3 – 3 LEAST – SQUARES REGRESSION.
Warm-up with 3.3 Notes on Correlation Universities use SAT scores in the admissions process because they believe these scores provide some insight into.
Chapter 3 Section 3.1 Examining Relationships. Continue to ask the preliminary questions familiar from Chapter 1 and 2 What individuals do the data describe?
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
3.2 Least-Squares Regression Objectives SWBAT: INTERPRET the slope and y intercept of a least-squares regression line. USE the least-squares regression.
 Chapter 3! 1. UNIT 7 VOCABULARY – CHAPTERS 3 & 14 2.
POD 09/19/ B #5P a)Describe the relationship between speed and pulse as shown in the scatterplot to the right. b)The correlation coefficient, r,
AP Statistics HW: p. 165 #42, 44, 45 Obj: to understand the meaning of r 2 and to use residual plots Do Now: On your calculator select: 2 ND ; 0; DIAGNOSTIC.
Linear Regression Day 1 – (pg )
^ y = a + bx Stats Chapter 5 - Least Squares Regression
Warm-up O Turn in HW – Ch 8 Worksheet O Complete the warm-up that you picked up by the door. (you have 10 minutes)
CHAPTER 3 Describing Relationships
Chapter 5 Lesson 5.2 Summarizing Bivariate Data 5.2: LSRL.
CHAPTER 3 Describing Relationships
Unit 4 LSRL.
LSRL.
Least Squares Regression Line.
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 5 LSRL.
LSRL Least Squares Regression Line
Regression and Residual Plots
AP Stats: 3.3 Least-Squares Regression Line
Least-Squares Regression
Least Squares Regression Line LSRL Chapter 7-continued
EQ: How well does the line fit the data?
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Least-Squares Regression
^ y = a + bx Stats Chapter 5 - Least Squares Regression
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Least-Squares Regression
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 5 LSRL.
Least-Squares Regression
Chapter 3: Describing Relationships
Least-Squares Regression
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Warmup A study was done comparing the number of registered automatic weapons (in thousands) along with the murder rate (in murders per 100,000) for 8.
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
A medical researcher wishes to determine how the dosage (in mg) of a drug affects the heart rate of the patient. Find the correlation coefficient & interpret.
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Presentation transcript:

LSRLs: Interpreting r vs. r 2 r – “the correlation coefficient” tells you the strength and direction between two variables (x and y, for example, height v. weight, r = ?). -1 ≤ r ≤ 1 r will not work for non-linear relationships r does not have units (r ≠.30 pounds?) r is not resistant to outliers! Consider the effect of outliers when looking at r, report r with outliers and without r is same regardless of which is explanatory and which is response variable

Understanding what is expected with LSRLs Note: When finding LSRL the placement of the explanatory and response variables DOES matter! Y_hat = _x + _ (prediction equation, equation of line of best fit) Found by minimizing sum of squares of residuals *extra credit for manual calculation from packet 1. Find LSRL using calculator: stat->calc->8 resulted in the output for packet examples, y_hat = a + bx (#8 2. Find LSRL using computer output. 3. Find LSRL using b= r s y /s x. (You are not given data, you are given statistics: s y, s x, x_bar, y_bar, and r.) Find b., Substitute into y_bar = ax_bar + b. Solve for a. Substitute a and b into y_hat = a + bx and you are done.

Simple understanding: r v r 2 r, correlation coefficient (strength and direction, only about relationship between x and y, r is related to the slope of LSRL – b = rs y /s x ) R 2, coefficient of determination (how strong=accurate is our LSRL?)

Examining LSRLs: r v. r 2 Students height v. weight y_hat = 4.915x predicted weight = 4.915(height) r = r 2 =

To answer the question in your packet, which is the better prediction equation (which would be more accurate in making a prediction)? The one with the highest r 2 value! The higher the value, the more % of variation in y is explained by the LSRL of y on x.

Theory behind r 2 It tells us how much better a line with a slope would be at predicting than a line of y=y_bar (the average of all of the y values in the data set). It compares the vertical deviations (residuals) between the sloped line and the horizontal line (y=y_bar) and tells how much better the sloped line is in accounting for this variation. This math and theory can be found in the book You don’t have to know the theory for AP Test or my test.

What You Should Know: Summary of r 2 r 2 tells us how accurate our LSRL is at making predictions. Do you think the x value in each observation tells you something about y? How much is it actually telling you? When r 2 = 1 we say “100% of the variation in y is explained by the LSRL of y on x. When r 2 =.64, we say “64% of the variation in y is explained by the LSRL of y on x. r 2 tells us the fractional variation in y that is explained by the LSRL of y on x. MUST USE THIS SPECIFIC LANGUAGE TO INTERPRET r2 ON THE AP TEST AND MY TEST!!!

What is a residual? The vertical deviation from y to y_hat from each observation to the LSRL (y_hat). The residual values (the vertical deviations) are stored in your calculator each time you run a linear regression LinReg a+bx. These residuals can be found in RESID in your calculator 2 nd ->Stat->RESID

What do the residuals tell us? The residuals tell us whether a line is a best fit (maybe a non-linear function, exponential or power, might fit the data better and help us predict better). How to create a residual plot: Plot x, the explanatory variable, L1 vs. y=RESIDS. If the plot shows a pattern (not scattered), a line is not a best fit.

HW Assignment - TBA For each set of data in your packet (4 sets): Enter the data in L1, L2. Run the linear regression to create your LSRL. On a separate sheet of paper, record r and r 2. Also, in two sentences, interpret r and r 2 in context of the data. Create a residual plot (L1 vs. Resids) on your calculator in statplot and sketch it on this separate sheet. On the separate sheet, answer these questions: Is the LSRL (a linear equation) a best fit for this data? How do you know? How accurate is the LSRL? How do you know? Also, complete the last page of the packet for HW.