Chapter 3 Describing Relationships

Slides:



Advertisements
Similar presentations
A student wonders if tall women tend to date taller men than do short women. She measures herself, her dormitory roommate, and the women in the adjoining.
Advertisements

Chapter 3 Bivariate Data
2nd Day: Bear Example Length (in) Weight (lb)
CHAPTER 3 Describing Relationships
A student wonders if tall women tend to date taller men than do short women. She measures herself, her dormitory roommate, and the women in the adjoining.
Chapter 3 concepts/objectives Define and describe density curves Measure position using percentiles Measure position using z-scores Describe Normal distributions.
Objective: Understanding and using linear regression Answer the following questions: (c) If one house is larger in size than another, do you think it affects.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
Objective Find the line of regression. Use the Line of Regression to Make Predictions.
3.2 Least-Squares Regression Objectives SWBAT: INTERPRET the slope and y intercept of a least-squares regression line. USE the least-squares regression.
Linear Regression Day 1 – (pg )
LEAST-SQUARES REGRESSION 3.2 Least Squares Regression Line and Residuals.
CHAPTER 3 Describing Relationships
3.2 - Residuals and Least Squares Regression Line.
Chapters 8 Linear Regression. Correlation and Regression Correlation = linear relationship between two variables. Summarize relationship with line. Called.
Chapter 3: Describing Relationships
Describing Relationships. Least-Squares Regression  A method for finding a line that summarizes the relationship between two variables Only in a specific.
1. Analyzing patterns in scatterplots 2. Correlation and linearity 3. Least-squares regression line 4. Residual plots, outliers, and influential points.
Warm-up Get a sheet of computer paper/construction paper from the front of the room, and create your very own paper airplane. Try to create planes with.
A student wonders if tall women tend to date taller men than do short women. She measures herself, her dormitory roommate, and the women in the adjoining.
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
LSRL.
Least Squares Regression Line.
3.2 Least-Squares Regression
Sections Review.
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Examining Relationships
SCATTERPLOTS, ASSOCIATION AND RELATIONSHIPS
Chapter 5 LSRL.
LSRL Least Squares Regression Line
The following data represents the amount of Profit (in thousands of $) made by a trucking company dependent on gas prices. Gas $
The scatterplot shows the advertised prices (in thousands of dollars) plotted against ages (in years) for a random sample of Plymouth Voyagers on several.
Chapter 3.2 LSRL.
The Least-Squares Regression Line
Describe the association’s Form, Direction, and Strength
Suppose the maximum number of hours of study among students in your sample is 6. If you used the equation to predict the test score of a student who studied.
Ice Cream Sales vs Temperature
Least Squares Regression Line LSRL Chapter 7-continued
residual = observed y – predicted y residual = y - ŷ
CHAPTER 3 Describing Relationships
Least-Squares Regression
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Least-Squares Regression
Chapter 3 Describing Relationships Section 3.2
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 5 LSRL.
Chapter 5 LSRL.
Least-Squares Regression
Chapter 3: Describing Relationships
Least-Squares Regression
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Warmup A study was done comparing the number of registered automatic weapons (in thousands) along with the murder rate (in murders per 100,000) for 8.
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
3.2 – Least Squares Regression
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
9/27/ A Least-Squares Regression.
Review of Chapter 3 Examining Relationships
CHAPTER 3 Describing Relationships
Presentation transcript:

Chapter 3 Describing Relationships

3.1 Scatterplots and correlation

Variables Why do we study relationships between two variables? What is the difference between an explanatory variable and a response variable? How do you know which variable to put on which axis? Where do you start each axis?

Scatterplots What is the easiest way to lose points when making a scatterplot?

Example Track and Field Day! The table below shows data for 13 students in a statistics class. Each member of the class ran a 40-yard sprint and then did a long jump (with a running start). Make a scatterplot of the relationship between sprint time (in seconds) and long jump distance (in inches).

Example Con’t

Interpreting Scatterplot What four characteristics should you consider when interpreting a scatterplot? Describe Track and Field Example

Example The following scatterplot shows the amount of carbs (in grams) and amount of fat (in grams) of beef sandwiches from McDonalds. Describe the relationship between carbs and fat.

Cause and Effect Does a strong association between two variables indicate a cause-and-effect relationship?

Calculators Using technology to create scatterplots

Correlation Just like two distributions can have the same shape and center with different spreads, two associations can have the same direction and form, but very different strengths.

Correlation Con’t What is the correlation r? What are some characteristics of the correlation? http://www.rossmanchance.com/applets/GuessCorrelation.html

Correlation Con’t

Correlation Con’t Can you determine the form of a relationship using only the correlation? www.whfreeman.com/tps4e Is correlation a resistant measure of strength? Do you need to know the formula for correlation? What are some additional facts about correlation?

3.2 Least-Squares Regression

Regression Equation What is the general form of a regression equation? What is the difference between y and ?

Example Used Hondas: The following scatterplot shows the number of miles driven (in thousands) and advertised price (in thousands) for 11 used Honda CR-Vs from the 2002-2006 model years. The regression line shown on the scatterplot is = 18773 – 86.18x.

Example Con’t Interpret the slope and y intercept of a regression line. Predict the price of a used CR-V with 50,000 miles. Predict the price of a used CR-V with 250,000 miles. How confident are you in this prediction?

extrapolation What is extrapolation? Is it a good idea to extrapolate?

Residual What is a residual? How do you interpret a residual?

Example Calculate and interpret the residual for the Honda CR-V with 86,000 miles and an asking price of $10,998.

Example Con’t How can we determine the “best” regression line for a set of data? Is the least-squares regression line resistant to outliers?

Example McDonalds Beef Sandwiches: Calculate the equation of the least-squares regression line using technology. Make sure to define variables! Sketch the scatterplot with the graph of the least-squares regression line. Interpret the slope and y-intercept in context Calculate and interpret the residual for the Big Mac, with 45g of carbs and 29g of fat.

Residual Plot What is a residual plot? What is the purpose of a residual plot? What two things do you look for in a residual plot? How can you tell if a linear model is appropriate?

Example Construct and interpret a residual plot for the Honda CR-V data.

Standard Deviation What is the standard deviation of the residuals? How do you calculate and interpret it?

Example Calculate and interpret the standard deviation of the residuals for the Honda CR-V data.

Example Con’t Suppose that you see a used Honda CR-V for sale. Predict the asking price for this CR-V. How much better would our predictions be if we knew how many miles it had been driven?

r2 What is the coefficient of determination r2? How do you calculate and interpret r2? How is r2 related to r? How is r2 related to s?

Interpreting Computer Output When Mentos are dropped into a newly opened bottle of Diet Coke, carbon dioxide is released from the Diet Coke very rapidly, causing the Diet Coke to be expelled from the bottle. Will more Diet Coke be expelled when there is a larger number of Mentos dropped in the bottle? Two statistics students, Brittany and Allie, decided to find out. Using 16 ounce (2 cup) bottles of Diet Coke, they dropped either 2, 3, 4, or 5 Mentos into a randomly selected bottle, waited for the fizzing to die down, and measured the number of cups remaining in the bottle. Then, they subtracted this measurement from the original amount in the bottle to calculate the amount of Diet Coke expelled (in cups). Output from a regression analysis is shown below.

Con’t

Con’t What is the equation of the least-squares regression line? Define any variables you use. Interpret the slope of the least-squares regression line. What is the correlation? Is a linear model appropriate for this data? Explain. Would you be willing to use the linear model to predict the amount of Diet Coke expelled when 10 mentos are used? Explain.

Con’t Calculate and interpret the residual for bottle of diet coke that had 2 mentos and lost 1.25 cups. Interpret the values of r2 and s. If the amount expelled was measured in ounces instead of cups, how would the values of and s be affected? Explain.

Summary

Example Using the top ten money winners from the 2009 LPGA Tour, we can investigate the relationship between average driving distance and driving accuracy using a scatterplot. Here we will use average driving distance (in yards) as the explanatory variable and driving accuracy (proportion of drives that land in the fairway) as the response variable.

Con’t Draw a scatterplot for this association and discuss the noticeable features. Calculate the equation of the least squares regression line and graph it on the scatterplot. Interpret the slope and y-intercept in the context of the problem. Calculate and interpret the value of the correlation coefficient. If the distance was measured in feet instead of yards, how would the correlation change? Explain.

Con’t Calculate and interpret the residual for Lorena Ochoa. Sketch the residual plot. What information does this provide? Calculate and interpret the value of s in the context of the problem. Calculate and interpret the value of in the context of the problem.

Regression Does it matter which variable is x and which is y? Which of the following has the highest correlation? How do outliers affect the correlation, least-squares regression line, and standard deviation of the residuals? Are all outliers influential?

Example Here is a scatterplot showing the cost in dollars and the battery life in hours for a sample of netbooks (small laptop computers). What effect do the two netbooks that cost $500 have on the equation of the least-squares regression line, correlation, standard deviation of the residuals, and r2? Explain.

Example Here is a scatterplot showing the relationship between the number of fouls and the number of points scored for NBA players in the 2010-2011 season Describe the association. Should NBA players commit more fouls if they want to score more points? Explain.

Regression How can you calculate the equation of the least-squares regression line using summary statistics? What happens to the predicted value of y for each increase of 1 standard deviation in x?