Tuesday December 10 Make a scatter plot Estimate a line of best fit

Slides:



Advertisements
Similar presentations
5.4 Correlation and Best-Fitting Lines
Advertisements

Analyzing Bivariate Data With Fathom * CFU Using technology with a set of contextual linear data to examine the line of best fit; determine and.
1 Objective Investigate how two variables (x and y) are related (i.e. correlated). That is, how much they depend on each other. Section 10.2 Correlation.
5-7: Scatter Plots & Lines of Best Fit. What is a scatter plot?  A graph in which two sets of data are plotted as ordered pairs  When looking at the.
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
Correlation and Linear Regression Chapter 13 McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
Descriptive Methods in Regression and Correlation
Copyright © Cengage Learning. All rights reserved. 1 Functions and Their Graphs.
Copyright © Cengage Learning. All rights reserved.
Correlation and regression 1: Correlation Coefficient
1 Chapter 3: Examining Relationships 3.1Scatterplots 3.2Correlation 3.3Least-Squares Regression.
Warm-up 3.4 Diagnostic of Influential Points 4) 5) 6)
Bivariate Distributions Overview. I. Exploring Data Describing patterns and departures from patterns (20%-30%) Exploring analysis of data makes use of.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
1.6 Linear Regression & the Correlation Coefficient.
Linear Regression Least Squares Method: the Meaning of r 2.
Chapter 10 Correlation and Regression
 Graph of a set of data points  Used to evaluate the correlation between two variables.
3.2 Least Squares Regression Line. Regression Line Describes how a response variable changes as an explanatory variable changes Formula sheet: Calculator.
Variation This presentation should be read by students at home to be able to solve problems.
Regression Regression relationship = trend + scatter
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Draw Scatter Plots and Best-Fitting Lines Section 2.6.
Correlation and Regression. Section 9.1  Correlation is a relationship between 2 variables.  Data is often represented by ordered pairs (x, y) and.
Simple Linear Regression In the previous lectures, we only focus on one random variable. In many applications, we often work with a pair of variables.
Chapter 2 – Linear Equations and Functions
Creating a Residual Plot and Investigating the Correlation Coefficient.
Correlation The apparent relation between two variables.
Scatter Plots, Correlation and Linear Regression.
Chapter 8 Linear Regression HOW CAN A MODEL BE CREATED WHICH REPRESENTS THE LINEAR RELATIONSHIP BETWEEN TWO QUANTITATIVE VARIABLES?
LEAST-SQUARES REGRESSION 3.2 Least Squares Regression Line and Residuals.
Scatterplots and Linear Regressions Unit 8. Warm – up!! As you walk in, please pick up your calculator and begin working on your warm – up! 1. Look at.
In this lesson, you will create residual plots to help you visually determine how well a LSRL fits the data. Day 1.
GOAL: I CAN USE TECHNOLOGY TO COMPUTE AND INTERPRET THE CORRELATION COEFFICIENT OF A LINEAR FIT. (S-ID.8) Data Analysis Correlation Coefficient.
STANDARD: 8.SP.1 STATISTICS AND PROBABILITY: SCATTER PLOTS.
Correlation and Linear Regression
CHAPTER 3 Describing Relationships
CPM and CCSS Statistics
Unit 4 LSRL.
CHAPTER 3 Describing Relationships
Warm-up: This table shows a person’s reported income and years of education for 10 participants. The correlation is .79. State the meaning of this correlation.
Chapter 5 LSRL.
SIMPLE LINEAR REGRESSION MODEL
CHAPTER 10 Correlation and Regression (Objectives)
Suppose the maximum number of hours of study among students in your sample is 6. If you used the equation to predict the test score of a student who studied.
Correlation and Simple Linear Regression
1) A residual: a) is the amount of variation explained by the LSRL of y on x b) is how much an observed y-value differs from a predicted y-value c) predicts.
2.6 Draw Scatter Plots and Best-Fitting Lines
Regression.
Equations of Lines and Modeling
CHAPTER 3 Describing Relationships
Correlation and Simple Linear Regression
Functions and Their Graphs
Regression.
Warm-up: This table shows a person’s reported income and years of education for 10 participants. The correlation is .79. State the meaning of this correlation.
Chapter 5 LSRL.
Correlation and Regression
Simple Linear Regression and Correlation
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Algebra Review The equation of a straight line y = mx + b
A medical researcher wishes to determine how the dosage (in mg) of a drug affects the heart rate of the patient. Find the correlation coefficient & interpret.
Applying linear and median regression
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Tuesday December 10 Make a scatter plot Estimate a line of best fit Hours GPA 4 2.9 5 3.3 11 3.9 1 2.2 15 4.1 2 1.8 10 4.6 6 7 3 9 4.5 Make a scatter plot Estimate a line of best fit Write a rule Interpret the slope and y intercept Find each residual Find your upper and lower boundaries

You may recall that to find the equation of the LSRL, your calculator minimized the sum of the squares of the residuals. The smaller the sum of the squares, the closer the data was to the line of best fit. However, the magnitude of the sum of squares depends on the units of the variables being plotted. Therefore the sum of squares cannot be compared between different scatterplots—it is not a good way to compare the strength of various associations. Today you will investigate a better way to describe the strength of an association, by using the correlation coefficient.

6-67. CORRELATION COEFFICIENT The correlation coefficient, r, is a measure of how much or how little data is scattered around the LSRL. That is, if you have already plotted the residuals and decided that a linear model is a good fit, the correlation coefficient, r, is a measure of the strength of a linear association. The correlation coefficient does not have units, so it is useful no matter what the units of the variables are. This investigation will help you learn more about the correlation coefficient r.

Select any three points in the first quadrant. Use your calculator to make a scatterplot and find the LSRL. Make a sketch of the scatterplot in your notebook, and record the value of the correlation coefficient r. Your teacher will show you how to use you calculator to find r. Continue to investigate different combinations of three points and graph each of their scatterplots. Work with your team to discuss and record all of your conclusions about the value of r from this investigation. Your Task: With your team, use your calculators to explore the interpretation of the correlation coefficient r.

6-68. This investigation will help you learn more about the correlation coefficient r. For parts (a) through (e), use your calculator to make a scatterplot and find the LSRL. Make a sketch of each graph in your notebook, and record the value of the correlation coefficient r. Your teacher will show you how to use your calculator to find r. Start by graphing any two points that have integer coordinates and a positive slope between them. Each member in your team should choose a different pair of points. Compare your results with your team. Each member of your team should choose two new points that have a negative slope between them. Compare your results with your team.

c) What happens when you have more than two data points c) What happens when you have more than two data points? Use your original two points from part (a), and add an additional third point that results in r = 1. How can you describe the location of all possible points that result in r = 1? d) Start again with your original two points from part (a). Enter a third point that makes the slope of the LSRL negative. What happens to r? e) Start again with your original two points from part (a). Choose a third point that makes r close to zero (say, r between –0.2 and 0.2). f) Work with your team to discuss the following questions, and record all of your conclusions about the value of r. What is the largest r can be? The smallest? What do the scatterplot and LSRL look like if r = 1? r = –1? r = 0? What does a value of r close to 1 mean, compared to a value of r close to zero?

6-69. The following scatterplots have correlation r = −0. 9, r = −0 6-69. The following scatterplots have correlation r = −0.9, r = −0.6, r = 0.1, and r = 0.6. Which scatterplot has which correlation coefficient, r?

6-70. LEARNING LOG Work with your team to discuss how the value of r helps you numerically describe the strength and direction of an association. When you have come to an agreement, write your ideas as a Learning Log entry. Title this entry “Correlation Coefficient, r” and label it with today’s date.

6-71. In problem 6-1, you completed an investigation that helped Robbie use a viewing tube to see a football game. Typical data is shown in the table below. The LSRL is y = 1.66 + 0.13x. Find the correlation coefficient. Is the association strong or weak? Describe the form, direction, strength, and outliers of the association. You already know a graphical way to determine if the “form” is linear by looking at the residual plot for the data. A mathematical description of “direction” is the slope. A mathematical description of “strength” is the correlation coefficient. Mathematical descriptions for outliers will be dealt with in a later course. Describe the form, direction, and strength of the viewing tube data in more mathematical terms than you did in part (b).

6-72. Extension: A computer will help us explore the correlation coefficient further. Go to http://illuminations.nctm.org/LessonDetail.aspx?ID=L456#qs . Add some points to the graph by clicking on the graph. Press “Show Line” to plot the LSRL line and calculate the correlation coefficient, r. Press Ctrl-click to delete a point. Hold Shift-click to drag a point. Your screen should look something like this: Create scatterplots with the following associations and record r: Strong positive linear association Weak positive linear association Strong negative linear association No linear association (random scatter) Use just five points to make a strong negative linear association (say r < −0.95). Drag one of the points around to observe the effect on the slope and correlation coefficient. Can you make the slope positive by dragging just one point?