Scatterplots, Association and Correlation

Slides:



Advertisements
Similar presentations
Section 6.1: Scatterplots and Correlation (Day 1).
Advertisements

Math 243 Exam 2 Preparation In regression what is considered an influential point? Any ordered pair is considered influential, if when removed, the result.
AP Statistics Chapters 3 & 4 Measuring Relationships Between 2 Variables.
Correlation vs. Causation What is the difference?.
Correlation with a Non - Linear Emphasis Day 2.  Correlation measures the strength of the linear association between 2 quantitative variables.  Before.
AP Statistics Chapter 8 & 9 Day 3
Example 1: page 161 #5 Example 2: page 160 #1 Explanatory Variable - Response Variable - independent variable dependent variable.
 Graph of a set of data points  Used to evaluate the correlation between two variables.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
Relationships Scatterplots and correlation BPS chapter 4 © 2006 W.H. Freeman and Company.
UNIT 4 Bivariate Data Scatter Plots and Regression.
Method 3: Least squares regression. Another method for finding the equation of a straight line which is fitted to data is known as the method of least-squares.
Correlation  We can often see the strength of the relationship between two quantitative variables in a scatterplot, but be careful. The two figures here.
Part II Exploring Relationships Between Variables.
Warm-up Get a sheet of computer paper/construction paper from the front of the room, and create your very own paper airplane. Try to create planes with.
Linear Regression.
Two Quantitative Variables
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships

Chapter 3: Describing Relationships
Looking at data: relationships - Correlation
Chapter 3: Describing Relationships
Correlation vs. Causation
Least-Squares Regression
Chapter 3: Describing Relationships
Investigation 4 Students will be able to identify correlations in data and calculate and interpret standard deviation.
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Review of Chapter 3 Examining Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
11A Correlation, 11B Measuring Correlation
Chapter 3: Describing Relationships
Adequacy of Linear Regression Models
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
y = mx + b Linear Regression line of best fit REMEMBER:
Adequacy of Linear Regression Models
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Adequacy of Linear Regression Models
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Math 243 Exam 2 Preparation.
Adequacy of Linear Regression Models
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Scatterplots, Association and Correlation
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
AP Stats Agenda Text book swap 2nd edition to 3rd Frappy – YAY
CHAPTER 3 Describing Relationships
Correlation & Trend Lines
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Statistics 101 CORRELATION Section 3.2.
Chapter 3: Describing Relationships
Correlation Coefficient
Solution to Problem 2.25 DS-203 Fall 2007.
Review of Chapter 3 Examining Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Scatterplots contd: Correlation The regression line
3.2 Correlation Pg
Presentation transcript:

Scatterplots, Association and Correlation “If he was all on the same scale as his foot, he must certainly have been a giant.” - Sherlock Holmes The Adventure of Wisteria Lodge.

Scatterplots, Association and Correlation y = 0.9x +18 Using function fitting tools (regression)

Scatterplots, Association and Correlation Response Variable Explanatory Variable Direction: Positive (vs. negative) relationship

Scatterplot and line of fit: Calculator example Calculator Example: Tuition at a University Data: 6546, 6996, 6996, 7350, 7500, 7978, 8377, 8710, 9110, 9411, 9800 Y = 326.08 x+ 6439 r = .99

Scatterplot and line of fit: By hand “Eyeball” a line of fit. Pick two representative Points (points on your line). Use y – y0 =m(x – x0) Using (6, 8377) and (7,8710): M = (8710-8377)/(7-6)=333 Y – 8710 = 333(x – 7) Y = 333x + 6379 Y = 3x + 92

Scatterplot and line of fit: Comparision of methods 22 178 25 173 20 150 24 169 166 29 170 163 23 179 28 167 27 157 161 Looks like our “eyeball” was off a good bit, when compared to the software. Y = 3x +92 vs y =1.3x + 135

Correlation Measures the linear association between two variables. Correlation Coefficient Sum the product of the x and y z-score for each ordered pair and divide by n-1

Correlation Correlation Coefficient x y x-xbar y-ybar (x-xbar)(y-ybar)/(sx*sy)=zx*zy 6 5 -8 -2 0.7613 10 3 -4 14 7 0.0000 19 8 1 0.2379 21 12 1.6652 sum 3.4256 sum/(n-1) 0.8564

Correlation Measures the linear association between two variables. A correlation greater than 0.8 is generally described as strong, whereas a correlation less than 0.5 is generally described as weak. 

Correlation Conditions for using the correlation coefficient: Measures the linear association between two variables. Conditions for using the correlation coefficient: Quantitative Variables Linear Outliers are not distorting correlation

Correlation Sometimes we can make non-linear data linear This is the data for f-stops for a camera. Here we square the data to make it linear. Another transformation that statisticians sometimes use involves logs .

Correlation vs. Causation Discuss possible lurking variables: Possibly poor decisions based on incorrect causation assumptions: The number of AP courses a student takes and SAT performance strongly correlate, so we should enroll all students in more AP courses. Studies show that the number of police in an area positively correlates with the amount of gang activity, so we should reduce the number of policemen to reduce gang activity. Homework and student performance have a positive correlation, so all teachers should assign more homework every night in every course. The amount of damage caused by a fire has a positive correlation with the number of firemen at the scene. To reduce the amount of damage due to fires, we should send less firemen.