Basic Practice of Statistics - 5th Edition

Presentation transcript:

Basic Practice of Statistics - 5th Edition, Chapter 4: Scatterplots and Correlation (BPS - 5th Ed., Chapter 4)

Explanatory and Response Variables: We are interested in studying the relationship between two variables by measuring both variables on the same individuals. A response variable measures an outcome of a study. An explanatory variable explains or influences changes in a response variable. Sometimes there is no distinction between the two.

Question: In a study to determine whether surgery or chemotherapy results in higher survival rates for a certain type of cancer, whether or not the patient survived is one variable, and whether the patient received surgery or chemotherapy is the other. Which is the explanatory variable and which is the response variable? (From Seeing Through Statistics, 2nd Edition, by Jessica M. Utts.)

Scatterplot: A scatterplot graphs the relationship between two quantitative (numerical) variables measured on the same individuals. If a distinction exists, plot the explanatory variable on the horizontal (x) axis and the response variable on the vertical (y) axis.

Scatterplot [figure]: Relationship between mean SAT verbal score and percent of high school graduates taking the SAT.

Scatterplot [figure, Southern states highlighted]: To add a categorical variable, use a different plot color or symbol for each category.

Scatterplot: Look for the overall pattern and for deviations from this pattern. Describe the pattern by the form, direction, and strength of the relationship. Look for outliers.

To Make a Scatterplot: Scale the axes with uniform intervals (x and y need not use the same scale). Label the axes. Fill the available area by choosing an appropriate scale.

Interpreting a Scatterplot: Look for a pattern, then for deviations from the pattern. Describe the relationship by strength, direction, and form.

Interpreting a Scatterplot: Strength words: very strong, fairly strong, fairly weak, weak. Direction words: positive, negative. Form words: linear, exponential, curved, clusters.

Linear Relationship: In some relationships the points of a scatterplot tend to fall along a straight line. This is a linear relationship.

Direction: Positive association: above-average values of one variable tend to accompany above-average values of the other variable, and below-average values tend to occur together. Negative association: above-average values of one variable tend to accompany below-average values of the other variable, and vice versa.

Examples: From a scatterplot of college students, there is a positive association between verbal SAT score and GPA. For used cars, there is a negative association between the age of the car and the selling price. (From Seeing Through Statistics, 2nd Edition, by Jessica M. Utts.)
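The definition of direction can be checked point by point. The sketch below uses made-up used-car data (the ages and prices are illustrative assumptions, not values from the slides) and a hypothetical helper, `deviation_signs`, that counts how many points pair deviations of the same sign and how many pair deviations of opposite sign.

```python
def deviation_signs(xs, ys):
    """Count points whose deviations from the two means agree or disagree in sign."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    products = [(x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)]
    same = sum(1 for p in products if p > 0)      # both above or both below average
    opposite = sum(1 for p in products if p < 0)  # one above average, the other below
    return same, opposite

# Hypothetical used cars: age in years, selling price in thousands of dollars.
car_age = [1, 2, 4, 6, 8, 10]
car_price = [22, 19, 15, 11, 8, 5]

same, opposite = deviation_signs(car_age, car_price)
print(f"same-sign points: {same}, opposite-sign points: {opposite}")
```

When most points fall in the opposite-sign group, as here, the association is negative; when most fall in the same-sign group, it is positive.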

Examples of Relationships [figure]

Measuring Strength & Direction of a Linear Relationship: How closely does a non-horizontal straight line fit the points of a scatterplot? The correlation coefficient r (often called simply the correlation) measures both the strength and the direction of the relationship. Strength: the stronger the relationship, the larger the magnitude of r. Direction: positive r indicates a positive relationship, and negative r indicates a negative relationship.

Correlation Coefficient: Special values for r: a perfect positive linear relationship has r = +1; a perfect negative linear relationship has r = -1; if there is no linear relationship, or if the scatterplot points are best fit by a horizontal line, then r = 0. Note: r must be between -1 and +1, inclusive. Both variables must be quantitative, and r makes no distinction between response and explanatory variables. r has no units and does not change when measurement units are changed (for example, feet to inches).
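These properties are easy to confirm numerically. The following is a minimal sketch, assuming made-up height and weight values (not from the slides) and a hypothetical helper named `pearson_r`; it checks that r is unchanged when feet are converted to inches, unchanged when the two variables swap roles, and equal to +1 or -1 for points on a perfect line.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Correlation r. Equivalent to the average product of standardized
    values, because the (n - 1) factors cancel."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

# Hypothetical data: heights in feet and weights in pounds.
height_ft = [5.0, 5.4, 5.8, 6.0, 6.3]
weight_lb = [120, 135, 160, 158, 190]
height_in = [12 * h for h in height_ft]  # the same heights in inches

r_ft = pearson_r(height_ft, weight_lb)
r_in = pearson_r(height_in, weight_lb)       # units changed
r_swapped = pearson_r(weight_lb, height_ft)  # roles of x and y swapped

perfect_pos = pearson_r([1, 2, 3, 4], [10, 20, 30, 40])
perfect_neg = pearson_r([1, 2, 3, 4], [40, 30, 20, 10])

print(r_ft, r_in, r_swapped, perfect_pos, perfect_neg)
```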

Examples of Correlations [figure]

Examples of Correlations: Husband's versus wife's ages: r = 0.94. Husband's versus wife's heights: r = 0.36. Professional golfers' putting success (distance of putt in feet versus percent success): r = -0.94.

Not all Relationships are Linear [figure: miles per gallon versus speed]: Is the relationship linear? The correlation is close to zero.

Not all Relationships are Linear [figure: miles per gallon versus speed]: The relationship is curved, so the correlation is misleading.
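A small numerical illustration of the same point: the sketch below uses invented speed and mileage values (an assumption for illustration, not the data behind the slide) in which mileage rises, peaks, and falls symmetrically. The pattern is strong but curved, and r comes out to zero. The helper `pearson_r` is a hypothetical name.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Correlation r from sums of deviation products."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

# Hypothetical data: mileage peaks at 50 mph and drops off on either side.
speed_mph = [20, 30, 40, 50, 60, 70, 80]
miles_per_gallon = [21, 26, 29, 30, 29, 26, 21]

r = pearson_r(speed_mph, miles_per_gallon)
print(f"r = {r:.3f}")  # zero, even though mileage clearly depends on speed
```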

Problems with Correlations: Outliers can inflate or deflate correlations (see next slide). Groups combined inappropriately may mask relationships (a third variable); the groups may have different relationships when separated.
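The effect of combining groups can be sketched with two invented groups (all values below are assumptions for illustration). Within each group the association is positive, but pooling the groups produces a negative r because the second group has larger x values and smaller y values overall. The helper `pearson_r` is a hypothetical name.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Correlation r from sums of deviation products."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

# Hypothetical groups: each shows a positive association on its own.
group_a_x, group_a_y = [1, 2, 3, 4], [10, 12, 11, 13]
group_b_x, group_b_y = [6, 7, 8, 9], [2, 4, 3, 5]

r_a = pearson_r(group_a_x, group_a_y)
r_b = pearson_r(group_b_x, group_b_y)
r_combined = pearson_r(group_a_x + group_b_x, group_a_y + group_b_y)

print(f"group A: {r_a:.2f}, group B: {r_b:.2f}, combined: {r_combined:.2f}")
```

Plotting the groups with different colors or symbols, as suggested for categorical variables, is one way to spot this situation before computing r.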

Outliers and Correlation [figure: scatterplots A and B]: For each scatterplot, how does the outlier affect the correlation? A: the outlier decreases the correlation. B: the outlier increases the correlation. (From Seeing Through Statistics, 2nd Edition, by Jessica M. Utts.)
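Both effects can be reproduced with small invented data sets (the values are assumptions, not the points plotted in scatterplots A and B). In the first case a single point far from a perfect line pulls r down; in the second, a single distant point added to a patternless cloud pushes r up. The helper `pearson_r` is a hypothetical name.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Correlation r from sums of deviation products."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

# Case A (hypothetical): a perfect line, then one point far off the line.
line_x, line_y = [1, 2, 3, 4, 5, 6], [2, 4, 6, 8, 10, 12]
r_a_without = pearson_r(line_x, line_y)
r_a_with = pearson_r(line_x + [7], line_y + [1])

# Case B (hypothetical): a cloud with no linear pattern, then one distant point.
cloud_x, cloud_y = [1, 2, 3, 1, 2, 3], [1, 3, 2, 3, 1, 2]
r_b_without = pearson_r(cloud_x, cloud_y)
r_b_with = pearson_r(cloud_x + [10], cloud_y + [10])

print(f"A: {r_a_without:.2f} -> {r_a_with:.2f}")
print(f"B: {r_b_without:.2f} -> {r_b_with:.2f}")
```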

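Correlation Calculation: Suppose we have data on variables X and Y for n individuals: $x_1, x_2, \dots, x_n$ and $y_1, y_2, \dots, y_n$. Each variable has a mean and standard deviation: $\bar{x} = \frac{1}{n}\sum x_i$ and $s_x = \sqrt{\frac{1}{n-1}\sum (x_i - \bar{x})^2}$ for X, and $\bar{y} = \frac{1}{n}\sum y_i$ and $s_y = \sqrt{\frac{1}{n-1}\sum (y_i - \bar{y})^2}$ for Y.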

Case Study: Per Capita Gross Domestic Product and Average Life Expectancy for Countries in Western Europe

Case Study:

Country          Per Capita GDP (x)   Life Expectancy (y)
Austria          21.4                 77.48
Belgium          23.2                 77.53
Finland          20.0                 77.32
France           22.7                 78.63
Germany          20.8                 77.17
Ireland          18.6                 76.39
Italy            21.5                 78.51
Netherlands      22.0                 78.15
Switzerland      23.8                 78.99
United Kingdom   21.2                 77.37

Case Study (mean of x = 21.52, mean of y = 77.754, sx = 1.532, sy = 0.795):

x      y       zx = (x - mean)/sx   zy = (y - mean)/sy   zx * zy
21.4   77.48   -0.078               -0.345                0.027
23.2   77.53    1.097               -0.282               -0.309
20.0   77.32   -0.992               -0.546                0.542
22.7   78.63    0.770                1.102                0.849
20.8   77.17   -0.470               -0.735                0.345
18.6   76.39   -1.906               -1.716                3.271
21.5   78.51   -0.013                0.951               -0.012
22.0   78.15    0.313                0.498                0.156
23.8   78.99    1.489                1.555                2.315
21.2   77.37   -0.209               -0.483                0.101

sum of the products = 7.285
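The standardized-value calculation in this table can be reproduced in a few lines. This is a sketch rather than the method used to prepare the slides; the data are the case study values, while the variable names are illustrative choices.

```python
from math import sqrt

# Case study data from the slides: per capita GDP (x) and life expectancy (y).
gdp = [21.4, 23.2, 20.0, 22.7, 20.8, 18.6, 21.5, 22.0, 23.8, 21.2]
life = [77.48, 77.53, 77.32, 78.63, 77.17, 76.39, 78.51, 78.15, 78.99, 77.37]

n = len(gdp)
mean_x = sum(gdp) / n
mean_y = sum(life) / n

# Sample standard deviations (divide by n - 1).
s_x = sqrt(sum((x - mean_x) ** 2 for x in gdp) / (n - 1))
s_y = sqrt(sum((y - mean_y) ** 2 for y in life) / (n - 1))

# Standardize each value, multiply the pairs, and add up the products.
z_x = [(x - mean_x) / s_x for x in gdp]
z_y = [(y - mean_y) / s_y for y in life]
total = sum(a * b for a, b in zip(z_x, z_y))

# The correlation is the sum of the products divided by n - 1.
r = total / (n - 1)
print(f"means: {mean_x:.2f}, {mean_y:.3f}; sds: {s_x:.3f}, {s_y:.3f}")
print(f"sum of products: {total:.3f}; r: {r:.3f}")
```

The sum of the products matches the table's 7.285 to rounding, and dividing it by n - 1 = 9 gives r of about 0.809.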

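Case Study: The correlation is the sum of the products of the standardized values divided by n - 1: $r = \frac{1}{n-1}\sum \left(\frac{x_i - \bar{x}}{s_x}\right)\left(\frac{y_i - \bar{y}}{s_y}\right) = \frac{7.285}{10 - 1} \approx 0.809$.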


          x       y        xy      x^2     y^2
          21.4    77.48    1658    458     6003
          23.2    77.53    1799    538.2   6011
          20      77.32    1546    400     5978
          22.7    78.63    1785    515.3   6183
          20.8    77.17    1605    432.6   5955
          18.6    76.39    1421    346     5835
          21.5    78.51    1688    462.3   6164
          22      78.15    1719    484     6107
          23.8    78.99    1880    566.4   6239
          21.2    77.37    1640    449.4   5986
sum       215.2   777.54   16742   4652    60463
squared   46311   604568.5

n = 10
Top = n(sum of xy) - (sum of x)(sum of y) = 88.692
Bottom before sqrt = [n(sum of x^2) - (sum of x)^2] [n(sum of y^2) - (sum of y)^2] = 12006.64206
Bottom = 109.5748
r = Top / Bottom = 0.80942
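The spreadsheet-style calculation above corresponds to the computational form of r, which works from raw sums instead of standardized values. The sketch below recomputes it from the case study data; the variable names `top`, `bottom_before_sqrt`, and `bottom` are chosen to mirror the labels in the table and are otherwise arbitrary.

```python
from math import sqrt

# Case study data from the slides: per capita GDP (x) and life expectancy (y).
gdp = [21.4, 23.2, 20.0, 22.7, 20.8, 18.6, 21.5, 22.0, 23.8, 21.2]
life = [77.48, 77.53, 77.32, 78.63, 77.17, 76.39, 78.51, 78.15, 78.99, 77.37]

n = len(gdp)
sum_x = sum(gdp)
sum_y = sum(life)
sum_xy = sum(x * y for x, y in zip(gdp, life))
sum_x2 = sum(x * x for x in gdp)
sum_y2 = sum(y * y for y in life)

# Numerator and denominator of the computational formula for r.
top = n * sum_xy - sum_x * sum_y
bottom_before_sqrt = (n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2)
bottom = sqrt(bottom_before_sqrt)

r = top / bottom
print(f"top = {top:.3f}, bottom = {bottom:.4f}, r = {r:.5f}")
```

Both routes give the same r because the formulas are algebraically equivalent; the raw-sum form is convenient in a spreadsheet, while the standardized form shows more directly what r measures.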

Facts About Correlation: Correlation makes no distinction between explanatory and response variables. Both variables must be quantitative. Correlation does not depend on units. r takes values between -1 and 1. The closer r is to zero, the weaker the linear relationship. r = 1 or r = -1 means the points lie exactly on a straight line.

Facts About Correlation: Correlation describes only linear relationships and is misleading when applied to curved ones. Correlation is NOT resistant: it depends on the means and standard deviations, which outliers can strongly affect. The number r is NOT a slope.
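The last fact can be illustrated with two invented lines of different steepness (the values are assumptions for illustration). Both sets of points lie exactly on a line, so both have r = 1, even though one line rises four times as fast as the other. The helper `pearson_r` is a hypothetical name.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Correlation r from sums of deviation products."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

xs = [1, 2, 3, 4, 5]
steep = [2 * x for x in xs]      # points on a line with slope 2
shallow = [0.5 * x for x in xs]  # points on a line with slope 0.5

r_steep = pearson_r(xs, steep)
r_shallow = pearson_r(xs, shallow)
print(f"slope 2: r = {r_steep:.3f}; slope 0.5: r = {r_shallow:.3f}")
```

r reports how tightly the points cluster around a line, not how steep that line is.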