Lecture 3 – Sep 3. Normal quantile plots are complex to do by hand, but they are standard features in most statistical software. Good fit to a straight.

Slides:



Advertisements
Similar presentations
Section 6.1: Scatterplots and Correlation (Day 1).
Advertisements

Scatterplots and Correlation
CHAPTER 4: Scatterplots and Correlation. Chapter 4 Concepts 2  Explanatory and Response Variables  Displaying Relationships: Scatterplots  Interpreting.
CHAPTER 4: Scatterplots and Correlation
Chapter 41 Describing Relationships: Scatterplots and Correlation.
Looking at data: relationships - Correlation IPS chapter 2.2 Copyright Brigitte Baldi 2005 ©
Looking at data: relationships Scatterplots IPS chapter 2.1 © 2006 W. H. Freeman and Company.
CHAPTER 3 Describing Relationships
Chapter 7 Scatterplots, Association, Correlation Scatterplots and correlation Fitting a straight line to bivariate data © 2006 W. H. Freeman.
LECTURE 2 Understanding Relationships Between 2 Numerical Variables
Regression.
Objectives (BPS chapter 5)
CENTRE FOR INNOVATION, RESEARCH AND COMPETENCE IN THE LEARNING ECONOMY Session 2: Basic techniques for innovation data analysis. Part I: Statistical inferences.
Association between 2 variables We've described the distribution of 1 variable in Chapter 1 - but what if 2 variables are measured on the same individual?
Relationships Scatterplots and correlation BPS chapter 4 © 2006 W.H. Freeman and Company.
Chapter 3: Examining relationships between Data
Scatterplots. Learning Objectives By the end of this lecture, you should be able to: – Describe what a scatterplot is – Be comfortable with the terms.
LECTURE UNIT 7 Understanding Relationships Among Variables Scatterplots and correlation Fitting a straight line to bivariate data.
Looking at data: relationships Scatterplots IPS chapter 2.1 © 2006 W. H. Freeman and Company.
Chapter 6 Scatterplots and Correlation Chapter 7 Objectives Scatterplots  Scatterplots  Explanatory and response variables  Interpreting scatterplots.
IPS Chapter 2 DAL-AC FALL 2015  2.1: Scatterplots  2.2: Correlation  2.3: Least-Squares Regression  2.4: Cautions About Correlation and Regression.
Examining Relationships Scatterplots
Exploring Relationships Between Variables Chapter 7 Scatterplots and Correlation.
Objectives (IPS Chapter 2.1)
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.1 Scatterplots.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.1 Scatterplots.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Objectives 2.1Scatterplots  Scatterplots  Explanatory and response variables  Interpreting scatterplots  Outliers Adapted from authors’ slides © 2012.
Chapter 7 Scatterplots, Association, and Correlation.
Association between 2 variables We've described the distribution of 1 variable - but what if 2 variables are measured on the same individual? Examples?
Relationships Scatterplots and correlation BPS chapter 4 © 2006 W.H. Freeman and Company.
3.2: Linear Correlation Measure the strength of a linear relationship between two variables. As x increases, no definite shift in y: no correlation. As.
April 1 st, Bellringer-April 1 st, 2015 Video Link Worksheet Link
Relationships Scatterplots and correlation BPS chapter 4 © 2006 W.H. Freeman and Company.
Relationships Scatterplots and Correlation.  Explanatory and response variables  Displaying relationships: scatterplots  Interpreting scatterplots.
Notes Chapter 7 Bivariate Data. Relationships between two (or more) variables. The response variable measures an outcome of a study. The explanatory variable.
The Practice of Statistics for Business and Economics Third Edition David S. Moore George P. McCabe Layth C. Alwan Bruce A. Craig William M. Duckworth.
Lecture 8 Sections Objectives: Bivariate and Multivariate Data and Distributions − Scatter Plots − Form, Direction, Strength − Correlation − Properties.
Correlation  We can often see the strength of the relationship between two quantitative variables in a scatterplot, but be careful. The two figures here.
Lecture 4 Chapter 3. Bivariate Associations. Objectives (PSLS Chapter 3) Relationships: Scatterplots and correlation  Bivariate data  Scatterplots (2.
Statistics for Business and Economics Module 2: Regression and time series analysis Spring 2010 Lecture 2: Examining the relationship between two quantitative.
Scatter plots Adapted from 350/
Scatter Plots. Standard: 8.SP.1 I can construct and interpret scatterplots.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 3: Describing Relationships Section 3.1 Scatterplots and Correlation.
3. Relationships Scatterplots and correlation
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
The Practice of Statistics in the Life Sciences Fourth Edition
Chapter 3: Describing Relationships
Chapter 2 Looking at Data— Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Association between 2 variables
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Association between 2 variables
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Scatterplots.
Chapter 3: Describing Relationships
Image from Minitab Website
Presentation transcript:

Lecture 3 – Sep 3

Normal quantile plots are complex to do by hand, but they are standard features in most statistical software. Good fit to a straight line: the distribution of rainwater pH values is close to normal. Curved pattern: the data are not normally distributed. Instead, it shows a right skew: a few individuals have particularly long survival times.

StudentBeersBlood Alcohol Here, we have two quantitative variables for each of 16 students. 1) How many beers they drank, and 2) Their blood alcohol level (BAC) We are interested in the relationship between the two variables: How is one affected by changes in the other one?

StudentBeersBAC Scatterplots In a scatterplot, one axis is used to represent each of the variables, and the data are plotted as points on the graph.

Form and direction of an association Linear Nonlinear No relationship

Positive association: High values of one variable tend to occur together with high values of the other variable. Negative association: High values of one variable tend to occur together with low values of the other variable.

Strength of the association The strength of the relationship between the two variables can be seen by how much variation, or scatter, there is around the main form. With a strong relationship, you can get a pretty good estimate of y if you know x. With a weak relationship, for any x you might get a wide range of y values.

This is a very strong relationship. The daily amount of gas consumed can be predicted quite accurately for a given temperature value. This is a weak relationship. For a particular state median household income, you can’t predict the state per capita income very well.

How to scale a scatterplot Using an inappropriate scale for a scatterplot can give an incorrect impression. Both variables should be given a similar amount of space: Plot roughly square Points should occupy all the plot space (no blank space) Same data in all four plots

Outliers An outlier is a data value that has a very low probability of occurrence (i.e., it is unusual or unexpected). In a scatterplot, outliers are points that fall outside of the overall pattern of the relationship.

Not an outlier: The upper right-hand point here is not an outlier of the relationship—It is what you would expect for this many beers given the linear relationship between beers/weight and blood alcohol. This point is not in line with the others, so it is an outlier of the relationship. Outliers

Categorical variables in scatterplots Often, things are not simple and one-dimensional. We need to group the data into categories to reveal trends. What may look like a positive linear relationship is in fact a series of negative linear associations. Plotting different habitats in different colors allows us to make that important distinction.

Comparison of men and women racing records over time. Each group shows a very strong negative linear relationship that would not be apparent without the gender categorization. Relationship between lean body mass and metabolic rate in men and women. Both men and women follow the same positive linear trend, but women show a stronger association. As a group, males typically have larger values for both variables.

"r" ranges from -1 to +1 "r" quantifies the strength and direction of a linear relationship between 2 quantitative variables. Strength: how closely the points follow a straight line. Direction: is positive when individuals with higher X values tend to have higher values of Y.

Correlations are calculated using means and standard deviations, and thus are NOT resistant to outliers. Influential points Just moving one point away from the general trend here decreases the correlation from to -0.75