Bivariate Data AS 90425 (3 credits) Complete a statistical investigation involving bi-variate data.

Slides:



Advertisements
Similar presentations
Chapter 3 Examining Relationships Lindsey Van Cleave AP Statistics September 24, 2006.
Advertisements

Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Relationships Between Quantitative Variables Chapter 5.
5/17/2015Chapter 41 Scatterplots and Correlation.
Relationships Between Quantitative Variables
Describing the Relation Between Two Variables
Correlation Relationship between Variables. Statistical Relationships What is the difference between correlation and regression? Correlation: measures.
Correlation: Relationship between Variables
Discovering and Describing Relationships
10-2 Correlation A correlation exists between two variables when the values of one are somehow associated with the values of the other in some way. A.
Chapter 7 Scatterplots, Association, and Correlation Stats: modeling the world Second edition Raymond Dahlman IV.
Describing Relationships: Scatterplots and Correlation
Chapter 7 Scatterplots, Association, Correlation Scatterplots and correlation Fitting a straight line to bivariate data © 2006 W. H. Freeman.
Scatter Diagrams and Correlation
Chapter 5 Regression. Chapter 51 u Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). u We.
Chapter 3 Describing Bivariate Data General Objectives: Sometimes the data that are collected consist of observations for two variables on the same experimental.
Chapter 5 Correlation. Suppose we found the age and weight of a sample of 10 adults. Create a scatterplot of the data below. Is there any relationship.
Chapter 3 Correlation. Suppose we found the age and weight of a sample of 10 adults. Create a scatterplot of the data below. Is there any relationship.
Correlation.
Quantitative Data Essential Statistics. Quantitative Data O Review O Quantitative data is any data that produces a measurement or amount of something.
Prior Knowledge Linear and non linear relationships x and y coordinates Linear graphs are straight line graphs Non-linear graphs do not have a straight.
LECTURE UNIT 7 Understanding Relationships Among Variables Scatterplots and correlation Fitting a straight line to bivariate data.
BPS - 3rd Ed. Chapter 41 Scatterplots and Correlation.
M22- Regression & Correlation 1  Department of ISM, University of Alabama, Lesson Objectives  Know what the equation of a straight line is,
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Product moment correlation
4.1 Scatter Diagrams and Correlation. 2 Variables ● In many studies, we measure more than one variable for each individual ● Some examples are  Rainfall.
Example 1: page 161 #5 Example 2: page 160 #1 Explanatory Variable - Response Variable - independent variable dependent variable.
CORRELATION. Bivariate Distribution Observations are taken on two variables Two characteristics are measured on n individuals e.g : The height (x) and.
 Graph of a set of data points  Used to evaluate the correlation between two variables.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Chapter 4 Describing the Relation Between Two Variables 4.1 Scatter Diagrams; Correlation.
Scatterplots are used to investigate and describe the relationship between two numerical variables When constructing a scatterplot it is conventional to.
Chapters 8 & 9 Linear Regression & Regression Wisdom.
Objectives 2.1Scatterplots  Scatterplots  Explanatory and response variables  Interpreting scatterplots  Outliers Adapted from authors’ slides © 2012.
1.1 example these are prices for Internet service packages find the mean, median and mode determine what type of data this is create a suitable frequency.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Bivariate Data analysis. Bivariate Data In this PowerPoint we look at sets of data which contain two variables. Scatter plotsCorrelation OutliersCausation.
Scatter Diagrams and Correlation Variables ● In many studies, we measure more than one variable for each individual ● Some examples are  Rainfall.
Chapter 7 Scatterplots, Association, and Correlation.
What do you see in these scatter plots? Latitude (°S) Mean January Air Temperatures for 30 New Zealand Locations Temperature.
Scatterplots and Correlations
Linear regression Correlation. Suppose we found the age and weight of a sample of 10 adults. Create a scatterplot of the data below. Is there any relationship.
Correlation The apparent relation between two variables.
 Find the Least Squares Regression Line and interpret its slope, y-intercept, and the coefficients of correlation and determination  Justify the regression.
Chapter 141 Describing Relationships: Scatterplots and Correlation.
Chapter 14: Inference for Regression. A brief review of chapter 4... (Regression Analysis: Exploring Association BetweenVariables )  Bi-variate data.
SSF1063: Statistics for Social Sciences
What Do You See?. A scatterplot is a graphic tool used to display the relationship between two quantitative variables. How to Read a Scatterplot A scatterplot.
Scatterplots Association and Correlation Chapter 7.
Chapter 5 Summarizing Bivariate Data Correlation.
GOAL: I CAN USE TECHNOLOGY TO COMPUTE AND INTERPRET THE CORRELATION COEFFICIENT OF A LINEAR FIT. (S-ID.8) Data Analysis Correlation Coefficient.
Bivariate Data AS Complete a statistical investigation involving bi-variate data.
Unit 3 – Association: Contingency, Correlation, and Regression Lesson 3-2 Quantitative Associations.
Scatterplots, Association, and Correlation. Scatterplots are the best way to start observing the relationship and picturing the association between two.
Correlation Correlation measures the strength of the linear association between two quantitative variables Get the correlation coefficient (r) from your.
Quantitative Data Essential Statistics.
Chapter 5 Correlation.
Examining Relationships
Two Quantitative Variables
Bivariate Data.
Suppose the maximum number of hours of study among students in your sample is 6. If you used the equation to predict the test score of a student who studied.
2. Find the equation of line of regression
What do you see in these scatter plots?
Correlation Coefficient
Scatterplots and Correlation
Bivariate Data analysis
Correlation.
Correlation & Trend Lines
Honors Statistics Review Chapters 7 & 8
Solution to Problem 2.25 DS-203 Fall 2007.
Presentation transcript:

Bivariate Data AS (3 credits) Complete a statistical investigation involving bi-variate data

Scatter Plots The scatter plot is the basic tool used to investigate relationships between two quantitative variables. Types of Variables Quantitative (measurements/ counts) Qualitative (groups)

What do I see in these scatter plots? There appears to be a linear trend. There appears to be moderate constant scatter about the trend line. Negative Association. No outliers or groupings visible Latitude (°S) Mean January Air Temperatures for 30 New Zealand Locations Temperature (°C)

What do I see in these scatter plots? There appears to be a non-linear trend. There appears to be non-constant scatter about the trend line. Positive Association. One possible outlier (Large GDP, low % Internet Users). Internet Users (%) % of population who are Internet Users vs GDP per capita for 202 Countries

What do I see in these scatter plots? Two non-linear trends (Male and Female). Very little scatter about the trend lines Negative association until about 1970, then a positive association. Gap in the data collection (Second World War).

What do I look for in scatter plots? Trend Do you see a linear trend… straight line OR a non-linear trend?

What do I look for in scatter plots? Trend Do you see a linear trend… straight line OR a non-linear trend?

What do I look for in scatter plots? Trend Do you see a positive association… as one variable gets bigger, so does the other OR a negative association? as one variable gets bigger, the other gets smaller

What do I look for in scatter plots? Trend Do you see a positive association… as one variable gets bigger, so does the other OR a negative association? as one variable gets bigger, the other gets smaller

What do I look for in scatter plots? Scatter Do you see a strong relationship… little scatter OR a weak relationship? lots of scatter

What do I look for in scatter plots? Scatter Do you see a strong relationship… little scatter OR a weak relationship? lots of scatter

What do I look for in scatter plots? Scatter Do you see constant scatter… roughly the same amount of scatter as you look across the plot or non-constant scatter? the scatter looks like a “fan” or “funnel”

What do I look for in scatter plots? Scatter Do you see constant scatter… roughly the same amount of scatter as you look across the plot or non-constant scatter? the scatter looks like a “fan” or “funnel”

What do I look for in scatter plots? Anything unusual Do you see any outliers? unusually far from the trend any groupings?

What do I look for in scatter plots? Anything unusual Do you see any outliers? unusually far from the trend any groupings?

Rank these relationships from weakest (1) to strongest (4):

Rank these relationships from weakest (1) to strongest (4): How did you make your decisions? The less scatter there is about the trend line, the stronger the relationship is.

Describing scatterplots Trend – linear or non linear Trend – positive or negative Scatter – strong or weak Scatter – constant or non constant Anything unusual – outliers or groupings

Correlation Correlation measures the strength of the linear association between two quantitative variables Get the correlation coefficient (r) from your calculator or computer r has a value between -1 and +1: Correlation has no units

Calculating the coefficient of correlation (r) The coefficient of correlation (r) is given by the following formula: Fortunately you will have Excel do all the work for you!

What can go wrong? Use correlation only if you have two quantitative variables (variables that can be measured) There is an association between gender and weight but there isn’t a correlation between gender and weight! Gender is not quantitative Use correlation only if the relationship is linear Beware of outliers!

Deceptive situations r is a measure of how one variable varies in a linear relation to the other An obvious pattern does not always indicate a high value of the coefficient of correlation (r) A horizontal or vertical trend indicates that there is no relationship, and so r is (close to) zero

Always plot the data before looking at the correlation! r = 0 No linear relationship, but there is a relationship! r = 0.9 No linear relationship, but there is a relationship!

Tick the plots where it would be OK to use a correlation coefficient to describe the strength of the relationship: Position Number Distance (million miles) Distances of Planets from the Sun Reaction Times (seconds) for 30 Year 10 Students Non-dominant Hand Dominant Hand Latitude (°S) Mean January Air Temperatures for 30 New Zealand Locations Temperature (°C) Female ($) Average Weekly Income for Employed New Zealanders in 2001 Male ($)

Tick the plots where it would be OK to use a correlation coefficient to describe the strength of the relationship: Position Number Distance (million miles) Distances of Planets from the Sun Reaction Times (seconds) for 30 Year 10 Students Non-dominant Hand Dominant Hand Latitude (°S) Mean January Air Temperatures for 30 New Zealand Locations Temperature (°C) Female ($) Average Weekly Income for Employed New Zealanders in 2001 Male ($)   Not linear Remove two outliers, nothing happening

What do I see in this scatter plot? Appears to be a linear trend, with a possible outlier (tall person with a small foot size.) Appears to be constant scatter. Positive association Foot size (cm) Height (cm) Height and Foot Size for 30 Year 10 Students

What will happen to the correlation coefficient if the tallest Year 10 student is removed? It will get smaller It won’t change It will get bigger Foot size (cm) Height (cm) Height and Foot Size for 30 Year 10 Students

What will happen to the correlation coefficient if the tallest Year 10 student is removed? It will get bigger Foot size (cm) Height (cm) Height and Foot Size for 30 Year 10 Students

What do I see in this scatter plot? Appears to be a strong linear trend. Outlier in X (the elephant). Appears to be constant scatter. Positive association Gestation (Days) Life Expectancy (Years) Life Expectancies and Gestation Period for a sample of non-human Mammals Elephant

What will happen to the correlation coefficient if the elephant is removed? It will get smaller It won’t change It will get bigger Gestation (Days) Life Expectancy (Years) Life Expectancies and Gestation Period for a sample of non-human Mammals Elephant

What will happen to the correlation coefficient if the elephant is removed? It will get smaller Gestation (Days) Life Expectancy (Years) Life Expectancies and Gestation Period for a sample of non-human Mammals Elephant