Foundation Statistics Copyright Douglas L. Dean, 2015.

Slides:



Advertisements
Similar presentations
Forecasting Using the Simple Linear Regression Model and Correlation
Advertisements

Correlation and Linear Regression.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Bivariate Correlation and Regression CHAPTER Thirteen.
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Correlation and Regression
Chapter 12 Simple Regression
Examining Relationship of Variables  Response (dependent) variable - measures the outcome of a study.  Explanatory (Independent) variable - explains.
Correlation A correlation exists between two variables when one of them is related to the other in some way. A scatterplot is a graph in which the paired.
RESEARCH STATISTICS Jobayer Hossain Larry Holmes, Jr November 6, 2008 Examining Relationship of Variables.
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
1 Simple Linear Regression Linear regression model Prediction Limitation Correlation.
MARE 250 Dr. Jason Turner Correlation & Linear Regression.
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology.
Correlation and Regression. Relationships between variables Example: Suppose that you notice that the more you study for an exam, the better your score.
Multiple Regression Research Methods and Statistics.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Descriptive measures of the strength of a linear association r-squared and the (Pearson) correlation coefficient r.
Linear Regression/Correlation
Aim: How do we calculate and interpret correlation coefficients with SPSS? SPSS Assignment Due Friday 2/12/10.
Correlation & Regression Math 137 Fresno State Burger.
Linear Regression Analysis
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 3 Correlation and Prediction.
Descriptive Methods in Regression and Correlation
Chapter 12 Correlation and Regression Part III: Additional Hypothesis Tests Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social.
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Learning Objective Chapter 14 Correlation and Regression Analysis CHAPTER fourteen Correlation and Regression Analysis Copyright © 2000 by John Wiley &
Jon Curwin and Roger Slater, QUANTITATIVE METHODS: A SHORT COURSE ISBN © Thomson Learning 2004 Jon Curwin and Roger Slater, QUANTITATIVE.
Statistics for Business and Economics 7 th Edition Chapter 11 Simple Regression Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Regression Analysis. Scatter plots Regression analysis requires interval and ratio-level data. To see if your data fits the models of regression, it is.
Run the colour experiment where kids write red, green, yellow …first.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 12-1 Correlation and Regression.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Multiple Linear Regression. Purpose To analyze the relationship between a single dependent variable and several independent variables.
Examining Relationships in Quantitative Research
CHAPTER 3 INTRODUCTORY LINEAR REGRESSION. Introduction  Linear regression is a study on the linear relationship between two variables. This is done by.
Regression Lesson 11. The General Linear Model n Relationship b/n predictor & outcome variables form straight line l Correlation, regression, t-tests,
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 16 Data Analysis: Testing for Associations.
Lecture 10: Correlation and Regression Model.
Examining Relationships in Quantitative Research
Chapter Thirteen Copyright © 2006 John Wiley & Sons, Inc. Bivariate Correlation and Regression.
Chapter 4 Summary Scatter diagrams of data pairs (x, y) are useful in helping us determine visually if there is any relation between x and y values and,
Correlation and Regression: The Need to Knows Correlation is a statistical technique: tells you if scores on variable X are related to scores on variable.
MARE 250 Dr. Jason Turner Linear Regression. Linear regression investigates and models the linear relationship between a response (Y) and predictor(s)
Psy302 Quantitative Methods
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
Outline of Today’s Discussion 1.Introduction to Correlation 2.An Alternative Formula for the Correlation Coefficient 3.Coefficient of Determination.
.  Relationship between two sets of data  The word Correlation is made of Co- (meaning "together"), and Relation  Correlation is Positive when the.
CORRELATION ANALYSIS.
SOCW 671 #11 Correlation and Regression. Uses of Correlation To study the strength of a relationship To study the direction of a relationship Scattergrams.
Descriptive measures of the degree of linear association R-squared and correlation.
Correlation & Linear Regression Using a TI-Nspire.
Chapter 11 Linear Regression and Correlation. Explanatory and Response Variables are Numeric Relationship between the mean of the response variable and.
Chapter 2 Bivariate Data Scatterplots.   A scatterplot, which gives a visual display of the relationship between two variables.   In analysing the.
Regression Analysis.
REGRESSION (R2).
Correlation and Regression
Regression Analysis.
Chapter 11 Simple Regression
Happiness comes not from material wealth but less desire.
CHAPTER fourteen Correlation and Regression Analysis
Linear Regression/Correlation
CORRELATION ANALYSIS.
Merve denizci nazlıgül, M.s.
An Introduction to Correlational Research
Correlation and Prediction
Presentation transcript:

Foundation Statistics Copyright Douglas L. Dean, 2015

Types of Variables 1.Input Variables 1.Might explain the outcome variable 2.Output variable 1.Variable you want to explain or predict 2

Outline 1.One-way ANOVA –Does a category make a significant difference? 2.Bivariate Statistics –Are numeric variables related to each other? 3.Plotting bivariate relationships and lines of best fit 4.Simple linear regression 3

Oneway ANOVA 4

Outlier Thresholds 1.5 x the interquartile range Interquartile range

Annotated Box-whisker Plot

Bivariate descriptive stats Slope (m) Correlation (r) Coefficient of Determination (R 2 ) 7

Slope Every straight line can be represented by an equation: y = mx + b The slope ‘m’ describes both the direction and the steepness of the line. 8

Slope Tree 9

How slope is calculated 10

A positive slope example 11 ( 0,1 ) (3,3) ● ●

Another example 12 (0,1) (3,4) ● ●

Negative Slope example 13 (3,0) (1,4) ● ●

Correlation Coefficient The Pearson Correlation Coefficient (r) is a measure of the strength of the linear relationship between two numeric variables. The value of r ranges from -1 to 1

Correlation Coefficient Stronger Which correlation is stronger? r = -.80 or r =.80? Neither. They are the same strength.

Examples of Perfect Correlation 16

Examples of Strong Correlation 17

Examples of Weak Correlation 18

No Correlation 19 Ways to get r = 0 Pure randomness Perfectly horizontal strait line

Correlation Often the first thing we check to see if a relationship with variable may exist Good place to start, not to finish Is a standardized value –Units of measure are factored out –So you can have one variable on a small scale and the related variable on a large scale. No matter the scales, the value of r will be adjusted to be between negative and positive one. 20

Limitations of Correlation Correlation measures linear association not causality Correlation is only one important measure of a possible linear relationship Correlations lack statistical control for other possible related variables 21

Correlation ≠ Causality The Japanese eat very little fat and drink little red wine and suffer fewer heart attacks than the British or Americans The French eat a lot of fat and drink a lot of red wine and suffer fewer heart attacks than the British or Americans The Germans drink a lot of beer and eat a lot of sausages and fat and suffer fewer heart attacks than the British or Americans. Conclusion: Eat and drink what you like. Apparently it is speaking English that kills you. 22

23

Ambiguities in causality abound… When Y and Z are correlated, direction of causality might be 24 X Z Or X Z

Ambiguities in causality abound… When a correlation exists between Y and Z. Causality might be 25 X Y Z X YZ X Y Z X Y Z

Statistical control The purpose of statistical control is to find the degree of association between two variables after removing the effects of other variables. Correlation lacks statistical control Many variables may exert influence on the variable being predicted. You cannot control for multiple influences with r alone Some forms of statistics and data mining methods give you statistical control 26

Sign (+ or -) of Basic statistics Slope and r can be positive or negative –Slope and r have the same sign ›If one is positive, so is the other ›If one is negative, so is the other R 2 is always positive 27

R-Squared (R 2 ) R 2 is the Coefficient of determination The proportion of the variance in Y attributable to the variance in X (if only one x) or set of X variables if more than one predictor is included in the model. 28

Calculation of R 2 29 If only one predictor: R 2 = r 2, r =.80 R 2 =.80 2 =.64 With multiple input variables - The math to calculate R 2 is more complex - Math beyond the scope of this course.

Why we need more than just slope If we have the slope, why do we need r and R 2 ? –Relationships are rarely perfectly linear in their ability to predict. –Slopes of “best-fit” lines do not give us a measure of variability in how x and y relate to each other Correlation (r) measures the variability in linear association R 2 measures proportion of variance in y attributable to all of the input variables included in the model 30