Correlation and Regression Basic Concepts. An Example We can hypothesize that the value of a house increases as its size increases. Said differently,

Slides:



Advertisements
Similar presentations
9: Examining Relationships in Quantitative Research ESSENTIALS OF MARKETING RESEARCH Hair/Wolfinbarger/Ortinau/Bush.
Advertisements

Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Statistics Measures of Regression and Prediction Intervals.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Bivariate Correlation and Regression CHAPTER Thirteen.
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Overview Correlation Regression -Definition
Introduction to Regression Analysis
LSRL Least Squares Regression Line
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
Regression and Correlation
SIMPLE LINEAR REGRESSION
1 Relationships We have examined how to measure relationships between two categorical variables (chi-square) one categorical variable and one measurement.
Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?
Chapter 8: Bivariate Regression and Correlation
Linear Regression and Correlation
SIMPLE LINEAR REGRESSION
Correlation and regression 1: Correlation Coefficient
STATISTICS: BASICS Aswath Damodaran 1. 2 The role of statistics Aswath Damodaran 2  When you are given lots of data, and especially when that data is.
Analysis of Variance: Some Review and Some New Ideas
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
1 G Lect 10a G Lecture 10a Revisited Example: Okazaki’s inferences from a survey Inferences on correlation Correlation: Power and effect.
Linear Functions 2 Sociology 5811 Lecture 18 Copyright © 2004 by Evan Schofer Do not copy or distribute without permission.
Managerial Economics Demand Estimation. Scatter Diagram Regression Analysis.
Introduction to Linear Regression
Section 5.2: Linear Regression: Fitting a Line to Bivariate Data.
Chapter 20 Linear Regression. What if… We believe that an important relation between two measures exists? For example, we ask 5 people about their salary.
Examining Relationships in Quantitative Research
Roger B. Hammer Assistant Professor Department of Sociology Oregon State University Conducting Social Research Ordinary Least Squares Regression.
10B11PD311 Economics REGRESSION ANALYSIS. 10B11PD311 Economics Regression Techniques and Demand Estimation Some important questions before a firm are.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Chapter 6 Simple Regression Introduction Fundamental questions – Is there a relationship between two random variables and how strong is it? – Can.
Examining Relationships in Quantitative Research
Linear Prediction Correlation can be used to make predictions – Values on X can be used to predict values on Y – Stronger relationships between X and Y.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Psychology 202a Advanced Psychological Statistics October 22, 2015.
Least Squares Regression Lines Text: Chapter 3.3 Unit 4: Notes page 58.
1 Simple Linear Regression and Correlation Least Squares Method The Model Estimating the Coefficients EXAMPLE 1: USED CAR SALES.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Chapter 7 Linear Regression. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable.
OLS Regression What is it? Closely allied with correlation – interested in the strength of the linear relationship between two variables One variable is.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Correlation and Regression Basic Concepts. An Example We can hypothesize that the value of a house increases as its size increases. Said differently,
Describing Bivariate Relationships. Bivariate Relationships When exploring/describing a bivariate (x,y) relationship: Determine the Explanatory and Response.
Bivariate Regression. Bivariate Regression analyzes the relationship between two variables. Bivariate Regression analyzes the relationship between two.
Chapter 3 LSRL. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable Use x to predict.
Chapter 5 LSRL. Bivariate data x – variable: is the independent or explanatory variable y- variable: is the dependent or response variable Use x to predict.
Linear Regression 1 Sociology 5811 Lecture 19 Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Regression Analysis AGEC 784.
Unit 4 LSRL.
LSRL.
Least Squares Regression Line.
Linear Regression and Correlation Analysis
Chapter 5 LSRL.
Ordinary Least Squares (OLS) Regression
Chapter 3.2 LSRL.
Analysis of Variance: Some Review and Some New Ideas
Least Squares Regression Line LSRL Chapter 7-continued
The Weather Turbulence
Introduction to Probability and Statistics Thirteenth Edition
Correlation and Regression-III
Correlation and Regression
Simple Linear Regression
Chapter 5 LSRL.
Chapter 5 LSRL.
Chapter 5 LSRL.
Least-Squares Regression
Correlation and Regression
Presentation transcript:

Correlation and Regression Basic Concepts

An Example We can hypothesize that the value of a house increases as its size increases. Said differently, size and house value “covary” or “co-relate.” Further, we can hypothesize that the relationship is a simple linear one, e.g., that as size increases, house value increases in a similar linear fashion. Hence we can use the simple linear equation, y = a + bx, to describe the relationship

We Ask Two Questions… Is there a relationship and how strong is it? and What is the relationship? We answer the first with a new statistic, a “correlation” coefficient. We answer the second with a linear regression model.

Two Questions We started with Correlation. We continue with Regression.

Terms Independent and Dependent variables Scatterplots Correlation, correlation coefficient, r Regression, regression coefficient, b Regression, regression constant, a Ordinary Least Squares (OLS) equation: y = a + bx + e

Issues Defining relationships –Nature of the relationship: for the moment, linear –Strength of the relationship (using r) –Direction of the relationship (using r and b) –Calculation of the relationship: y = a + bx + e

Some useful websites mlhttp://noppa5.pc.helsinki.fi/koe/corr/cor7.ht ml tmlhttp://davidmlane.com/hyperstat/A60659.h tml by_eye/index.htmlhttp:// by_eye/index.html

Illustration Case A. x= 2.5, y=2 Case B. x=8, y = 7

Linear Trend

What if there are lots of data points?

If there are more data points? How do we summarize the relationships in the data?

Solution: Least Squares Regression, The Best Linear Fit

Some Theory Knowing nothing else, the best estimate of a variable is its mean.

The Regression Model does better… Deviation from y = y i – ymean

A Regression equation… Measures the nature of the relationship between x and y using a linear model Measures the direction of the relationship Accompanying statistics, for the time being, r, measures the strength of the relationship.

Understanding the Improvement, measuring the deviations from the mean

More Terms Yi – the value of a particular case Y mean – mean value of y Y hat – y with a ^ above it soŷ (Yi – Ymean) = total deviation from mean Y (Yhat – Ymean) = explained deviation of Yi from Y mean (Yi – Yhat) = unexplained deviation of Yi from Y mean

Bivariate Regression Relationships are modeled using the equation, y = a + bx + e Translation: The values of an interval level dependent variable, y, can be “predicted” or “modeled” by adding a constant, a, to the product of a slope coefficient, b, times the values of the independent variable, x, and an error term, e.

Estimating the Equation, y = a + bx + e The regression equation is calculated by finding the equation that minimizes the sum of the squared deviations between the data points, the y’s, and the predicted y’s, also called y hat.

Correlation Coefficient: r A measure of the strength of a linear relationship between two interval variables, x and y Ranges from – 1 to + 1 The higher the value of r (e.g., the closer to -1 or + 1, the stronger the relationship between x and y

Correlation Coefficient calculation r = Covariance of x and y divided by the product of the standard deviation of x and the standard deviation of y Covariance is the sum of the products of the deviations of the cases divided by N.

Equations...

Calculating a and b

XY