Regression and Correlation


Regression and Correlation GTECH 201 Lecture 18

ANOVA (Analysis of Variance). Continuation from matched-pair difference-of-means tests, but now for 3+ cases. We still check whether samples come from one or more distinct populations. Variance is a descriptive parameter. ANOVA compares group means and checks whether they differ sufficiently to reject H0.

ANOVA H0 and HA
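The hypotheses themselves are not preserved in this transcript. Assuming the standard one-way ANOVA setup with k groups (the symbols k and the group means are assumptions, not taken from the slide), they would read:

```latex
H_0:\ \mu_1 = \mu_2 = \dots = \mu_k
\qquad
H_A:\ \text{at least one } \mu_i \text{ differs from the others}
```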

ANOVA Test Statistic. The test statistic is the ratio $F = MSB / MSW$, where MSB = between-group mean squares and MSW = within-group mean squares. Between-group variability is calculated in three steps: (1) calculate the overall mean as a weighted average of the sample means; (2) calculate the between-group sum of squares; (3) calculate the between-group mean squares (MSB).

Between-group Variability. Total or overall mean; between-group sum of squares; between-group mean squares.

Within-group Variability. Within-group sum of squares; within-group mean squares.
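The formulas behind these quantities did not survive in the transcript. The block below writes out the usual textbook definitions as a sketch; the notation (k groups, group sizes $n_i$, group means $\bar{X}_i$, group standard deviations $s_i$, overall mean $\bar{X}_T$) is an assumption rather than the lecture's own.

```latex
\begin{aligned}
\bar{X}_T &= \frac{1}{N}\sum_{i=1}^{k} n_i \bar{X}_i, \qquad N = \sum_{i=1}^{k} n_i \\
SS_B &= \sum_{i=1}^{k} n_i\,(\bar{X}_i - \bar{X}_T)^2, \qquad MS_B = \frac{SS_B}{k-1} \\
SS_W &= \sum_{i=1}^{k}\sum_{j=1}^{n_i} (X_{ij} - \bar{X}_i)^2 = \sum_{i=1}^{k} (n_i - 1)\,s_i^2, \qquad MS_W = \frac{SS_W}{N-k} \\
F &= \frac{MS_B}{MS_W}
\end{aligned}
```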

Kruskal-Wallis Test. Nonparametric equivalent of ANOVA; an extension of the Wilcoxon rank sum W test to 3+ cases. The average rank of sample i is $R_i / n_i$. The Kruskal-Wallis H test statistic is then $H = \frac{12}{N(N+1)} \sum_{i=1}^{k} \frac{R_i^2}{n_i} - 3(N+1)$, with $N = n_1 + n_2 + \dots + n_k$ = total number of observations, and $R_i$ = sum of ranks in sample i.
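A minimal sketch of the H statistic in plain Python may help make the ranking step concrete. The sample values are hypothetical (not from the lecture), the function names are illustrative, and no tie correction is applied: tied values simply share their average rank.

```python
def average_ranks(values):
    """Rank values from 1..N, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the run while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        mean_rank = (pos + end) / 2 + 1  # positions are 0-based, ranks 1-based
        for j in range(pos, end + 1):
            ranks[order[j]] = mean_rank
        pos = end + 1
    return ranks


def kruskal_wallis_h(groups):
    """H = 12 / (N (N + 1)) * sum(R_i^2 / n_i) - 3 (N + 1), no tie correction."""
    pooled = [v for g in groups for v in g]
    ranks = average_ranks(pooled)  # ranks are assigned over the pooled sample
    n_total = len(pooled)
    total = 0.0
    start = 0
    for g in groups:
        rank_sum = sum(ranks[start:start + len(g)])  # R_i for this sample
        total += rank_sum ** 2 / len(g)
        start += len(g)
    return 12.0 / (n_total * (n_total + 1)) * total - 3 * (n_total + 1)


# Hypothetical, fully separated samples (illustration only).
samples = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
h_value = kruskal_wallis_h(samples)
print(round(h_value, 3))
```

The resulting H would then be compared with a chi-square critical value with k - 1 degrees of freedom, which is the usual large-sample approximation.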

ANOVA Example. House prices by neighborhood, in thousands of dollars:
A    B    C    D
175  151  127  174
147  183  142  182
138  174  124  210
156  181  150  191
184  193  180  -
148  205  -    -
-    196  -    -

ANOVA Example, continued. Sample statistics:
Group  n   Mean    s
A      6   158.00  17.83
B      7   183.29  17.61
C      5   144.60  22.49
D      4   189.25  15.48
Total  22  168.68  24.85
Now fill in the six steps of the ANOVA calculation.

The Six Steps
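The six steps themselves are not preserved in this transcript. As a rough sketch only, the calculation can be carried through in Python on the house-price data from the example; the variable names and the ordering of the steps are assumptions, not the lecture's own.

```python
# House prices by neighborhood, in thousands of dollars (from the ANOVA example).
prices = {
    "A": [175, 147, 138, 156, 184, 148],
    "B": [151, 183, 174, 181, 193, 205, 196],
    "C": [127, 142, 124, 150, 180],
    "D": [174, 182, 210, 191],
}


def mean(xs):
    return sum(xs) / len(xs)


k = len(prices)                                   # number of groups
n_total = sum(len(g) for g in prices.values())    # total observations N

# Overall mean as the weighted average of the sample means.
grand_mean = sum(len(g) * mean(g) for g in prices.values()) / n_total

# Between-group sum of squares and mean squares.
ssb = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in prices.values())
msb = ssb / (k - 1)

# Within-group sum of squares and mean squares.
ssw = sum(sum((x - mean(g)) ** 2 for x in g) for g in prices.values())
msw = ssw / (n_total - k)

# Test statistic with (k - 1, N - k) degrees of freedom.
f_stat = msb / msw
print(f"MSB={msb:.2f}  MSW={msw:.2f}  F={f_stat:.2f}  df=({k - 1}, {n_total - k})")
```

The F value is then compared with the tabulated critical value for the chosen significance level and these degrees of freedom.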

Correlation. Co-relatedness between 2+ variables: as the values of one variable go up, those of the other change proportionally. Two-step approach: (1) graphically, with a scatterplot; (2) numerically, with correlation coefficients.

Is There a Correlation?

Scatterplots Exploratory analysis

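Pearson’s Correlation Index. Based on the concept of covariance. $S_{xy} = \sum xy$ = covariation between X and Y, where $x = X - \bar{X}$ = deviation of X from its mean and $y = Y - \bar{Y}$ = deviation of Y from its mean. Pearson’s correlation coefficient: $r = S_{xy} / \sqrt{S_x^2 \, S_y^2}$, with $S_x^2 = \sum x^2$ and $S_y^2 = \sum y^2$ the total variation of X and of Y.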
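To make the covariation idea concrete, here is a small sketch that computes Pearson's r from deviations about the means. The data and the function name are hypothetical and chosen only for illustration.

```python
import math


def pearson_r(xs, ys):
    """Pearson's r from deviations of X and Y about their means."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    dx = [x - mean_x for x in xs]          # deviations of X from its mean
    dy = [y - mean_y for y in ys]          # deviations of Y from its mean
    s_xy = sum(a * b for a, b in zip(dx, dy))   # covariation between X and Y
    s_xx = sum(a * a for a in dx)               # total variation of X
    s_yy = sum(b * b for b in dy)               # total variation of Y
    return s_xy / math.sqrt(s_xx * s_yy)


# Hypothetical paired observations (illustration only).
x_obs = [1, 2, 3, 4, 5]
y_obs = [2, 4, 5, 4, 5]
r_value = pearson_r(x_obs, y_obs)
print(round(r_value, 4))
```

Because the same factor 1/(n - 1) would appear in the numerator and the denominator, working with raw sums of deviations gives the same r as working with the covariance and the standard deviations.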

Sample and Population. r is the sample correlation coefficient. Applying the t distribution, we can infer the correlation for the whole population. Test statistic for Pearson’s r: $t = r\sqrt{n-2} / \sqrt{1-r^2}$, with $n - 2$ degrees of freedom.
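A short sketch of the significance test for r, assuming the usual statistic t = r * sqrt(n - 2) / sqrt(1 - r^2) with n - 2 degrees of freedom. The function name and the input values are hypothetical.

```python
import math


def t_for_r(r, n):
    """t statistic for testing whether the population correlation is zero."""
    if n < 3:
        raise ValueError("need at least 3 paired observations")
    if abs(r) >= 1:
        raise ValueError("t is undefined for |r| = 1")
    return r * math.sqrt(n - 2) / math.sqrt(1 - r * r)


# Hypothetical sample: r = 0.6 from n = 18 pairs, so df = 16.
t_value = t_for_r(0.6, 18)
print(round(t_value, 3), "with", 18 - 2, "degrees of freedom")
```

The result is compared with the critical t value for n - 2 degrees of freedom at the chosen significance level.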

Correlation Example Lake effect snow

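Spearman’s Rank Correlation. Non-parametric alternative to Pearson. Logic similar to Kruskal-Wallis and Wilcoxon: it works on ranks rather than raw values. Spearman’s rank correlation coefficient: $r_s = 1 - \frac{6 \sum d^2}{n(n^2 - 1)}$, where d is the difference between the two ranks of each observation and n is the number of pairs.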
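The rank-difference form of Spearman's coefficient can be sketched as follows. The sketch assumes there are no tied values (the shortcut formula is exact only in that case), and the data and function names are hypothetical.

```python
def simple_ranks(values):
    """Ranks 1..n by ascending value; assumes no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0] * len(values)
    for rank, idx in enumerate(order, start=1):
        ranks[idx] = rank
    return ranks


def spearman_rs(xs, ys):
    """r_s = 1 - 6 * sum(d^2) / (n (n^2 - 1)), d = rank difference per pair."""
    n = len(xs)
    rank_x = simple_ranks(xs)
    rank_y = simple_ranks(ys)
    d_squared = sum((rx - ry) ** 2 for rx, ry in zip(rank_x, rank_y))
    return 1 - 6 * d_squared / (n * (n * n - 1))


# Hypothetical paired observations without ties (illustration only).
rs_value = spearman_rs([10, 20, 30, 40, 50], [1, 3, 2, 5, 4])
print(round(rs_value, 3))
```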

Regression. In correlation we observe degrees of association but no causal or functional relationship. In regression analysis, we distinguish an independent from a dependent variable. Functional relationships take many forms: bivariate or multivariate, linear or non-linear (curvilinear).

Graphical Representation. In correlation analysis, either variable could be depicted on either axis. In regression analysis, the independent variable is always on the X axis. The bivariate relationship is described by a best-fitting line through the scatterplot.

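Least-Squares Regression. Objective: minimize the sum of squared vertical deviations of the observed Y values from the fitted line, $\sum (Y - \hat{Y})^2$.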

Regression Equation Y = a + bX
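As an illustration, the least-squares intercept a and slope b of the line Y = a + bX can be computed from deviations about the means, using b = Sxy / Sx2 and a = mean(Y) - b * mean(X). The data and function name below are hypothetical.

```python
def fit_line(xs, ys):
    """Least-squares intercept a and slope b for Y = a + bX."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    s_xx = sum((x - mean_x) ** 2 for x in xs)
    b = s_xy / s_xx              # slope
    a = mean_y - b * mean_x      # intercept: the line passes through the means
    return a, b


# Hypothetical points lying exactly on Y = 1 + 2X (illustration only).
a_hat, b_hat = fit_line([1, 2, 3, 4], [3, 5, 7, 9])
print(a_hat, b_hat)
```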

Strength of Relationship How much is explained by the regression equation?

Coefficient of Determination. Total variation of Y (all the bucket water). Large ‘Y’ = dependent variable; small ‘y’ = deviation of each value of Y from its mean; e = explained; u = unexplained.

Explained Variation. Ratio of the square of the covariation between X and Y to the variation in X, $S_{xy}^2 / S_x^2$, where $S_{xy}$ = covariation between X and Y and $S_x^2$ = total variation of X. Coefficient of determination: $r^2$ = explained variation / total variation of Y.
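A small sketch of how the explained variation and the coefficient of determination fit together, assuming the usual decomposition total = explained + unexplained. The data and names are hypothetical.

```python
def variation_breakdown(xs, ys):
    """Return (explained, total, r_squared) for a bivariate linear regression."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    s_xx = sum((x - mean_x) ** 2 for x in xs)   # total variation of X
    s_yy = sum((y - mean_y) ** 2 for y in ys)   # total variation of Y
    explained = s_xy ** 2 / s_xx                # explained variation
    return explained, s_yy, explained / s_yy


# Hypothetical paired observations (illustration only).
explained, total, r_squared = variation_breakdown([1, 2, 3, 4, 5], [2, 4, 5, 4, 5])
print(f"explained={explained:.2f}  total={total:.2f}  r^2={r_squared:.2f}")
```

The unexplained variation is simply the difference total - explained.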

Error Analysis. $r^2$ tells us what percentage of the variation is accounted for by the independent variable. This allows us to infer the standard error of the estimate, which tells us, on average, how far off our prediction would be, in measurement units.
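One common definition of the standard error of the estimate is the square root of the unexplained variation divided by n - 2. That denominator is an assumption here (some texts divide by n), and the data and names are hypothetical.

```python
import math


def standard_error_of_estimate(xs, ys):
    """sqrt(sum of squared residuals / (n - 2)) for the least-squares line."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    s_xx = sum((x - mean_x) ** 2 for x in xs)
    b = s_xy / s_xx
    a = mean_y - b * mean_x
    # Unexplained variation: squared gaps between observed and predicted Y.
    sse = sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))
    return math.sqrt(sse / (n - 2))


# Hypothetical paired observations (illustration only).
se_value = standard_error_of_estimate([1, 2, 3, 4, 5], [2, 4, 5, 4, 5])
print(round(se_value, 3), "in the units of Y")
```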