Outline of Today’s Discussion 1.Regression Analysis: Introduction 2.An Alternate Formula For Regression 3.Correlation, Regression, and Statistical Significance.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

Structural Equation Modeling
Tests of Significance for Regression & Correlation b* will equal the population parameter of the slope rather thanbecause beta has another meaning with.
Objectives 10.1 Simple linear regression
13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Inference for Regression
Correlation and Regression
© The McGraw-Hill Companies, Inc., 2000 CorrelationandRegression Further Mathematics - CORE.
Statistics for the Social Sciences
Linear Regression and Correlation
SIMPLE LINEAR REGRESSION
Linear Regression and Correlation Analysis
More Simple Linear Regression 1. Variation 2 Remember to calculate the standard deviation of a variable we take each value and subtract off the mean and.
Introduction to Probability and Statistics Linear Regression and Correlation.
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
SIMPLE LINEAR REGRESSION
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Linear Regression and Linear Prediction Predicting the score on one variable.
Simple Linear Regression Analysis
Relationships Among Variables
Correlation & Regression
Correlation and Linear Regression
Lecture 16 Correlation and Coefficient of Correlation
Correlation and Regression
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Chapter 12 Correlation and Regression Part III: Additional Hypothesis Tests Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social.
SIMPLE LINEAR REGRESSION
The Chi-Square Distribution 1. The student will be able to  Perform a Goodness of Fit hypothesis test  Perform a Test of Independence hypothesis test.
Correlation and Regression
Introduction to Linear Regression and Correlation Analysis
Chapter 13: Inference in Regression
Linear Regression and Correlation
Chapter Correlation and Regression 1 of 84 9 © 2012 Pearson Education, Inc. All rights reserved.
Correlation and Regression
Correlation and Regression Used when we are interested in the relationship between two variables. NOT the differences between means or medians of different.
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
Correlation Analysis. Correlation Analysis: Introduction Management questions frequently revolve around the study of relationships between two or more.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Simple Linear Regression. The term linear regression implies that  Y|x is linearly related to x by the population regression equation  Y|x =  +  x.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
MARKETING RESEARCH CHAPTER 18 :Correlation and Regression.
Chapter 10 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 A perfect correlation implies the ability to predict one score from another perfectly.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Practice You collect data from 53 females and find the correlation between candy and depression is Determine if this value is significantly different.
Correlation and Regression: The Need to Knows Correlation is a statistical technique: tells you if scores on variable X are related to scores on variable.
Correlation & Regression Analysis
LESSON 6: REGRESSION 2/21/12 EDUC 502: Introduction to Statistics.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Regression. Outline of Today’s Discussion 1.Coefficient of Determination 2.Regression Analysis: Introduction 3.Regression Analysis: SPSS 4.Regression.
Outline of Today’s Discussion 1.Introduction to Correlation 2.An Alternative Formula for the Correlation Coefficient 3.Coefficient of Determination.
Outline of Today’s Discussion 1.Seeing the big picture in MR: Prediction 2.Starting SPSS on the Different Models: Stepwise versus Hierarchical 3.Interpreting.
SOCW 671 #11 Correlation and Regression. Uses of Correlation To study the strength of a relationship To study the direction of a relationship Scattergrams.
Significance Tests for Regression Analysis. A. Testing the Significance of Regression Models The first important significance test is for the regression.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
Independent Samples ANOVA. Outline of Today’s Discussion 1.Independent Samples ANOVA: A Conceptual Introduction 2.The Equal Variance Assumption 3.Cumulative.
Chapter 13 Linear Regression and Correlation. Our Objectives  Draw a scatter diagram.  Understand and interpret the terms dependent and independent.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Correlation and Regression
Inference about the slope parameter and correlation
Correlation and Simple Linear Regression
Correlation and Regression
CHAPTER 29: Multiple Regression*
CHAPTER 26: Inference for Regression
Correlation and Simple Linear Regression
Correlation and Regression
Simple Linear Regression
Simple Linear Regression and Correlation
Chapter Thirteen McGraw-Hill/Irwin
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

Outline of Today’s Discussion 1.Regression Analysis: Introduction 2.An Alternate Formula For Regression 3.Correlation, Regression, and Statistical Significance 4.Further Intuitions About The Degrees of Freedom

Part 1 Regression Analysis: Introduction

Regression Analysis Introduction There is a close relationship between correlation and regression. A synonym for regression is prediction! Recall that prediction is one of the (“fab”) four goals of the scientific method. What were the others? A significant correlation implies a significant capacity for prediction, i.e., a prediction that is reliably better than chance!

Regression Analysis Introduction In linear regression, the “regression line” is the best-fitting line to the data. Potential Pop Quiz Question: Given that we could fit an infinite number of lines to a data set, how do we determine which is the best- fitting line? Explain in your own words. (Note: The answer is not in your text, it’s in your intuitions!)

Regression Analysis Introduction The equation for a straight line, again, is: y = mx + B or Criterion = (slope * Predictor) + Intercept How many “parameters” in a linear equation? How about a quadratic equation? How about a Gaussian distribution?

Regression Analysis Introduction Slope can also be though of as “rise over run”.

Regression Analysis Introduction The “rise” on the ordinate = Y 2 - Y 1. The “run” on the abscissa = X 2 - X 1.

Regression Analysis Introduction “Rise over run” in pictures.

Regression Analysis Introduction Here, the regression is “linear”… Next week…no such luck!

Regression Analysis Introduction Note: Whenever our correlation is statistically significant, the slope of the best-fitting regression line is said to be significantly different from zero. When the slope of the regression line is significantly different from zero, it is… - a significantly better predictor than chance; - a significantly better predictor than the MEAN! Critical Thinking Question: How does the previous bullet item relate to the philosophical notion of “Ockham’s Razor”?

Regression Analysis Introduction

Once we have the slope, it’s easy to get the y-intercept!

Part 2 An Alternate Formula For Regression

Alternate Regression Formula Predicted Z score on the criterion variable can be found by multiplying Z score on the predictor variable by that standardized regression coefficient –Standardized regression coefficient is the same thing as the correlation –For raw score predictions Change raw score to Z score Make prediction Change back to raw score

Alternate Regression Formula Potential Pop Quiz Question: What would the linear equation be for this alternate regression formula?

Alternate Regression Formula Potential Pop Quiz Question: Why might you prefer to use one of these regression formulas over the other?

Multiple Correlation and Multiple Regression Multiple correlation –Association between criterion variables and two or more predictor variables Multiple regression –Making predictions about criterion variables based on two or more predictor variables –Unlike prediction from one variable, standardized regression coefficient is not the same as the ordinary correlation coefficient

Part 3 Correlation, Regression, & Statistical Significance

Correlation, Regression & Statistical Significance Let’s consider correlation & regression analysis within the context of our earlier work (this semester) on hypothesis testing… In your own words, what is the meaning of the so-called “critical value” for hypothesis testing? Potential Pop Quiz Question: What two factors determine the critical value (i.e., the number to beat) when we engage in hypothesis testing? (The answer is not in your text…but it might be in your critical thinking, or your memory of research methods.)

Correlation, Regression & Statistical Significance DF for Correlation & Regression Here n stands for the number of pairs of scores. Why would this be n-2, rather than the usual n-1?

Correlation, Regression & Statistical Significance In general, the formula for the degrees of freedom is the number of observations minus the number of parameters estimated. For linear correlation, we have one estimate for the mean of X, and another estimate for the mean of Y. For linear regression, we have one estimate for the slope, and another estimate for y intercept.

Correlation, Regression & Statistical Significance What would the DF be for this function? Explain.

Correlation, Regression & Statistical Significance Note: The a slope can be very modestly different from zero, and still be “statistically significant” if all data points fall very close to the line. In correlation and regression, statistical significance is determined by the strength of the correlation between two variables (the r-value), and NOT by the slope of the regression line. (This statement is relevant to regressions involving raw scores, not z-transformed scores) The significance of the r-value, as always, depends on the alpha level, and the df (which is n-2). Take a peak at the r-value table.

Correlation, Regression & Statistical Significance Remember: The regression line (equation) can help us predict one score, given another score, but only if there is a significant r-value. The terminology w/b… “the regression line explains (or accounts for)” 42% of the variability in the scores (if r-squared =.42). To “explain” or “account for” does NOT mean “to cause”. Correlation does not imply causation!

Part 4 Further Intuitions About Degrees of Freedom

Further Intuitions About DF 1.To understand the concept called “degrees of freedom”, we should remember that dfs are used when we are making estimates. 2.When we are using ANOVA, population means are estimated, based on samples. 3.Let’s consider some simple examples…

Further Intuitions About DF 1.Could someone generate a list of three numbers that has a mean equal to 10? 2.Could someone else generate a different list of three numbers having a mean equal to 10? 3.And, one last list of three numbers with a mean equal to 10?

Further Intuitions About DF 1.Now, lets consider how many scores in a set are free to vary, once we’ve estimated the mean to be Would someone now generate any two numbers? (I’ll generate the third number so that the mean is 10.) 3.Would someone now generate any two other numbers? (I’ll generate the third number so that the mean is 10.) 4.One more time, would someone now generate any two other numbers? (I’ll generate the third number so that the mean is 10.)

Further Intuitions About DF 1.Hopefully you see a pattern now. That is, all scores in a set are free to vary…except “the last one”, if the set of scores is to have the mean that we are estimating. 2.In other words, if we claim that we believe that the population mean associated with a particular sample of three observations is, say, 10, then only the first two scores are free to vary (we have 2, or n-1, “degrees of freedom”). 3.The last score is NOT free to vary…it is fixed, if we are to achieve our mean.