Multiple Correlation and Regression

Slides:



Advertisements
Similar presentations
Simple Linear Regression and Correlation by Asst. Prof. Dr. Min Aung.
Advertisements

11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Forecasting Using the Simple Linear Regression Model and Correlation
13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Correlation and regression Dr. Ghada Abo-Zaid
Correlation & Regression Chapter 10. Outline Section 10-1Introduction Section 10-2Scatter Plots Section 10-3Correlation Section 10-4Regression Section.
Describing Relationships Using Correlation and Regression
Correlation and Regression
Correlation and Regression
Correlation and Regression
PSY 307 – Statistics for the Behavioral Sciences
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Linear Regression and Correlation
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
SIMPLE LINEAR REGRESSION
Chapter Topics Types of Regression Models
Linear Regression and Correlation Analysis
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Chapter 9: Correlation and Regression
© McGraw-Hill, Bluman, 5th ed., Chapter 9
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Today Concepts underlying inferential statistics
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
Chapter 7 Forecasting with Simple Regression
Simple Linear Regression Analysis
Correlation and Linear Regression
SIMPLE LINEAR REGRESSION
Correlation and Regression
Marketing Research Aaker, Kumar, Day and Leone Tenth Edition
Introduction to Linear Regression and Correlation Analysis
Linear Regression and Correlation
Correlation and Regression
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Correlation and Regression The Big Picture From Bluman 5 th Edition slides © McGraw Hill.
© The McGraw-Hill Companies, Inc., Chapter 11 Correlation and Regression.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Correlation and Regression
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
Elementary Statistics Correlation and Regression.
Correlation & Regression
Examining Relationships in Quantitative Research
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Unit 10 Correlation and Regression McGraw-Hill, Bluman, 7th ed., Chapter 10 1.
CHAPTER 3 INTRODUCTORY LINEAR REGRESSION. Introduction  Linear regression is a study on the linear relationship between two variables. This is done by.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 16 Data Analysis: Testing for Associations.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Chapter 7 Calculation of Pearson Coefficient of Correlation, r and testing its significance.
Linear Regression and Correlation Chapter GOALS 1. Understand and interpret the terms dependent and independent variable. 2. Calculate and interpret.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
Video Conference 1 AS 2013/2012 Chapters 10 – Correlation and Regression 15 December am – 11 am Puan Hasmawati Binti Hassan
Correlation and Regression Elementary Statistics Larson Farber Chapter 9 Hours of Training Accidents.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Correlation and Regression. O UTLINE Introduction  10-1 Scatter plots.  10-2 Correlation.  10-3 Correlation Coefficient.  10-4 Regression.
CHAPTER 10 & 13 Correlation and Regression
Testing the Difference Between Two Means
10.2 Regression If the value of the correlation coefficient is significant, the next step is to determine the equation of the regression line which is.
Chapter 9 Testing the Difference Between Two Means, Two Proportions, and Two Variances.
Correlation and Regression
Correlation and Regression
CHAPTER 10 Correlation and Regression (Objectives)
Correlation and Regression
Presentation transcript:

Multiple Correlation and Regression Unit 11 Multiple Correlation and Regression McGraw-Hill, Bluman, 7th ed., Chapter 10

Overview Introduction Multiple Regression Bluman, Chapter 10

Objectives Draw a scatter plot for a set of ordered pairs. Compute the correlation coefficient. Test the hypothesis H0: ρ = 0. Compute the equation of the regression line. Compute the coefficient of determination. Compute the standard error of the estimate. Find a prediction interval. Be familiar with the concept of multiple regression. Bluman, Chapter 10

Introduction In addition to hypothesis testing and confidence intervals, inferential statistics involves determining whether a relationship between two or more numerical or quantitative variables exists. Bluman, Chapter 10

Introduction Correlation is a statistical method used to determine whether a linear relationship between variables exists. Regression is a statistical method used to describe the nature of the relationship between variables—that is, positive or negative, linear or nonlinear. Bluman, Chapter 10

Introduction The purpose of this unit is to answer these questions statistically: 1. Are two or more variables related? 2. If so, what is the strength of the relationship? 3. What type of relationship exists? 4. What kind of predictions can be made from the relationship? Bluman, Chapter 10

Introduction 1. Are two or more variables related? 2. If so, what is the strength of the relationship? To answer these two questions, statisticians use the correlation coefficient, a numerical measure to determine whether two or more variables are related and to determine the strength of the relationship between or among the variables. Bluman, Chapter 10

Introduction 3. What type of relationship exists? There are two types of relationships: simple and multiple. In a simple relationship, there are two variables: an independent variable (predictor variable) and a dependent variable (response variable). In a multiple relationship, there are two or more independent variables that are used to predict one dependent variable. Bluman, Chapter 10

Introduction 4. What kind of predictions can be made from the relationship? Predictions are made in all areas and daily. Examples include weather forecasting, stock market analyses, sales predictions, crop predictions, gasoline price predictions, and sports predictions. Some predictions are more accurate than others, due to the strength of the relationship. That is, the stronger the relationship is between variables, the more accurate the prediction is. Bluman, Chapter 10

Correlation The range of the multiple correlation coefficient is from 1 to 1. If there is a strong positive linear relationship between the variables, the value of r will be close to 1. If there is a strong negative linear relationship between the variables, the value of r will be close to 1. Bluman, Chapter 10

Hypothesis Testing In hypothesis testing, one of the following is true: H0:   0 This null hypothesis means that there is no correlation between the x and y variables in the population. H1:   0 This alternative hypothesis means that there is a significant correlation between the variables in the population. Bluman, Chapter 10

Possible Relationships Between Variables When the null hypothesis has been rejected for a specific a value, any of the following five possibilities can exist. There is a direct cause-and-effect relationship between the variables. That is, x causes y. There is a reverse cause-and-effect relationship between the variables. That is, y causes x. The relationship between the variables may be caused by a third variable. There may be a complexity of interrelationships among many variables. The relationship may be coincidental. Bluman, Chapter 10

Possible Relationships Between Variables There is a reverse cause-and-effect relationship between the variables. That is, y causes x. For example, water causes plants to grow poison causes death heat causes ice to melt Bluman, Chapter 10

Possible Relationships Between Variables There is a reverse cause-and-effect relationship between the variables. That is, y causes x. For example, Suppose a researcher believes excessive coffee consumption causes nervousness, but the researcher fails to consider that the reverse situation may occur. That is, it may be that an extremely nervous person craves coffee to calm his or her nerves. Bluman, Chapter 10

Possible Relationships Between Variables The relationship between the variables may be caused by a third variable. For example, If a statistician correlated the number of deaths due to drowning and the number of cans of soft drink consumed daily during the summer, he or she would probably find a significant relationship. However, the soft drink is not necessarily responsible for the deaths, since both variables may be related to heat and humidity. Bluman, Chapter 10

Possible Relationships Between Variables There may be a complexity of interrelationships among many variables. For example, A researcher may find a significant relationship between students’ high school grades and college grades. But there probably are many other variables involved, such as IQ, hours of study, influence of parents, motivation, age, and instructors. Bluman, Chapter 10

Possible Relationships Between Variables The relationship may be coincidental. For example, A researcher may be able to find a significant relationship between the increase in the number of people who are exercising and the increase in the number of people who are committing crimes. But common sense dictates that any relationship between these two values must be due to coincidence. Bluman, Chapter 10

Multiple Regression In multiple regression, there are several independent variables and one dependent variable, and the equation is Bluman, Chapter 10

Assumptions for Multiple Regression 1. normality assumption—for any specific value of the independent variable, the values of the y variable are normally distributed. 2. equal-variance assumption—the variances (or standard deviations) for the y variables are the same for each value of the independent variable. 3. linearity assumption—there is a linear relationship between the dependent variable and the independent variables. 4. nonmulticollinearity assumption—the independent variables are not correlated. 5. independence assumption—the values for the y variables are independent. Bluman, Chapter 10

Multiple Correlation Coefficient In multiple regression, as in simple regression, the strength of the relationship between the independent variables and the dependent variable is measured by a correlation coefficient. This multiple correlation coefficient is symbolized by R. Bluman, Chapter 10

Multiple Correlation Coefficient The formula for R is where ryx1 = correlation coefficient for y and x1 ryx2 = correlation coefficient for y and x2 rx1x2 = correlation coefficient for x1 and x2 Bluman, Chapter 10

Example: State Board Scores A nursing instructor wishes to see whether a student’s grade point average and age are related to the student’s score on the state board nursing examination. She selects five students and obtains the following data. Find the value of R. Bluman, Chapter 10

Example: State Board Scores A nursing instructor wishes to see whether a student’s grade point average and age are related to the student’s score on the state board nursing examination. She selects five students and obtains the following data. Find the value of R. The values of the correlation coefficients are Bluman, Chapter 10

Example: State Board Scores Hence, the correlation between a student’s grade point average and age with the student’s score on the nursing state board examination is 0.989. In this case, there is a strong relationship among the variables; the value of R is close to 1.00. Bluman, Chapter 10

F Test for Significance of R The formula for the F test is where n = the number of data groups k = the number of independent variables. d.f.N. = n – k d.f.D. = n – k – 1 Bluman, Chapter 10

Example: State Board Scores Test the significance of the R obtained in Example 10–15 at α = 0.05. The critical value obtained from Table H with a 0.05, d.f.N. = 3, and d.f.D. = 2 is 19.16. Hence, the decision is to reject the null hypothesis and conclude that there is a significant relationship among the student’s GPA, age, and score on the nursing state board examination. Bluman, Chapter 10

Adjusted R2 The formula for the adjusted R2 is Bluman, Chapter 10

Example: State Board Scores Calculate the adjusted R2 for the data in Example 10–16. The value for R is 0.989. In this case, when the number of data pairs and the number of independent variables are accounted for, the adjusted multiple coefficient of determination is 0.956. Bluman, Chapter 10