Practice You collect data from 53 females and find the correlation between candy and depression is -.40. Determine if this value is significantly different.

Slides:



Advertisements
Similar presentations
Sleeping and Happiness
Advertisements

Managerial Economics in a Global Economy
Kin 304 Regression Linear Regression Least Sum of Squares
Forecasting Using the Simple Linear Regression Model and Correlation
13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Inference for Regression
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Hypothesis test flow chart frequency data Measurement scale number of variables 1 basic χ 2 test (19.5) Table I χ 2 test for independence (19.9) Table.
Objectives (BPS chapter 24)
© The McGraw-Hill Companies, Inc., 2000 CorrelationandRegression Further Mathematics - CORE.
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
Cal State Northridge  320 Andrew Ainsworth PhD Regression.
Multiple Regression Models. The Multiple Regression Model The relationship between one dependent & two or more independent variables is a linear function.
Linear Regression and Correlation
SIMPLE LINEAR REGRESSION
Chapter Topics Types of Regression Models
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
SIMPLE LINEAR REGRESSION
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Simple Linear Regression Analysis
Relationships Among Variables
Lecture 5 Correlation and Regression
Correlation & Regression
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
Basic Statistics. Basics Of Measurement Sampling Distribution of the Mean: The set of all possible means of samples of a given size taken from a population.
Correlation and Regression
Correlation and Regression. The test you choose depends on level of measurement: IndependentDependentTest DichotomousContinuous Independent Samples t-test.
Chapter 6 & 7 Linear Regression & Correlation
BPS - 3rd Ed. Chapter 211 Inference for Regression.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
© The McGraw-Hill Companies, Inc., Chapter 11 Correlation and Regression.
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
Elementary Statistics Correlation and Regression.
Regression Chapter 16. Regression >Builds on Correlation >The difference is a question of prediction versus relation Regression predicts, correlation.
Political Science 30: Political Inquiry. Linear Regression II: Making Sense of Regression Results Interpreting SPSS regression output Coefficients for.
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
Example You give 100 random students a questionnaire designed to measure attitudes toward living in dormitories Scores range from 1 to 7 –(1 = unfavorable;
You can calculate: Central tendency Variability You could graph the data.
With the growth of internet service providers, a researcher decides to examine whether there is a correlation between cost of internet service per.
SPSS SPSS Problem # (7.19) 7.11 (b) You can calculate: Central tendency Variability You could graph the data.
Economics 173 Business Statistics Lecture 10 Fall, 2001 Professor J. Petry
Practice Does drinking milkshakes affect (alpha =.05) your weight? To see if milkshakes affect a persons weight you collected data from 5 sets of twins.
Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
© The McGraw-Hill Companies, Inc., Chapter 10 Correlation and Regression.
OLS Regression What is it? Closely allied with correlation – interested in the strength of the linear relationship between two variables One variable is.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
BPS - 5th Ed. Chapter 231 Inference for Regression.
Practice You recently finished giving 5 Villanova students the MMPI paranoia measure. Determine if Villanova students’ paranoia score is significantly.
Linear Regression Hypothesis testing and Estimation.
Regression and Correlation
Chapter 4 Basic Estimation Techniques
Practice. Practice Practice Practice Practice r = X = 20 X2 = 120 Y = 19 Y2 = 123 XY = 72 N = 4 (4) 72.
10.2 Regression If the value of the correlation coefficient is significant, the next step is to determine the equation of the regression line which is.
Multiple Regression.
Review. Review Statistics Needed Need to find the best place to draw the regression line on a scatter plot Need to quantify the cluster.
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Regression Chapter 8.
Correlation and Regression
You can calculate: Central tendency Variability You could graph the data.
Basic Practice of Statistics - 3rd Edition Inference for Regression
SIMPLE LINEAR REGRESSION
SIMPLE LINEAR REGRESSION
Sleeping and Happiness
Practice Did the type of signal effect response time?
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Practice You collect data from 53 females and find the correlation between candy and depression is Determine if this value is significantly different than zero. You collect data from 53 males and find the correlation between candy and depression is Determine if this value is significantly different than zero.

Practice You collect data from 53 females and find the correlation between candy and depression is –t obs = 3.12 –t crit = 2.00 You collect data from 53 males and find the correlation between candy and depression is –t obs = 4.12 –t crit = 2.00

Practice You collect data from 53 females and find the correlation between candy and depression is You collect data from 53 males and find the correlation between candy and depression is Is the effect of candy significantly different for males and females?

Hypothesis H 1 : the two correlations are different H 0 : the two correlations are not different

Testing Differences Between Correlations Must be independent for this to work

When the population value of r is not zero the distribution of r values gets skewed Easy to fix! Use Fisher’s r transformation Page 746

Testing Differences Between Correlations Must be independent for this to work

Testing Differences Between Correlations

Note: what would the z value be if there was no difference between these two values (i.e., H o was true)

Testing Differences Z = What is the probability of obtaining a Z score of this size or greater, if the difference between these two r values was zero? p =.267 If p is <.025 reject H o and accept H 1 If p is = or >.025 fail to reject H o The two correlations are not significantly different than each other!

Remember this: Statistics Needed Need to find the best place to draw the regression line on a scatter plot Need to quantify the cluster of scores around this regression line (i.e., the correlation coefficient)

Regression allows us to predict!.....

Straight Line Y = mX + b Where: Y and X are variables representing scores m = slope of the line (constant) b = intercept of the line with the Y axis (constant)

Excel Example

That’s nice but.... How do you figure out the best values to use for m and b ? First lets move into the language of regression

Straight Line Y = mX + b Where: Y and X are variables representing scores m = slope of the line (constant) b = intercept of the line with the Y axis (constant)

Regression Equation Y = a + bX Where: Y = value predicted from a particular X value a = point at which the regression line intersects the Y axis b = slope of the regression line X = X value for which you wish to predict a Y value

Practice Y = X What is the slope and the Y-intercept? Determine the value of Y for each X: X = 1, X = 3, X = 5, X = 10

Practice Y = X What is the slope and the Y-intercept? Determine the value of Y for each X: X = 1, X = 3, X = 5, X = 10 Y = -5, Y = -1, Y = 3, Y = 13

Finding a and b Uses the least squares method Minimizes Error Error = Y - Y  (Y - Y) 2 is minimized

.....

..... Error = 1 Error = -1 Error =.5 Error = -.5Error = 0 Error = Y - Y  (Y - Y) 2 is minimized

Finding a and b Ingredients COV xy S x 2 Mean of Y and X

Regression

Ingredients Mean Y =4.6 Mean X = 3 Cov xy = 3.75 S 2 X = 2.50

Regression Ingredients Mean Y =4.6 Mean X = 3 Cov xy = 3.75 S 2 x = 2.50

Regression Ingredients Mean Y =4.6 Mean X = 3 Cov xy = 3.75 S 2 x = 2.50

Regression Equation Y = a + bx Equation for predicting smiling from talking Y = (x)

Regression Equation Y = (x) How many times would a person likely smile if they talked 15 times?

Regression Equation Y = (x) How many times would a person likely smile if they talked 15 times? 22.6 = (15)

Y = (1.5)X.....

Y = (1.5)X X = 1; Y =

Y = (1.5)X X = 5; Y =

Y = (1.5)X

Mean Y = 14.50; S y = 4.43 Mean X = 6.00; S x = 2.16 Quantify the relationship with a correlation and draw a regression line that predicts aggression.

∑XY = 326 ∑Y = 58 ∑X = 24 N = 4

∑XY = 326 ∑Y = 58 ∑X = 24 N = 4

COV = S y = 4.43 S x = 2.16

COV = S y = 4.43 S x = 2.16

Regression Ingredients Mean Y =14.5 Mean X = 6 Cov xy = S 2 X = 4.67

Regression Ingredients Mean Y =14.5 Mean X = 6 Cov xy = S 2 X = 4.67

Regression Equation Y = a + bX Y = (-1.57)X

Y = (-1.57)X

Y = (-1.57)X.

Y = (-1.57)X..

Y = (-1.57)X..

Hypothesis Testing Have learned –How to calculate r as an estimate of relationship between two variables –How to calculate b as a measure of the rate of change of Y as a function of X Next determine if these values are significantly different than 0

Testing b The significance test for r and b are equivalent If X and Y are related (r), then it must be true that Y varies with X (b). Important to learn b significance tests for multiple regression

Steps for testing b value 1) State the hypothesis 2) Find t-critical 3) Calculate b value 4) Calculate t-observed 5) Decision 6) Put answer into words

Practice You are interested in if candy consumption significantly alters a persons depression. Create a graph showing the relationship between candy consumption and depression (note: you must figure out which is X and which is Y)

Practice CandyDepression Charlie555 Augustus743 Veruca459 Mike3108 Violet465

Step 1 H 1 : b is not equal to 0 H 0 : b is equal to zero

Step 2 Calculate df = N - 2 –df = 3 Page 747 –First Column are df –Look at an alpha of.05 with two-tails –t crit = and

Step 3 CandyDepression Charlie555 Augustus743 Veruca459 Mike3108 Violet465 COV = -30.5N = 5 r = -.81Sy = S x = 1.52

Step 3 COV = -30.5N = 5 r = -.81 S x = 1.52 S y = Y = (X) b =

Step 4 Calculate t-observed b = Slope S b = Standard error of slope

Step 4 S yx = Standard error of estimate S x = Standard Deviation of X

Step 4 S y = Standard Deviation of y r = correlation between x and y

Note

..... Error = 1 Error = -1 Error =.5 Error = -.5Error = 0 Error = Y - Y  (Y - Y) 2 is minimized

Step 4 S y = Standard Deviation of y r = correlation between x and y

Step 4 S yx = Standard error of estimate S x = Standard Deviation of X

Step 4 S yx = Standard error of estimate S x = Standard Deviation of X

Step 4 Calculate t-observed b = Slope S b = Standard error of slope

Step 4 Calculate t-observed b = Slope S b = Standard error of slope

Step 4 Note: same value at t-observed for r

Step 5 If t obs falls in the critical region: –Reject H 0, and accept H 1 If t obs does not fall in the critical region: –Fail to reject H 0

t distribution t crit = t crit =

t distribution t crit = t crit =

Step 5 If t obs falls in the critical region: –Reject H 0, and accept H 1 If t obs does not fall in the critical region:If t obs does not fall in the critical region: –Fail to reject H 0

Practice

Page

The regression equation for faculty shows that the best estimate of starting salary for faculty is $15,000 (intercept). For every additional year the salary increases on average by $900 (slope). For administrative staff the best estimate of starting salary is $10,000 (slope), for every additional year the salary increases on average by $1500 (slope). They will be equal at 8.33 years of service.

Practice Page

r =.68r 1 =.829 r =.51r 1 =.563 Z =.797 p =.2119 Correlations are not different from each other

Discuss 9.38

SPSS Problem #3 Due March 15th Page 287 –9.2 –9.3 –9.10 and create a graph by hand