Presentation is loading. Please wait.

Presentation is loading. Please wait.

Chapter 8 – 1 Chapter 8: Bivariate Regression and Correlation Overview The Scatter Diagram Two Examples: Education & Prestige Correlation Coefficient Bivariate.

Similar presentations


Presentation on theme: "Chapter 8 – 1 Chapter 8: Bivariate Regression and Correlation Overview The Scatter Diagram Two Examples: Education & Prestige Correlation Coefficient Bivariate."— Presentation transcript:

1 Chapter 8 – 1 Chapter 8: Bivariate Regression and Correlation Overview The Scatter Diagram Two Examples: Education & Prestige Correlation Coefficient Bivariate Linear Regression Line SPSS Output Interpretation Covariance

2 Chapter 8 – 2 Overview Interval Nominal Dependent Variable Independent Variables Nominal Interval Considers the distribution of one variable across the categories of another variable Considers the difference between the mean of one group on a variable with another group Considers how a change in a variable affects a discrete outcome Considers the degree to which a change in one variable results in a change in another

3 Chapter 8 – 3 Overview Interval Nominal Dependent Variable Independent Variables Nominal Interval Considers the difference between the mean of one group on a variable with another group Considers how a change in a variable affects a discrete outcome Considers the degree to which a change in one variable results in a change in another You already know how to deal with two nominal variables Lambda

4 Chapter 8 – 4 Overview Interval Nominal Dependent Variable Independent Variables Nominal Interval Considers how a change in a variable affects a discrete outcome Considers the degree to which a change in one variable results in a change in another You already know how to deal with two nominal variables Lambda Chi-square Confidence Intervals T-Test We will deal with this later in the course TODAY!

5 Chapter 8 – 5 Overview Interval Nominal Dependent Variable Independent Variables Nominal Interval Considers how a change in a variable affects a discrete outcome Regression Correlation You already know how to deal with two nominal variables Lambda Chi-square Confidence Intervals T-Test We will deal with this later in the course TODAY! What about this cell?

6 Chapter 8 – 6 Overview Interval Nominal Dependent Variable Independent Variables Nominal Interval Logistic Regression Regression Correlation You already know how to deal with two nominal variables Lambda Confidence Intervals T-Test We will deal with this later in the course TODAY! This cell is not covered in this course

7 Chapter 8 – 7 General Examples Does a change in one variable significantly affect another variable? Do two scores tend to co-vary positively (high on one score high on the other, low on one, low on the other)? Do two scores tend to co-vary negatively (high on one score low on the other; low on one, hi on the other)?

8 Chapter 8 – 8 Specific Examples Does getting older significantly influence a person’s political views? Does marital satisfaction increase with length of marriage? How does an additional year of education affect one’s earnings?

9 Chapter 8 – 9 Scatter Diagrams Scatter Diagram (scatterplot)—a visual method used to display a relationship between two interval-ratio variables. Typically, the independent variable is placed on the X-axis (horizontal axis), while the dependent variable is placed on the Y-axis (vertical axis.)

10 Chapter 8 – 10 Scatter Diagram Example % Willing to Pay Higher PricesGNP/Capital United States44.9029.24 Ireland53.3018.71 Netherlands61.2024.78 Norway40.7034.31 Sweden32.6025.58 Russia28.202.66 New Zealand46.1014.60 Canada41.8019.71 Japan53.0032.35 Spain32.5014.10 Latvia21.402.42 Portugal22.1010.67 Chile37.504.99 Switzerland54.7039.98 Finland23.1024.28 Mexico31.603.84

11 Chapter 8 – 11 Scatter Diagram Example

12 Chapter 8 – 12 Scatter Diagram Example

13 Chapter 8 – 13 Linear Relationships Linear relationship – A relationship between two interval-ratio variables in which the observations displayed in a scatter diagram can be approximated with a straight line. Deterministic (perfect) linear relationship – A relationship between two interval-ratio variables in which all the observations (the dots) fall along a straight line. The line provides a predicted value of Y (the vertical axis) for any value of X (the horizontal axis.

14 Chapter 8 – 14 Graph the data below and examine the relationship:

15 Chapter 8 – 15 The Seniority-Salary Relationship

16 Chapter 8 – 16 Example

17 Chapter 8 – 17

18 Chapter 8 – 18 xSocial Alienation 25303540 yReligiosity 109887

19 Chapter 8 – 19 Example: Education & Prestige Does education predict occupational prestige? If so, then the higher the respondent’s level of education, as measured by number of years of schooling, the greater the prestige of the respondent’s occupation. Take a careful look at the scatter diagram on the next slide and see if you think that there exists a relationship between these two variables…

20 Chapter 8 – 20 Scatterplot of Prestige by Education

21 Chapter 8 – 21 Example: Education & Prestige The scatter diagram data can be represented by a straight line, therefore there does exist a relationship between these two variables. In addition, since occupational prestige becomes higher, as years of education increases, we can say also that the relationship is a positive one.

22 Chapter 8 – 22 Linear Regression Technique that finds a line that “ fit ” the scatter of data points in such as ways as to provide for any given value of x the best estimate of the corresponding value of y.

23 Chapter 8 – 23 Equation for a Straight Line Y= a + bX wherea = intercept b = slope Y = dependent variable X = independent variable X Y a rise run rise run = b

24 Chapter 8 – 24 Bivariate Linear Regression Equation Y = a + bX Y-intercept (a)—The point where the regression line crosses the Y-axis, or the value of Y when X=0. Slope (b)—The change in variable Y (the dependent variable) with a unit change in X (the independent variable.) The estimates of a and b will have the property that the sum of the squared differences between the observed and predicted (Y-Y) 2 is minimized using ordinary least squares (OLS). Thus the regression line represents the Best Linear and Unbiased Estimators (BLUE) of the intercept and slope. ˆ ^

25 Chapter 8 – 25 Back to the original scatterplot:

26 Chapter 8 – 26 A Representative Line

27 Chapter 8 – 27

28 Chapter 8 – 28 Ordinary Least Squares Least-squares line (best fitting line) – A line where the errors sum of squares, or e 2, is at a minimum. Least-squares method – The technique that produces the least squares line.

29 Chapter 8 – 29 Least Squares

30 Chapter 8 – 30

31 Chapter 8 – 31 Linear Regression

32 Chapter 8 – 32 Estimating the slope: b The bivariate regression coefficient or the slope of the regression line can be obtained from the observed X and Y scores.

33 Chapter 8 – 33 Estimating the Intercept The regression line always goes through the point corresponding to the mean of both X and Y, by definition. So we utilize this information to solve for a:

34 Chapter 8 – 34 The Least Squares Line!

35 Chapter 8 – 35 Example xSocial Alienation 25303540 yReligiosity 109887

36 Chapter 8 – 36 Example

37 Chapter 8 – 37 If a respondent had zero value of social alienation, this model predicts that his Religiosity scale score would be 14 points. For each additional unit of social alienation, our model predicts a -.1647 point increase in religiosity scale. Y = 14 -.1647(X) ˆ Interpreting the regression equation

38 Chapter 8 – 38 The Regression Equation Prediction Equation: Y = 27.184 +.6.28(X) This line represents the predicted values for Y for any and all values of X ˆ

39 Chapter 8 – 39 Prediction Equation: Y = 27.184 +.628(X) This line represents the predicted values for Y for any and all values of X ˆ The Regression Equation

40 Chapter 8 – 40 Summary: Properties of the Regression Line Represents the predicted values for Y for any and all values of X. Always goes through the point corresponding to the mean of both X and Y. It is the best fitting line in that it minimizes the sum of the squared deviations. Has a slope that can be positive or negative; null hypothesis is that the slope is zero.

41 Chapter 8 – 41 Covariance = Variance of X= Covariance of X and Y—a measure of how X and Y vary together. Covariance will be close to zero when X and Y are unrelated. It will be greater than zero when the relationship is positive and less than zero when the relationship is negative. Variance of X—we have talked a lot about variance in the dependent variable. This is simply the variance for the independent variable Covariance and Variance

42 Chapter 8 – 42 Computational Formula

43 Chapter 8 – 43 Coefficient of Determination Coefficient of Determination (r 2 ) – A PRE measure reflecting the proportional reduction of error that results from using the linear regression model. It reflects the proportion of the total variation in the dependent variable, Y, explained by the independent variable, X.

44 Chapter 8 – 44 Coefficient of Determination

45 Chapter 8 – 45 Coefficient of Determination

46 Chapter 8 – 46 Pearson’s Correlation Coefficient (r) — The square root of r 2. It is a measure of association between two interval- ratio variables. Symmetrical measure—No specification of independent or dependent variables. Ranges from –1.0 to +1.0. The sign (  ) indicates direction. The closer the number is to  1.0 the stronger the association between X and Y. The Correlation Coefficient

47 Chapter 8 – 47 r = 0 means that there is no association between the two variables. The Correlation Coefficient Y X r = 0

48 Chapter 8 – 48 The Correlation Coefficient Y X r = +1 r = 0 means that there is no association between the two variables. r = +1 means a perfect positive correlation.

49 Chapter 8 – 49 The Correlation Coefficient Y X r = –1 r = 0 means that there is no association between the two variables. r = +1 means a perfect positive correlation. r = –1 means a perfect negative correlation.

50 Chapter 8 – 50 SPSS Output: % Willing to Pay Higher Price & GNP/Capital

51 Chapter 8 – 51 Now let’s interpret the SPSS output... SPSS Output: % Willing to Pay Higher Price & GNP/Capital

52 Chapter 8 – 52 Exercise A random sample of 10 elementary school students is selected, and each student is measured on a creativity score (x) using a well-defined testing instrument and on a task score (y) using a new instrument. The task score is the mean time taken to perform several hand-eye coordination tasks.

53 Chapter 8 – 53 Data STUDENTCREATIVITY(X)TASKS(Y) AE284.5 FR353.9 HT373.9 IO506.1 DP694.3 YR848.8 RR517.1 TG457.3 EF313.3 TJ405.2

54 Chapter 8 – 54 Example http://www.math.csusb.edu/faculty/stanton/ m262/regress/regress.html


Download ppt "Chapter 8 – 1 Chapter 8: Bivariate Regression and Correlation Overview The Scatter Diagram Two Examples: Education & Prestige Correlation Coefficient Bivariate."

Similar presentations


Ads by Google