Download presentation
Presentation is loading. Please wait.
1
THE LOGIC OF REGRESSION
2
OUTLINE 1.The Rules of the Game: Interval-Scale Data and PRE (Strength) 2.Understanding the Regression Line (Form) 3.Example: Education and Voter Turnout 4.On the Importance of Visual Inspection (Scattergram)
3
READINGS Pollock, Essentials, review chs. 5-6, read ch. 7 (pp. 154-165) Pollock, SPSS Companion, ch. 8 Course Reader, Selections 3-4 (Ideology and Law, Correlates of Democracy)
4
REGRESSION ANALYSIS: THE BASIC GOALS Taking full advantage of interval-scale data Measuring form, strength, and significance of statistical relationships Specify associations between dependent and independent variables
5
THE RULES OF THE GAME PRE = (E 1 – E 2 )/E 1 1.Guessing Y without knowing X: mean value of Y E 1 = Σ(Y i –Y) 2 2.Guessing Y given knowledge of X: Y i = a + bX i Stipulations: a linear relationship, such that sum of squared deviations of observed values of Y from predicted values is minimal—thus, the line of “least squares”
6
E 1 = sum of squared deviations from the mean E 2 = sum of squared deviations from the regression line PRE = (E 1 – E 2 )/E 1 Which measures the strength of the relationship The regression line—that is, the equation— measures the form of the relationship.
7
Understanding the Regression Line 1.Path of the mean values of Y upon X 2.Estimated “average” values of Y for values of X 3.A line that cuts through the exact middle of the scattergram 4.A very precise statement of the form of a relationship
8
Path of Mean Values of Y for Values of X
9
Scattergram and Least-Squares Line
10
Visualizing Line of Least Squares
11
Variations in Relationships
12
Elements of the Regression Equation
13
Example: % High School Graduates (X) and % Turnout (Y)
14
Regression Equation: High School Graduates and Turnout
15
Estimated turnout = -26.27 +.87 (% graduates) When X is zero, predicted y = -.26.27 Question: Where is X when predicted value of Y = 0? Answer: Around 30.2 (compare to minimal value of X) Slope = +.87 (for every 1 percent increase in high- school graduates, an increase of.87 percent in turnout)
16
What About Wyoming?
17
On the Importance of the Scattergram 1. Visual confirmation of observed relationship 2.Identify patterns in deviations from the line—that is, in patterns among “residual values” 3.This is crucial since different arrays of data can produce identical regression lines (same form, that is, but different strength) 4.Identification of “outliers” (extreme cases)
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.