Agenda Review Association for Nominal/Ordinal Data –  2 Based Measures, PRE measures Introduce Association Measures for I-R data –Regression, Pearson’s.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

Regression, Correlation. Research Theoretical empirical Usually combination of the two.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Association for Interval Level Variables
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Correlation and Regression
Sociology 601 Class 17: October 28, 2009 Review (linear regression) –new terms and concepts –assumptions –reading regression computer outputs Correlation.
Overview Correlation Regression -Definition
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
9. SIMPLE LINEAR REGESSION AND CORRELATION
PPA 501 – Analytical Methods in Administration Lecture 8 – Linear Regression and Correlation.
PPA 415 – Research Methods in Public Administration
Correlation and Regression Analysis
Chapter 13 Introduction to Linear Regression and Correlation Analysis
Basic Statistical Concepts
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
Elaboration Elaboration extends our knowledge about an association to see if it continues or changes under different situations, that is, when you introduce.
Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology.
Correlation and Regression. Relationships between variables Example: Suppose that you notice that the more you study for an exam, the better your score.
Multiple Regression Research Methods and Statistics.
Leon-Guerrero and Frankfort-Nachmias,
Week 9: Chapter 15, 17 (and 16) Association Between Variables Measured at the Interval-Ratio Level The Procedure in Steps.
Review Regression and Pearson’s R SPSS Demo
Regression, Residuals, and Coefficient of Determination Section 3.2.
Week 11 Chapter 12 – Association between variables measured at the nominal level.
Correlation and Linear Regression
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Chapter 8: Bivariate Regression and Correlation
Linear Regression and Correlation
Week 12 Chapter 13 – Association between variables measured at the ordinal level & Chapter 14: Association Between Variables Measured at the Interval-Ratio.
Association between Variables Measured at the Nominal Level.
Linear Regression and Correlation
ASSOCIATION BETWEEN INTERVAL-RATIO VARIABLES
Chapter 6 & 7 Linear Regression & Correlation
Chapter 12 Examining Relationships in Quantitative Research Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Chapter 8 – 1 Chapter 8: Bivariate Regression and Correlation Overview The Scatter Diagram Two Examples: Education & Prestige Correlation Coefficient Bivariate.
Statistics in Applied Science and Technology Chapter 13, Correlation and Regression Part I, Correlation (Measure of Association)
Introduction to Linear Regression
Section 5.2: Linear Regression: Fitting a Line to Bivariate Data.
Examining Relationships in Quantitative Research
Agenda Review Homework 5 Review Statistical Control Do Homework 6 (In-class group style)
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 16 Data Analysis: Testing for Associations.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Correlation & Regression Correlation does not specify which variable is the IV & which is the DV.  Simply states that two variables are correlated. Hr:There.
Examining Relationships in Quantitative Research
Chapter Thirteen Copyright © 2006 John Wiley & Sons, Inc. Bivariate Correlation and Regression.
Creating a Residual Plot and Investigating the Correlation Coefficient.
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.3 Predicting the Outcome.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
LESSON 6: REGRESSION 2/21/12 EDUC 502: Introduction to Statistics.
Advanced Statistical Methods: Continuous Variables REVIEW Dr. Irina Tomescu-Dubrow.
Linear Regression and Correlation Chapter GOALS 1. Understand and interpret the terms dependent and independent variable. 2. Calculate and interpret.
Copyright © 2014 by Nelson Education Limited Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Chapters 8 Linear Regression. Correlation and Regression Correlation = linear relationship between two variables. Summarize relationship with line. Called.
BIVARIATE/MULTIVARIATE DESCRIPTIVE STATISTICS Displaying and analyzing the relationship between continuous variables.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
Chapter 15 Association Between Variables Measured at the Interval-Ratio Level.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Chapter 13 Linear Regression and Correlation. Our Objectives  Draw a scatter diagram.  Understand and interpret the terms dependent and independent.
Simple Bivariate Regression
Final Project Reminder
Final Project Reminder
ASSOCIATION Practical significance vs. statistical significance
Multiple Regression.
Correlation and Regression
Presentation transcript:

Agenda Review Association for Nominal/Ordinal Data –  2 Based Measures, PRE measures Introduce Association Measures for I-R data –Regression, Pearson’s r, R 2

Why measure of association? What do “significant” tests tell us? What is the point of calculating measures of association? –Which do you calculate first (significant tests or association measures) and why?

 2 Based Measures How does  2 tap into association? –Indicates how different our findings from what is expected under null Since null = not related, high  2 suggests stronger relationship –Why not just use  2 ? Phi Cramer’s V

PRE Measures  2 based measures have no intuitive meaning PRE measures = proportionate reduction in error –Does knowing someone’s value on the independent variable (e.g., sex) improve our prediction of their score/value on the dependent variable (whether or not criminal). Lambda Gamma

ORDINAL MEASURE OF ASSOCIATION –GAMMA For examining STRENGTH & DIRECTION of “collapsed” ordinal variables (<6 categories) Like Lambda, a PRE-based measure –Range is -1.0 to +1.0

ASSOCIATION BETWEEN INTERVAL-RATIO VARIABLES

Scattergrams Allow quick identification of important features of relationship between interval-ratio variables Two dimensions: –Scores of the independent (X) variable (horizontal axis) –Scores of the dependent (Y) variable (vertical axis)

3 Purposes of Scattergrams 1.To give a rough idea about the existence, strength & direction of a relationship The direction of the relationship can be detected by the angle of the regression line 2. To give a rough idea about whether a relationship between 2 variables is linear (defined with a straight line) 3.To predict scores of cases on one variable (Y) from the score on the other (X)

IV and DV? What is the direction of this relationship?

IV and DV? What is the direction of this relationship?

The Regression line Properties: 1.The sum of positive and negative vertical distances from it is zero 2.The standard deviation of the points from the line is at a minimum 3.The line passes through the point (mean x, mean y) Bivariate Regression Applet

Regression Line Formula Y = a + bX Y = score on the dependent variable X = the score on the independent variable a = the Y intercept – point where the regression line crosses the Y axis b = the slope of the regression line –SLOPE – the amount of change produced in Y by a unit change in X; or, –a measure of the effect of the X variable on the Y

Regression Line Formula Y = a + bX y-intercept (a) = 102 slope (b) =.9 Y = (.9)X This information can be used to predict weight from height. Example: What is the predicted weight of a male who is 70” tall (5’10”)? –Y = (.9)(70) = = 165 pounds

Example 2: Examining the link between # hours of daily TV watching (X) & # of cans of soda consumed per day (Y) Case # Hours TV/ Day (X) Cans Soda Per Day (Y)

Example 2 Example 2: Examining the link between # hours of daily TV watching (X) & # of cans of soda consumed per day. (Y) The regression line for this problem: –Y = x If a person watches 3 hours of TV per day, how many cans of soda would he be expected to consume according to the regression equation? y = (3) = 3.67

The Slope (b) – A Strength & A Weakness –We know that b indicates the change in Y for a unit change in X, but b is not really a good measure of strength –Weakness –It is unbounded (can be >1 or <-1) making it hard to interpret The size of b is influenced by the scale that each variable is measured on

Pearson’s r Correlation Coefficient By contrast, Pearson’s r is bounded –a value of 0.0 indicates no linear relationship and a value of +/-1.00 indicates a perfect linear relationship

Pearson’s r Y = x s x = 1.51 s y = 2.24 Converting the slope to a Pearson’s r correlation coefficient: –Formula: r = b(s x /s y ) r =.99 (1.51/2.24) r =.67

The Coefficient of Determination The interpretation of Pearson’s r (like Cramer’s V) is not straightforward –What is a “strong” or “weak” correlation? »Subjective The coefficient of determination (r 2 ) is a more direct way to interpret the association between 2 variables r 2 represents the amount of variation in Y explained by X You can interpret r 2 with PRE logic: 1.predict Y while ignoring info. supplied by X 2.then account for X when predicting Y

Coefficient of Determination: Example Without info about X (hours of daily TV watching), the best predictor we have is the mean # of cans of soda consumed (mean of Y) The green line (the slope) is what we would predict WITH info about X

Coefficient of Determination Conceptually, the formula for r 2 is: r 2 = Explained variation Total variation “The proportion of the total variation in Y that is attributable or explained by X.” The variation not explained by r 2 is called the unexplained variation –Usually attributed to measurement error, random chance, or some combination of other variables

Coefficient of Determination –Interpreting the meaning of the coefficient of determination in the example: Squaring Pearson’s r (.67) gives us an r 2 of.45 Interpretation: –The # of hours of daily TV watching (X) explains 45% of the total variation in soda consumed (Y)

Another Example: Relationship between Mobility Rate (x) & Divorce rate (y) The formula for this regression line is: Y = (.17)X –1) What is this slope telling you? –2) Using this formula, if the mobility rate for a given state was 45, what would you predict the divorce rate to be? –3) The standard deviation (s) for x=6.57 & the s for y=1.29. Use this info to calculate Pearson’s r. How would you interpret this correlation? –4) Calculate & interpret the coefficient of determination (r 2 )

Another Example: Relationship between Mobility Rate (x) & Divorce rate (y) The formula for this regression line is: Y = (.17)X –1) What is this slope telling you? For every one unit increase in x (mobility rate), divorce rate (y) goes up.17 –2) Using this formula, if the mobility rate for a given state was 45, what would you predict the divorce rate to be? Y = (.17) 45 = 5.15 –3) The standard deviation (s) for x=6.57 & the s for y=1.29. Use this info to calculate Pearson’s r. How would you interpret this correlation? r =.17 (6.57/1.29) =.17(5.093) =.866 –There is a strong positive association between mobility rate & divorce rate. –4) Calculate & interpret the coefficient of determination (r 2 ) r 2 = (.866) 2 =.75 –A state’s mobility rate explains 75% of the variation in its divorce rate.