Regression & Correlation. Review: Types of Variables & Steps in Analysis.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

13- 1 Chapter Thirteen McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Statistical Analysis Regression & Correlation Psyc 250 Winter, 2013.
Statistical Analyses: Chi-square test Psych 250 Winter 2013.
Statistical Tests Karen H. Hagglund, M.S.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
QUANTITATIVE DATA ANALYSIS
REGRESSION Want to predict one variable (say Y) using the other variable (say X) GOAL: Set up an equation connecting X and Y. Linear regression linear.
Bivariate Regression CJ 526 Statistical Analysis in Criminal Justice.
Correlation Patterns. Correlation Coefficient A statistical measure of the covariation or association between two variables. Are dollar sales.
1 Review of Correlation A correlation coefficient measures the strength of a linear relation between two measurement variables. The measure is based on.
Psychometrics & Statistical Concepts PSYC 101 Dr. Gregg Fall, 2006.
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
Introduction to Probability and Statistics Linear Regression and Correlation.
Social Research Methods
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Correlation Coefficients Pearson’s Product Moment Correlation Coefficient  interval or ratio data only What about ordinal data?
15: Linear Regression Expected change in Y per unit X.
Descriptive measures of the strength of a linear association r-squared and the (Pearson) correlation coefficient r.
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
Statistics & SPSS Review Fall Types of Measures / Variables Nominal / categorical – Gender, major, blood type, eye color Ordinal – Rank-order of.
Example of Simple and Multiple Regression
Statistical Analyses t-tests Psych 250 Winter, 2013.
Chapter 12 Correlation and Regression Part III: Additional Hypothesis Tests Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social.
February  Study & Abstract StudyAbstract  Graphic presentation of data. Graphic presentation of data.  Statistical Analyses Statistical Analyses.
Introduction to Linear Regression and Correlation Analysis
Selecting the Correct Statistical Test
7.1 - Motivation Motivation Correlation / Simple Linear Regression Correlation / Simple Linear Regression Extensions of Simple.
Chapter 15 Correlation and Regression
Learning Objective Chapter 14 Correlation and Regression Analysis CHAPTER fourteen Correlation and Regression Analysis Copyright © 2000 by John Wiley &
Regression. Correlation and regression are closely related in use and in math. Correlation summarizes the relations b/t 2 variables. Regression is used.
Simple Linear Regression One reason for assessing correlation is to identify a variable that could be used to predict another variable If that is your.
Correlation Patterns.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Statistical Analysis Regression & Correlation Psyc 250 Winter, 2008.
Elementary Statistics Correlation and Regression.
Regression Models Residuals and Diagnosing the Quality of a Model.
Regression Lesson 11. The General Linear Model n Relationship b/n predictor & outcome variables form straight line l Correlation, regression, t-tests,
PS 225 Lecture 20 Linear Regression Equation and Prediction.
Chapter 16 Data Analysis: Testing for Associations.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Chapter Thirteen Copyright © 2006 John Wiley & Sons, Inc. Bivariate Correlation and Regression.
Environmental Modeling Basic Testing Methods - Statistics III.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Commonly Used Statistics in the Social Sciences Chi-square Correlation Multiple Regression T-tests ANOVAs.
Statistical Analysis. Z-scores A z-score = how many standard deviations a score is from the mean (-/+) Z-scores thus allow us to transform the mean to.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
Nonparametric Statistics
Topics, Summer 2008 Day 1. Introduction Day 2. Samples and populations Day 3. Evaluating relationships Scatterplots and correlation Day 4. Regression and.
Correlation and Regression Elementary Statistics Larson Farber Chapter 9 Hours of Training Accidents.
ScWk 298 Quantitative Review Session
Nonparametric Statistics
Regression Must have Interval or Ratio scaled measures
Chapter 20 Linear and Multiple Regression
REGRESSION G&W p
Chapter 12 Simple Linear Regression and Correlation
Introduction to Regression Analysis
REGRESSION (R2).
CHAPTER 7 Linear Correlation & Regression Methods
Dr. Siti Nor Binti Yaacob
Residuals and Diagnosing the Quality of a Model
Nonparametric Statistics
Chapter 12 Simple Linear Regression and Correlation
Correlation and Regression
15.1 The Role of Statistics in the Research Process
Presentation transcript:

Regression & Correlation

Review: Types of Variables & Steps in Analysis

Types of Variables Nominal / Categorical: each value is distinct category [gender, blood type, city] Scale / Interval: linear measure, same interval between each value [age, weight, IQ, GPA, SAT, income] Ordinal: ranking, un-equal intervals between values [Likert scale, preference ranking]

Variables & Statistical Tests Variable TypeExampleCommon Stat Method Nominal by nominal Blood type by gender Chi-square Scale by nominalGPA by gender GPA by major T-test Analysis of Variance Scale by scaleWeight by height GPA by SAT Regression Correlation

Evaluating an hypothesis Step 1: What is the relationship in the sample? Step 2: How confidently can one generalize from the sample to the universe from which it comes?

Evaluating an hypothesis Relationship in Sample Statistical Significance 2 ordinal vars.Cross-tab / contingency table “p value” from Chi Square Scale dep. & 2-cat indep. Means for each category “p value” from t- test Scale dep. & 3+ cat indep. Means for each category “p value” from ANOVA 2 scale vars.Regression line Correlation “p value” from reg or correlation

Relationships between Scale Variables Regression Correlation

Regression Amount that a dependent variable increases (or decreases) for each unit increase in an independent variable. Expressed as equation for a line – y = m(x) + b – the “regression line” Interpret by slope of the line: m (Or: interpret by “odds ratio” in “logistic regression”)

Correlation Strength of association of scale measures r = -1 to 0 to perfect positive correlation -1 perfect negative correlation 0 no correlation Interpret r in terms of variance

Mean & Variance

Survey of Class n = 42 Height Mother’s height Mother’s education SAT Estimate IQ Well-being (7 pt. Likert) Weight Father’s education Family income G.P.A. Health (7 pt. Likert)

Frequency Table for:HEIGHT Valid Cum Value Label Value Frequency Percent Percent Percent Total Valid cases 42 Missing cases 0

Frequency Table for:HEIGHT Valid Cum Value Label Value Frequency Percent Percent Percent Total Valid cases 42 Missing cases 0 Descriptive Statistics for:HEIGHT Valid Variable Mean Std Dev Variance Range Minimum Maximum N HEIGHT mean

Variance   x i - Mean ) 2 Variance = s 2 = N Standard Deviation = s =  variance

Frequency Table for:WEIGHT Valid Cum Value Label Value Frequency Percent Percent Percent Total Valid cases 42 Missing cases 0 Descriptive Statistics for:WEIGHT Valid Variable Mean Std Dev Variance Range Minimum Maximum N WEIGHT mean

Relationship of weight & height: Regression Analysis

“Least Squares” Regression Line Dependent = ( B ) (Independent) + constant weight = ( B ) ( height ) + constant

Regression line

Regression:WEIGHTonHEIGHT Multiple R R Square Adjusted R Square Standard Error Analysis of Variance DF Sum of Squares Mean Square Regression Residual F = Signif F = Variables in the Equation Variable B SE B Beta T Sig T HEIGHT (Constant) [ Equation:Weight = 3.3 ( height ) - 73 ]

Regression line W = 3.3 H - 73

Strength of Relationship “Goodness of Fit”: Correlation How well does the regression line “fit” the data?

Frequency Table for:WEIGHT Valid Cum Value Label Value Frequency Percent Percent Percent Total Valid cases 42 Missing cases 0 Descriptive Statistics for:WEIGHT Valid Variable Mean Std Dev Variance Range Minimum Maximum N WEIGHT mean

Variance = 454

Regression line mean

Correlation: “Goodness of Fit” Variance (average sum of squared distances from mean) = 454 “Least squares” (average sum of squared distances from regression line) = – 295 = / 454 =.35 Variance is reduced 35% by calculating from regression line

r 2 = % of variance in WEIGHT “explained” by HEIGHT Correlation coefficient = r

Correlation:HEIGHTwith WEIGHT HEIGHT WEIGHT HEIGHT ( 42) ( 42) P=. P=.000 WEIGHT ( 42) ( 42) P=.000 P=.

r =.59 r 2 =.35 HEIGHT “explains” 35% of variance in WEIGHT

Sentence & G.P.A. Regression: form of relationship Correlation: strength of relationship p value: statistical significance

G. P. A.

Length of Sentence

Scatterplot: Sentence on G.P.A.

Regression Coefficients Sentence = -3.5 G.P.A. + 18

Sent = -3.5 GPA + 18 “Least Squares” Regression Line

Correlation: Sentence & G.P.A.

Interpreting Correlations r = -22p =.31 r 2 =.05 G.P.A. “explains” 5% of the variance in length of sentence