Psych 230 Psychological Measurement and Statistics Pedro Wolf September 30, 2009.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

Linear Regression.  The following is a scatterplot of total fat versus protein for 30 items on the Burger King menu:  The model won’t be perfect, regardless.
Review ? ? ? I am examining differences in the mean between groups
Regression Greg C Elvers.
Statistics Measures of Regression and Prediction Intervals.
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
CHAPTER 8: LINEAR REGRESSION
Scatter Diagrams and Linear Correlation
Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #18.
Lecture 11 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
Basic Statistical Concepts Psych 231: Research Methods in Psychology.
The Simple Regression Model
Basic Statistical Concepts
Statistics Psych 231: Research Methods in Psychology.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 6: Correlation.
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
Introduction to Probability and Statistics Linear Regression and Correlation.
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Correlation and Regression. Relationships between variables Example: Suppose that you notice that the more you study for an exam, the better your score.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Linear Regression and Linear Prediction Predicting the score on one variable.
1 Relationships We have examined how to measure relationships between two categorical variables (chi-square) one categorical variable and one measurement.
Week 9: Chapter 15, 17 (and 16) Association Between Variables Measured at the Interval-Ratio Level The Procedure in Steps.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Correlation and Regression A BRIEF overview Correlation Coefficients l Continuous IV & DV l or dichotomous variables (code as 0-1) n mean interpreted.
Chapter 8: Bivariate Regression and Correlation
Lecture 15 Basics of Regression Analysis
Chapter 5 Regression. Chapter outline The least-squares regression line Facts about least-squares regression Residuals Influential observations Cautions.
Linear Regression and Correlation
Correlation and regression 1: Correlation Coefficient
Section #6 November 13 th 2009 Regression. First, Review Scatter Plots A scatter plot (x, y) x y A scatter plot is a graph of the ordered pairs (x, y)
Correlation and Regression
Correlation and Regression. The test you choose depends on level of measurement: IndependentDependentTest DichotomousContinuous Independent Samples t-test.
September In Chapter 14: 14.1 Data 14.2 Scatterplots 14.3 Correlation 14.4 Regression.
Linear Functions 2 Sociology 5811 Lecture 18 Copyright © 2004 by Evan Schofer Do not copy or distribute without permission.
Regression Analysis. Scatter plots Regression analysis requires interval and ratio-level data. To see if your data fits the models of regression, it is.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Correlation and Regression PS397 Testing and Measurement January 16, 2007 Thanh-Thanh Tieu.
Chapter 20 Linear Regression. What if… We believe that an important relation between two measures exists? For example, we ask 5 people about their salary.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Part IV Significantly Different Using Inferential Statistics Chapter 15 Using Linear Regression Predicting Who’ll Win the Super Bowl.
Part IV Significantly Different: Using Inferential Statistics
Scatterplot and trendline. Scatterplot Scatterplot explores the relationship between two quantitative variables. Example:
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Political Science 30: Political Inquiry. Linear Regression II: Making Sense of Regression Results Interpreting SPSS regression output Coefficients for.
Chapter 6 (cont.) Difference Estimation. Recall the Regression Estimation Procedure 2.
Warm-Up Write the equation of each line. A B (1,2) and (-3, 7)
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
2.5 Using Linear Models A scatter plot is a graph that relates two sets of data by plotting the data as ordered pairs. You can use a scatter plot to determine.
Chapter 14: Inference for Regression. A brief review of chapter 4... (Regression Analysis: Exploring Association BetweenVariables )  Bi-variate data.
Correlation They go together like salt and pepper… like oil and vinegar… like bread and butter… etc.
Statistics: Analyzing 2 Quantitative Variables MIDDLE SCHOOL LEVEL  Session #2  Presented by: Dr. Del Ferster.
LESSON 6: REGRESSION 2/21/12 EDUC 502: Introduction to Statistics.
Regression. Outline of Today’s Discussion 1.Coefficient of Determination 2.Regression Analysis: Introduction 3.Regression Analysis: SPSS 4.Regression.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
CHAPTER 5: Regression ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Correlation & Linear Regression Using a TI-Nspire.
Regression Analysis.
Regression Analysis AGEC 784.
Practice. Practice Practice Practice Practice r = X = 20 X2 = 120 Y = 19 Y2 = 123 XY = 72 N = 4 (4) 72.
Simple Linear Regression
Correlation and Regression
Review I am examining differences in the mean between groups How many independent variables? OneMore than one How many groups? Two More than two ?? ?
Honors Statistics Review Chapters 7 & 8
Presentation transcript:

Psych 230 Psychological Measurement and Statistics Pedro Wolf September 30, 2009

Homework Question 2 a.

Homework 2 b. = c. r 2 =.93. Ninety three percent of the variance in test y can be predicted in test x. 2d. Yes the large correlation indicates that one can predict scores on test Y with scores on test X.

Homework 6 a – This statement implies a cause and effect relation. – The correlation by itself does not imply this. – It may be that people with more education have more access to mental healthcare and therefore have the opportunity to use it.

Homework 6b. – Again this statement implies a cause and effect relation. – There could be other factors. Children who play an instrument may have parents who stress the value of practice and education 6c – This statement is false. – A negative correlation means there is a relationship

Homework 8 a. Test 4, r=.633: r 2 =.4007 b x 2= √.8014; r=.8952 c. They ask the same type of question.

Last Time…. Correlation r value indicates – Direction of relationship between two variables – Strength of relationship between two variables

Today…. Regression

Correlation tells us about the strength of the relationship between 2 variables It does not let us predict We can use linear regression to do this

Correlation When you run a correlation you convert everything to z scores r = (ΣZxZy) / N

Regression We build on correlation by adding a “line of best fit” to the data The previous plot was on a stand- ardized scale Any known X score lets us predict the Y score

Line of best fit Remember this from high school? Y = mX + b We use: Y = α + b y (X)+ error Where b y is the slope of the line a is the Y intercept (where the line hits the y- axis) error is the unexplained variance

Slope Slope (b y ) is the angle of the line Change in Y / Change in X The more Y changes for every unit change of X, the steeper the slope

Y-intercept This is where the line crosses the Y axis When X = 0, the value of Y is the intercept

Line of best fit The resulting line comes as close as possible to the existing data points

Determining the Regression Line The following is the formula for determining the slope For the intercept

Y prime The line formula gives us the value of Y we would predict if given X We write this as Y’ We have to differentiate from the actual Y, because our estimate Y’ is not totally accurate

Why predict Y? We already have Y scores Y’ isn’t as good as Y But, the regression lets you predict new data Use SAT scores to predict college performance Use morbidity data to predict longevity of smokers Use past status of markets to predict their future status

Making predictions You can rewrite the line formula as: The slope is the middle term b y = r(Sy/Sx) Get the intercept by moving stuff around

Example Jessica wants to predict her final exam grade from the midterm She earned a 74 on the midterm The mean grade on the midterm was 70 and s = 4 In previous years, the mean on the final was a 75 and s = 4. The correlation between the two tests was r =.60 What score can Jessica predict? Y’ = (4/4)(74 – 70)

Example Jessica wants to predict her final exam grade from the midterm She earned a 74 on the midterm The mean grade on the midterm was 70 and s = 4 In previous years, the mean on the final was a 75 and s = 4. The correlation between the two tests was r =.60 What score can Jessica predict? Y’ = (4/4)(74 – 70)

Example Y’ = (4/4)(74 – 70) Y’ = 75 + (.6)(1)(4) Y’ = Y’ = 77.4 What if the correlation between the midterm and final was 1?

Example Y’ = Ybar + r(Sy / Sx) (X – Xbar) Y’ = 75 + (1)(4/4)(74 – 70) Y’ = = 79 The correlation is perfect here A difference in score values reflects a difference in scale The distance from the mean is identical

Example Y’ = Ybar + r(Sy / Sx) (X – Xbar) What if the correlation between the midterm and final was 0?

Example Y’ = Ybar + r(Sy / Sx) (X – Xbar) Y’ = 75 + (0)(4/4)(74 – 70) Y’ = 75 The best prediction is the mean when the variables are uncorrelated, or the correlation is unknown. Regression allows us to beat the mean

Variation If r = +-1, all variation is explained, if r = 0 all variation is unexplained The closer the points fall to the regression line, the greater the variation explained

Causation As with correlation, we can’t infer causation with regression We’re observing variables that correlate, not running experiments Beware of lurking variables. Another explanation may fit the data better

Midterm For the midterm you are going to have to integrate what you have learned. You are going to be given one or more research problems with small datasets. – Because all you know how to do right now is descriptive statistics and correlation/regression analyses they will be correlational designs. You are going to have to run all the descriptive statistics you know. (e.g. what the mean, standard deviation, range, mode, etc. for the two variables). Draw a scatterplot. You will then calculate the correlation, report whether or not it is significant. You will then do a regression, calculate the slope and intercept and draw the line of best fit through the scatterplot. – I may give you a value for x and ask you to predict a corresponding y value given your regression line.

Homework 2 a-d 10 15