Regression and Correlation. Bivariate Analysis Can we say if there is a relationship between the number of hours spent in Facebook and the number of friends.

Slides:



Advertisements
Similar presentations
Chapter 16: Correlation.
Advertisements

Lesson 10: Linear Regression and Correlation
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Education 793 Class Notes Joint Distributions and Correlation 1 October 2003.
Correlation Chapter 9.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1 Statistics for Business and Economics 7 th Edition Chapter 2 Describing Data:
Correlation and Regression Analysis
PSY 307 – Statistics for the Behavioral Sciences
Chapter Seven The Correlation Coefficient. Copyright © Houghton Mifflin Company. All rights reserved.Chapter More Statistical Notation Correlational.
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Chapter 21 Correlation. Correlation A measure of the strength of a linear relationship Although there are at least 6 methods for measuring correlation,
Linear Regression Analysis
Correlation and Linear Regression
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Lecture 16 Correlation and Coefficient of Correlation
Linear Regression and Correlation
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Correlation Scatter Plots Correlation Coefficients Significance Test.
Linear Regression and Correlation
Correlation and regression 1: Correlation Coefficient
Data Collection & Processing Hand Grip Strength P textbook.
Covariance and correlation
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
Introduction to Quantitative Data Analysis (continued) Reading on Quantitative Data Analysis: Baxter and Babbie, 2004, Chapter 12.
Chapter 6 & 7 Linear Regression & Correlation
JDS Special Program: Pre-training1 Basic Statistics 01 Describing Data.
SESSION Last Update 17 th June 2011 Regression.
Hypothesis of Association: Correlation
Basic Statistics Correlation Var Relationships Associations.
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
Correlation Correlation is used to measure strength of the relationship between two variables.
Examining Relationships in Quantitative Research
Linear correlation and linear regression + summary of tests
Introduction to Correlation Analysis. Objectives Correlation Types of Correlation Karl Pearson’s coefficient of correlation Correlation in case of bivariate.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 16 Data Analysis: Testing for Associations.
Describing Relationships Using Correlations. 2 More Statistical Notation Correlational analysis requires scores from two variables. X stands for the scores.
Correlation and Regression. Section 9.1  Correlation is a relationship between 2 variables.  Data is often represented by ordered pairs (x, y) and.
CORRELATION. Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson’s coefficient of correlation.
Examining Relationships in Quantitative Research
Chapter Thirteen Copyright © 2006 John Wiley & Sons, Inc. Bivariate Correlation and Regression.
Linear correlation and linear regression + summary of tests Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
April 1 st, Bellringer-April 1 st, 2015 Video Link Worksheet Link
Section 2.6 – Draw Scatter Plots and Best Fitting Lines A scatterplot is a graph of a set of data pairs (x, y). If y tends to increase as x increases,
1 Virtual COMSATS Inferential Statistics Lecture-25 Ossam Chohan Assistant Professor CIIT Abbottabad.
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
2.5 Using Linear Models A scatter plot is a graph that relates two sets of data by plotting the data as ordered pairs. You can use a scatter plot to determine.
Correlation They go together like salt and pepper… like oil and vinegar… like bread and butter… etc.
Chapter 16: Correlation. So far… We’ve focused on hypothesis testing Is the relationship we observe between x and y in our sample true generally (i.e.
You can calculate: Central tendency Variability You could graph the data.
Linear Correlation (12.5) In the regression analysis that we have considered so far, we assume that x is a controlled independent variable and Y is an.
Linear Regression and Correlation Chapter GOALS 1. Understand and interpret the terms dependent and independent variable. 2. Calculate and interpret.
CORRELATION ANALYSIS.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Correlation & Linear Regression Using a TI-Nspire.
The simple linear regression model and parameter estimation
Simple Linear Correlation
Warm Up Scatter Plot Activity.
Correlation & Linear Regression
Correlation and Regression
CORRELATION ANALYSIS.
Correlation and Regression
7.1 Draw Scatter Plots & Best-Fitting Lines
Presentation transcript:

Regression and Correlation

Bivariate Analysis Can we say if there is a relationship between the number of hours spent in Facebook and the number of friends you have? In this question, we have two seemingly unrelated variables –Number of hours spent in facebook –Number of facebook friends. 2

Bivariate Analysis UnivariateBivariate Single variableTwo variables Does not deal with causes or relationship Deals with causes or relationships Main purpose is to describeMajor purpose is to explain Central tendency, dispersion, frequency distributions, graphs Analysis of two variables simultaneously, correlations, comparisons, relationships, causes, explanations, independent and dependent variables 3

The question is: –How can we find a relationship between the two? (assuming that there exists such a relationship) Bivariate Analysis 4

Relative Variation/Dispersion Unitless; used to compare one data set to another Coefficient of Variation (CV) –Ratio of SD to mean, expressed in percentage –The higher the value, the more your data ‘varies’ is ‘dispersed’ Z Score –Measures how many SDs an observation is above/below the mean Refer to your handout for the formulae. 5

Covariance

prepared by: Rose Ann V. Sale7

Covariance Measure of relationship between two variables Computed as below: 8

Example1 Covariance:

Example2 Covariance: GPATV in hours per week

cov(X,Y) > 0 X and Y are positively correlated cov(X,Y) < 0 X and Y are inversely correlated cov(X,Y) = 0 X and Y are independent Interpreting Covariance 11

Correlation

prepared by: Rose Ann V. Sale13

Correlation Measures the relative strength of the linear relationship between two variables Unitless Ranges between –1 and 1 The closer to –1, the stronger the negative linear relationship The closer to 1, the stronger the positive linear relationship The closer to 0, the weaker the linear relationship 14

Correlation Pearson r –Used for quantitative data (remember interval & ratio levels?) –Computed as below: Spearman  –Pearson r between ordinal variables, used for qualitative data, and applies to quantitative as well! –Computed as below (if your data has no ties!): In case of ties, Pearson r formula is applicable to ranked data, (tied values’ rank = Ave of their positions in ascending order) 15

Scatter Plots of Data with Various Correlation Coefficients Y X Y X Y X r = -1 r = -.6 r = 0 16

Y X Y X r = +.3 r = +1 Y X r = 0 Scatter Plots of Data with Various Correlation Coefficients 17

Y X Y X Y Y X X Linear relationshipsCurvilinear relationships Linear Correlation 18

Y X Y X Y Y X X Strong relationshipsWeak relationships Linear Correlation 19

Linear Correlation Y X Y X No relationship 20

Example 1 r xy =  = 0.786

Example2 r xy = GPATV in hours per week  =

Linear Regression

In correlation, the two variables are treated as equals. In regression, one variable is considered independent (=predictor) variable (X) and the other the dependent (=outcome) variable Y. 24

Linear Regression Independent variable  cause Dependent variable  effect Linear regression is a method of predicting the value of dependent variable Y from the value of the independent variable X 25

What is “ Linear ” ? Remember this: y = mx + b B m 26

Line of Regression Prediction line or line of “best fit” –This is where you find the expected value of one variable given the other. Data points tend to cluster about this line (-1 < r < 1) General form given below: Can you give the y = mx + b equivalent of the above? 27

Standard Error of Estimate Absolute difference from your line of regression (“predicted”) to your actual measurements Calculated as follows: Interpreted as “ we can expect 68% of the time the true value of Y will lie in the band  y units from the line of regression. ” 28

Example1 Given the following ordered pair –(2,3) (5,5) (9,13) (12,7) (13,14) –Draw the scatter graph –Find the formula for the regression line –Draw an approximation of the regression line –Compute for expected value (Ŷ) given X=9 29

Example1 Answer 30 Y= x (Ŷ | x=9) =

Example2 31 -Compute for the regression line - Predict the number of hours spent in Facebook of a person who has 400 fb friends

Example2 Answer y = x – 0.13 (y | x=400) =