Correlation Scatter Plots Correlation Coefficients Significance Test.

Slides:



Advertisements
Similar presentations
Simple Linear Regression and Correlation by Asst. Prof. Dr. Min Aung.
Advertisements

Lesson 10: Linear Regression and Correlation
Forecasting Using the Simple Linear Regression Model and Correlation
Correlation and Regression
Describing Relationships Using Correlation and Regression
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
Introduction to Regression Analysis
2.2 Correlation Correlation measures the direction and strength of the linear relationship between two quantitative variables.
Correlation and Simple Regression Introduction to Business Statistics, 5e Kvanli/Guynes/Pavur (c)2000 South-Western College Publishing.
Class 5: Thurs., Sep. 23 Example of using regression to make predictions and understand the likely errors in the predictions: salaries of teachers and.
The Simple Regression Model
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
SIMPLE LINEAR REGRESSION
Chapter Topics Types of Regression Models
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
REGRESSION AND CORRELATION
Introduction to Probability and Statistics Linear Regression and Correlation.
Math 227 Elementary Statistics Math 227 Elementary Statistics Sullivan, 4 th ed.
SIMPLE LINEAR REGRESSION
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Correlation & Regression Math 137 Fresno State Burger.
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
Linear Regression Analysis
Correlation & Regression
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
Relationship of two variables
Regression Analysis (2)
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Correlation and Regression
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
Jon Curwin and Roger Slater, QUANTITATIVE METHODS: A SHORT COURSE ISBN © Thomson Learning 2004 Jon Curwin and Roger Slater, QUANTITATIVE.
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 1 Section 13-6 Regression and Correlation.
Introduction to Linear Regression
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
Elementary Statistics Correlation and Regression.
Correlation Analysis. A measure of association between two or more numerical variables. For examples height & weight relationship price and demand relationship.
Correlation Correlation is used to measure strength of the relationship between two variables.
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Section 4.1 Scatter Diagrams and Correlation. Definitions The Response Variable is the variable whose value can be explained by the value of the explanatory.
CHAPTER 3 INTRODUCTORY LINEAR REGRESSION. Introduction  Linear regression is a study on the linear relationship between two variables. This is done by.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter Bivariate Data (x,y) data pairs Plotted with Scatter plots x = explanatory variable; y = response Bivariate Normal Distribution – for.
Creating a Residual Plot and Investigating the Correlation Coefficient.
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
Chapter 8: Simple Linear Regression Yang Zhenlin.
SIMPLE LINEAR REGRESSION AND CORRELLATION
Chapter 7 Calculation of Pearson Coefficient of Correlation, r and testing its significance.
Linear Regression and Correlation Chapter GOALS 1. Understand and interpret the terms dependent and independent variable. 2. Calculate and interpret.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
CORRELATION ANALYSIS.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Linear Regression and Correlation Chapter 13.
Chapter 11 Linear Regression and Correlation. Explanatory and Response Variables are Numeric Relationship between the mean of the response variable and.
Chapter 13 Linear Regression and Correlation. Our Objectives  Draw a scatter diagram.  Understand and interpret the terms dependent and independent.
Scatter Plots and Correlation
Regression and Correlation
Correlation & Regression
Correlation and Simple Linear Regression
Correlation and Regression
Chapter 5 STATISTICS (PART 4).
Correlation and Simple Linear Regression
Correlation and Regression
CORRELATION ANALYSIS.
Correlation and Simple Linear Regression
Correlation and Regression
Simple Linear Regression and Correlation
Correlation and the Pearson r
SIMPLE LINEAR REGRESSION
Presentation transcript:

Correlation Scatter Plots Correlation Coefficients Significance Test

Introduction We are often asked to describe the relationship between two or more variables Is there a relationship between points in the leaving cert and QCA Is there a relationship between parents IQ and children's IQ

What are Scatter Plots Two dimensional plot showing the (X,Y) value for each observation Used to determine whether there is any pronounced relationship and if so whether the relationship may be treated as approximately linear. Y is usually the response (dependent) variable X is usually the explanatory (independent) variable The response variable is the variable whose variation we wish to explain An explanatory variable is a variable used to explain variation in the response variable

Positive Linear Relationship

Negative Linear Relationship

No Linear Relationship

No Relationship

Example 1 Two sets of exam results for 11 students Maths & Physics Are they related Does a good performance in Maths go with a good performance in Physics Let the Maths mark be X Let the Physics mark be Y

Table of Results X Y X- Total is 440 X-mean is 40 Y-Total is 330 Y-mean is 30

Maths V s Physics

What does the Graph tell us The means divide the graph into four quadrants Most of the data lies in the bottom left or top right quadrants Only two fall outside these quadrants This indicates a probable relationship between X and Y for a particular student

Correlation Coefficient From a diagram we get a general idea of the relationship. For precision we need a numerical measure. We need to measure the strength of the relationship The most common measure is the Pearson Product Moment Correlation Coefficient Usually known as the Correlation Coefficient We will usually be dealing with population samples The sample correlation coefficient is called r

Properties of r r can take values from -1 to +1 r = +1 or r = -1 represents a perfect linear correlation or a perfect relationship between the variables r = 0 indicates little or no linear relationship i.e. as X increases there is no definite tendency for the values of Y to increase or decrease in a straight line r close to +1 indicates a large positive correlation i.e. Y tends to increase as X increases. r close to -1 indicates a large negative correlation i.e. Y tends to decrease as X increases. Further r differs from 0, the stronger the relationship. The sign of r indicates the direction of the relationship

Examples of various r values r = +1r = -1r = r = 0.70r = 0

The formula for Calculating r

Example 2 Find the correlation coefficient r between Y and X SubjectABCDEFG X Y

Create a table SubjectXiXi YiYi XiYiXiYi X i squared Y i squared A B C D E F G Total

Calculating S xx

Calculating S yy

Calculating S xy

Calculating r

Significance Test H o : No Linear relationship exists r equal to 0 H A: There is a linear relationship r not equal to 0 Confidence Interval say 90%, 95%, 99% etc This means alpha = 0.1, 0.05, 0.01 etc Use table 10: Percentage points of the Correlation Coefficient Left hand column choose v = n-2 ( n = sample size) Find critical value If r > critical value then reject H o

Conclusion r = 0.82 let alpha = 0.05 v = n-2 giving v = 5 From tables the critical point is > We reject Ho and conclude: We are 95% confident that there is a linear relationship between X and Y

Example 3 Is there an obvious relationship between X and Y Y = X+2 This is a Perfect Relationship What will r be r will be equal to 1 X Y

Set up the data table SubjectYXXYX squaredY squared A B C D E F Total

Calculate S xx

Calculate S yy

Calculate S xy

Calculate r Perfect Positive Linear Relationship

Back to Example 1 In our original example with the student results we drew a scatter plot. From the diagram it looked as if there was a probable positive linear relationship To be sure we need to calculate r Using a significance level of alpha = 0.05 we will test the claim that there is no linear correlation between Maths results and Physics results

Create a data table StudentXYXYX squaredY squared A B C D E F G H I J K Total

Apply the formulae

Correlation Coefficient is

Conclusion From the tables the critical point is r = > We Reject the claim and conclude that There is a Positive Linear Relationship between results in Maths and results in Physics

Regression Least Squares Predicting Y using X

What is Regression? Regression Analysis is used for prediction It allows us to predict the value of one variable given the value of another variable It gives us an equation that uses one variable to help explain variation in another In this course we deal with Simple Linear Regression

Simple Linear Regression First step in determining a relationship was drawing a scatter plot If a possible relationship was shown we found the strength of the relationship by calculating the correlation coefficient r The next stage is to calculate an equation which best describes the relationship between the two variables This line is called the Regression Line

What is the ‘best fit’ line Example 1

‘Least Squares’ best fit line We can have several lines of the form We want ‘best’ least residuals

Least Squares estimates are the least squares estimates of Closely related to r

Example 2

Combining we get

Regression line is

Example 3 X Y We know Y=X+2

Verifying the equation is correct

Giving

Example 1

Regression line

Example 1 continued If a student received a grade of 53 in Maths, what would the expected grade be in Physics We use the Regression line in order to predict the Physics result

Graphing The Regression Line