Creating a Residual Plot and Investigating the Correlation Coefficient.

Slides:



Advertisements
Similar presentations
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Advertisements

Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Bivariate Correlation and Regression CHAPTER Thirteen.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Correlation and Regression
Learn to locate and graph points on the coordinate plane.
Correlation and Regression. Relationships between variables Example: Suppose that you notice that the more you study for an exam, the better your score.
Regression, Residuals, and Coefficient of Determination Section 3.2.
Correlation 10/30. Relationships Between Continuous Variables Some studies measure multiple variables – Any paired-sample experiment – Training & testing.
Correlation vs. Causation What is the difference?.
7.1 Draw Scatter Plots & Best-Fitting Lines 7.1 HW Quiz: Friday 7.1, 7.2 Quiz: TBA 7.1, 7.2, 7.7 Test: Sept. 22 Make-up work needs to be made up by Monday.
Descriptive Methods in Regression and Correlation
Linear Regression.
Relationship of two variables
Residuals and Residual Plots Most likely a linear regression will not fit the data perfectly. The residual (e) for each data point is the ________________________.
Scatter Plots and Linear Correlation. How do you determine if something causes something else to happen? We want to see if the dependent variable (response.
1.9 Graphing Data on the Coordinate Plane Two number lines that intersect at right angles form a coordinate plane. X- axis – the horizontal axis Y- axis.
Learning Objective Chapter 14 Correlation and Regression Analysis CHAPTER fourteen Correlation and Regression Analysis Copyright © 2000 by John Wiley &
Biostatistics Unit 9 – Regression and Correlation.
1 Chapter 3: Examining Relationships 3.1Scatterplots 3.2Correlation 3.3Least-Squares Regression.
Chapter 6: Exploring Data: Relationships Lesson Plan Displaying Relationships: Scatterplots Making Predictions: Regression Line Correlation Least-Squares.
Prior Knowledge Linear and non linear relationships x and y coordinates Linear graphs are straight line graphs Non-linear graphs do not have a straight.
Chapter 6 & 7 Linear Regression & Correlation
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
 Graph of a set of data points  Used to evaluate the correlation between two variables.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Scatterplot and trendline. Scatterplot Scatterplot explores the relationship between two quantitative variables. Example:
Scatterplots are used to investigate and describe the relationship between two numerical variables When constructing a scatterplot it is conventional to.
Regression Regression relationship = trend + scatter
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Chapter Thirteen Copyright © 2006 John Wiley & Sons, Inc. Bivariate Correlation and Regression.
Correlation and Regression: The Need to Knows Correlation is a statistical technique: tells you if scores on variable X are related to scores on variable.
Section 2.6 – Draw Scatter Plots and Best Fitting Lines A scatterplot is a graph of a set of data pairs (x, y). If y tends to increase as x increases,
Correlation The apparent relation between two variables.
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
Scatter Diagram of Bivariate Measurement Data. Bivariate Measurement Data Example of Bivariate Measurement:
2.5 Using Linear Models A scatter plot is a graph that relates two sets of data by plotting the data as ordered pairs. You can use a scatter plot to determine.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.3 Predicting the Outcome.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
What Do You See?. A scatterplot is a graphic tool used to display the relationship between two quantitative variables. How to Read a Scatterplot A scatterplot.
.  Relationship between two sets of data  The word Correlation is made of Co- (meaning "together"), and Relation  Correlation is Positive when the.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Discovering Mathematics Week 9 – Unit 6 Graphs MU123 Dr. Hassan Sharafuddin.
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
CORRELATION ANALYSIS.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
REGRESSION MODELS OF BEST FIT Assess the fit of a function model for bivariate (2 variables) data by plotting and analyzing residuals.
GOAL: I CAN USE TECHNOLOGY TO COMPUTE AND INTERPRET THE CORRELATION COEFFICIENT OF A LINEAR FIT. (S-ID.8) Data Analysis Correlation Coefficient.
Correlation Definition: Correlation - a mutual relationship or connection between two or more things. (google.com) When two set of data appear to be connected.
Correlation & Linear Regression Using a TI-Nspire.
Correlation.
Pearson’s Correlation Coefficient
distance prediction observed y value predicted value zero
Correlation 10/27.
Axes Quadrants Ordered Pairs Correlations
Correlation 10/27.
Scatterplots A way of displaying numeric data
CHAPTER fourteen Correlation and Regression Analysis
CHAPTER 10 Correlation and Regression (Objectives)
Suppose the maximum number of hours of study among students in your sample is 6. If you used the equation to predict the test score of a student who studied.
2. Find the equation of line of regression
Correlation vs. Causation
CORRELATION ANALYSIS.
Residuals and Residual Plots
1.7 Nonlinear Regression.
Section 1.4 Curve Fitting with Linear Models
Correlation and Regression
11A Correlation, 11B Measuring Correlation
Presentation transcript:

Creating a Residual Plot and Investigating the Correlation Coefficient

X Y Plot the data: The regression Line is Y= x + 1 A residual is the vertical distance from the regression line to a point off of the line. +1 residual -1 residual

Residuals X Y (residual) These are ordered pairs. I will show you another way to find these values later in the notes.

Residual Points X Y (residual) (1, -1) (2, +1) (3, 0) (4, 0) (5, +1) (5, -1) These can be plotted in Quadrants I and IV of a coordinate plane. This is called a Residual Plot

A residual plot can help us determine if the data is linear. Random points means LINEAR “U” Shaped means NOT LINEAR

Vocabulary O Residual - A residual is the difference between the observed y-value (from scatter plot) and the predicted y-value (from regression equation line). It is the vertical distance from the actual point to the point on the regression line. You can think of a residual as how far the data "fall" from the regression line (sometimes referred to as "observed error").

O Residual Plot - A residual plot is a scatter plot that shows the residuals on the vertical axis and the independent variable on the horizontal axis. The plot will help you to decide on whether a linear model is appropriate for your data.

How to create a residual plot without a calculator. X Y - Y = residual Original Data Ordered Pairs 2- = = = = = = Residual Ordered Pairs (1, -1) Predicted y values by using regression equation y= x + 1 (2, 1) (3, 0) (4, 0) (5, -1) (5, 1) Plot these Points on the residual plot

Plot the Residual Ordered Pairs (1, -1) (2, 1) (3, 0) (4, 0) (5, -1) (5, 1) X-coordinate (independent Variable) Residual is the Y-coordinate

Correlation Coefficient - The quantity r, called the linear correlation coefficient, measures the strength and the direction of a linear relationship between two variables. The linear correlation coefficient is sometimes Referred to as the Pearson product moment correlation coefficient in honor of its developer Karl Pearson.

We use a calculator to find the correlation coefficient r. The correlation coefficient r can indicate: 1)A positive correlation 2)A negative correlation 3)No correlation 4)A perfect correlation

Positive correlation - If x and y have a strong positive linear correlation, r is close to +1. An r value of exactly +1 indicates a perfect positive fit. Positive values of r indicate a relationship Between x and y variables such that as values for x increases, values for y also increase

Negative Correlation - If x and y have a strong negative linear correlation, r is close to -1. An r value of exactly -1 indicates a perfect negative fit. Negative values indicate a relationship between x and y such that as values for x increase, values for y decrease.

No correlation - If there is no linear correlation or a weak linear correlation, r is close to 0. A value near zero means that there is a random, nonlinear relationship between the two variables

A perfect correlation - of ± 1 occurs only when the data points all lie exactly on a straight line. If r = +1, the slope of this line is positive. If r = -1, the slope of this line is negative.

Determine the type of correlation each has based on the correlation coefficient r: 1. r = 0.9Because r is close to 1 it has a positive linear correlation 2. r = -0.85Because r is close to -1 it has a negative linear correlation 3. r = 0.01Because r is so close to 0, it has no linear correlation

4. Which one has the strongest linear correlation? A) 0.27B) -0.92C) 0.85D) Which r value represents a perfect linear correlation? A) 0.98B) 0C) -0.89D) -1

Closure Explain how to tell from the residual plot that the data has a linear relationship.