Introduction to Regression

Slides:



Advertisements
Similar presentations
Residuals.
Advertisements

Statistics for the Social Sciences
Regression and Correlation
How To Make Graphs in Microsoft Excel Outline Making Bar Graphs Making Scatter Plots – 1 series Making Scatter Plots – Multiple Series.
Scatter Diagrams and Correlation
Regression, Residuals, and Coefficient of Determination Section 3.2.
C HAPTER 3: E XAMINING R ELATIONSHIPS. S ECTION 3.3: L EAST -S QUARES R EGRESSION Correlation measures the strength and direction of the linear relationship.
Introduction As we saw in the previous lesson, we can use a process called regression to fit the best equation with the data in an attempt to draw conclusions.
P.33 #14-19, p. 34 #32-34, Lesson Scatter Plots and Least-Squares Lines.
Objectives (BPS chapter 5)
Linear Regression.
Residuals and Residual Plots Most likely a linear regression will not fit the data perfectly. The residual (e) for each data point is the ________________________.
VCE Further Maths Least Square Regression using the calculator.
How do scientists show the results of investigations?
Biostatistics Unit 9 – Regression and Correlation.
1 Chapter 3: Examining Relationships 3.1Scatterplots 3.2Correlation 3.3Least-Squares Regression.
Check it out! 4.3.3: Distinguishing Between Correlation and Causation
Prior Knowledge Linear and non linear relationships x and y coordinates Linear graphs are straight line graphs Non-linear graphs do not have a straight.
Residuals Target Goal: I can construct and interpret residual plots to assess if a linear model is appropriate. 3.2c Hw: pg 192: 48, 50, 54, 56, 58 -
Ch 3 – Examining Relationships YMS – 3.1
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
AP STATISTICS LESSON 3 – 3 LEAST – SQUARES REGRESSION.
4.1 Scatter Diagrams and Correlation. 2 Variables ● In many studies, we measure more than one variable for each individual ● Some examples are  Rainfall.
Correlation & Regression – Non Linear Emphasis Section 3.3.
3.3 Least-Squares Regression.  Calculate the least squares regression line  Predict data using your LSRL  Determine and interpret the coefficient of.
Max temp v min temp. It can be seen from the scatterplot that there is a correlation between max temp and min temp. Generally, as min temp increases,
Chapter Line of best fit. Objectives  Determine a line of best fit for a set of linear data.  Determine and interpret the correlation coefficient.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 1 of 20 Chapter 4 Section 2 Least-Squares Regression.
Regression Regression relationship = trend + scatter
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Scatter Diagrams and Correlation Variables ● In many studies, we measure more than one variable for each individual ● Some examples are  Rainfall.
Plotting Points and Modeling Real-World Data Using Scatter Plots.
Scatter Plots And Looking at scatter plots Or Bivariate Data.
11/23/2015Slide 1 Using a combination of tables and plots from SPSS plus spreadsheets from Excel, we will show the linkage between correlation and linear.
3.2 - Least- Squares Regression. Where else have we seen “residuals?” Sx = data point - mean (observed - predicted) z-scores = observed - expected * note.
Get out the Notes from Monday Feb. 4 th, Example 2: Consider the table below displaying the percentage of recorded music sales coming from music.
Residuals. Residual or Deviation A residual or deviation = y data – y line = y actual – y theoetical = y experiment - y model A residual plot is if for.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 12.3.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
Jeopardy Final Jeopardy Parent Functions Equations Stats Other $100
Transformations.  Although linear regression might produce a ‘good’ fit (high r value) to a set of data, the data set may still be non-linear. To remove.
Creating a Residual Plot and Investigating the Correlation Coefficient.
Correlation The apparent relation between two variables.
1.5 Scatter Plots and Least-Squares Lines Objectives : Create a scatter plot and draw an informal inference about any correlation between the inference.
Using Technology & Lines of Best Fit  I can construct graphs and lines of best fit using a graphing calculator.
Correlation/Regression - part 2 Consider Example 2.12 in section 2.3. Look at the scatterplot… Example 2.13 shows that the prediction line is given by.
AP Statistics HW: p. 165 #42, 44, 45 Obj: to understand the meaning of r 2 and to use residual plots Do Now: On your calculator select: 2 ND ; 0; DIAGNOSTIC.
A P STATISTICS LESSON 3 – 3 (DAY 3) A P STATISTICS LESSON 3 – 3 (DAY 3) RISIDUALS.
Residuals Recall that the vertical distances from the points to the least-squares regression line are as small as possible.  Because those vertical distances.
^ y = a + bx Stats Chapter 5 - Least Squares Regression
Warm-up O Turn in HW – Ch 8 Worksheet O Complete the warm-up that you picked up by the door. (you have 10 minutes)
Residual Plots Unit #8 - Statistics.
Engineers often: Regress data to a model  Used for assessing theory  Used for predicting  Empirical or theoretical model Use the regression of others.
Chapter 8 Linear Regression. Fat Versus Protein: An Example 30 items on the Burger King menu:
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
Residuals, Influential Points, and Outliers
REGRESSION MODELS OF BEST FIT Assess the fit of a function model for bivariate (2 variables) data by plotting and analyzing residuals.
Introduction to regression 3C. Least-squares regression.
PreCalculus 1-7 Linear Models. Our goal is to create a scatter plot to look for a mathematical correlation to this data.
Chapter 8 Part I Answers The explanatory variable (x) is initial drop, measured in feet, and the response variable (y) is duration, measured in seconds.
distance prediction observed y value predicted value zero
Regression and Correlation
Data Transformation Data Analysis.
Suppose the maximum number of hours of study among students in your sample is 6. If you used the equation to predict the test score of a student who studied.
Creating Scatterplots
Creating Scatterplots
Residuals and Residual Plots
Lesson 2.2 Linear Regression.
Residuals and Residual Plots
Presentation transcript:

Introduction to Regression 3E Residuals

Definition A residual is the vertical difference between each data point and the regression line.

To calculate the residuals for each data point: Calculate the predicted value of y from the regression equation. Calculate the difference between this predicted value and the original value.

Examples Read Worked Example 8 on p.130. Exercise 3E, Q.1

Note Calculate the residuals by (y-ypred), i.e. the actual values minus the predicted values. The sum of all the residuals always add to zero (or very close to zero after rounding), when least-squares regression is used.

Residual Analysis Why do we calculate residuals? Residuals give us more information about the type of regression. If we plot the residuals against the original x-values we get a pattern.

Types of residual plot Points of residual randomly scattered above and below x-axis. Original data probably linear.

Points of residual show curved pattern with a series of negative, then positive and back to negative residuals. Original data probably non-linear. Points of residual show curved pattern with a series of positive, then negative and back to positive residuals. Original data probably non-linear.

In other words, if you see any sort of pattern in the residual plot, there probably isn’t a linear relationship in the original data set.

Example. Ex 3E, Q.2, 4 You do Ex 3E, Q.5, 6, 7, 8

CAS calculator Once you have done the least-squares regression analysis on the Lists & Spreadsheet page: Move the cursor to the shaded cell in Column E and press: Ctrl MENU 4: Variables 3: Link To And select the list stat?.resid. The stat number will vary depending on your calculator and previously stored data. You can then plot the stat.resid against x-value by clicking on the y-axis label and selecting the residual list stat.resid