REGRESSION Stats 1 with Liz. AIMS By the end of the lesson, you should be able to… o Understand the method of least squares to find a regression line.

Slides:



Advertisements
Similar presentations
Least-Squares Regression Section 3.3. Correlation measures the strength and direction of a linear relationship between two variables. How do we summarize.
Advertisements

Linear regression involves finding the equation of the line of best fit on a scatter graph. The equation obtained can then be used to make an estimate.
Section 10-3 Regression.
FITTING TREND LINES AND FORECASTING.  FITTING BY EYE: inadequate technique. We will look at the following two:  3-MEDIAN METHOD AND  LEAST-SQUARES.
Kin 304 Regression Linear Regression Least Sum of Squares
Least Squares Regression
AP Statistics.  Least Squares regression is a way of finding a line that summarizes the relationship between two variables.
Chapter 3 Bivariate Data
Scatter Diagrams and Linear Correlation
Linear Regression Larson/Farber 4th ed. 1 Section 9.2.
Correlation and Linear Regression
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
2.4 Using Linear Models. The Trick: Converting Word Problems into Equations Warm Up: –How many ways can a $50 bill be changed into $5 and $20 bills. Work.
Linear Regression.
Warm-up A. P Stats Shape-Changing Transformations/ Stats – 3.4 Diagonstics Copy the two columns and match them.
Regression lesson 4 Starter Dangers of Predicting (extrapolation) Interpretation questions Exam questions.
5-7 Scatter Plots. _______________ plots are graphs that relate two different sets of data by displaying them as ordered pairs. Usually scatter plots.
VCE Further Maths Least Square Regression using the calculator.
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
Measure your handspan and foot length in cm to nearest mm We will record them as Bivariate data below: Now we need to plot them in what kind of graph?
Correlation and regression lesson 1 Introduction.
Warm-up with 3.3 Notes on Correlation
Biostatistics Unit 9 – Regression and Correlation.
Researchers, such as anthropologists, are often interested in how two measurements are related. The statistical study of the relationship between variables.
2-5 Using Linear Models Make predictions by writing linear equations that model real-world data.
Chapter 6 & 7 Linear Regression & Correlation
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 1 Section 13-6 Regression and Correlation.
1.6 Linear Regression & the Correlation Coefficient.
Section 5.2: Linear Regression: Fitting a Line to Bivariate Data.
Warm-up with 3.3 Notes on Correlation Universities use SAT scores in the admissions process because they believe these scores provide some insight into.
Correlation 1 Scatter diagrams Measurement of correlation Using a calculator Using the formula Practice qs.
Warm Up Write the equation of the line passing through each pair of passing points in slope-intercept form. 1. (5, –1), (0, –3) 2. (8, 5), (–8, 7) Use.
Regression Regression relationship = trend + scatter
Linear Regression. Determine if there is a linear correlation between horsepower and fuel consumption for these five vehicles by creating a scatter plot.
Section 9.2 Linear Regression. Section 9.2 Objectives Find the equation of a regression line Predict y-values using a regression equation.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Section 12.3 Regression Analysis HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc. All.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
Revision: Pivot Table 1. Histogram 2. Trends 3. Linear 4. Exponential
Chapter 4 – Correlation and Regression before: examined relationship among 1 variable (test grades, metabolism, trip time to work, etc.) now: will examine.
5.7 Scatter Plots and Line of Best Fit I can write an equation of a line of best fit and use a line of best fit to make predictions.
Correlation The apparent relation between two variables.
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
Linear Regression 1 Section 9.2. Section 9.2 Objectives 2 Find the equation of a regression line Predict y-values using a regression equation.
^ y = a + bx Stats Chapter 5 - Least Squares Regression
Ch 5 Relationships Between Quantitative Variables (pg 150) –Will use 3 tools to describe, picture, and quantify 1) scatterplot 2) correlation 3) regression.
Linear Prediction Correlation can be used to make predictions – Values on X can be used to predict values on Y – Stronger relationships between X and Y.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Method 3: Least squares regression. Another method for finding the equation of a straight line which is fitted to data is known as the method of least-squares.
Response Variable: measures the outcome of a study (aka Dependent Variable) Explanatory Variable: helps explain or influences the change in the response.
Correlation and Regression Ch 4. Why Regression and Correlation We need to be able to analyze the relationship between two variables (up to now we have.
The Line of Best Fit CHAPTER 2 LESSON 3  Observed Values- Data collected from sources such as experiments or surveys  Predicted (Expected) Values-
1 Objective Given two linearly correlated variables (x and y), find the linear function (equation) that best describes the trend. Section 10.3 Regression.
10.2 Regression By the end of class you will be able to graph a regression line and use it to make predictions.
Linear Regression Essentials Line Basics y = mx + b vs. Definitions
Correlation S1 Maths with Liz.
Statistics 101 Chapter 3 Section 3.
Ch12.1 Simple Linear Regression
LSRL Least Squares Regression Line
Module 15-2 Objectives Determine a line of best fit for a set of linear data. Determine and interpret the correlation coefficient.
Least-Squares Regression
^ y = a + bx Stats Chapter 5 - Least Squares Regression
Section 2: Linear Regression.
Lesson 2.2 Linear Regression.
11C Line of Best Fit By Eye, 11D Linear Regression
Objectives Vocabulary
Section 3.2: Least Squares Regressions
Tuesday, September 29 Check HW Probs 3.13, 14, 20 (7-11-doubles)
Lesson 2.2 Linear Regression.
Presentation transcript:

REGRESSION Stats 1 with Liz

AIMS By the end of the lesson, you should be able to… o Understand the method of least squares to find a regression line o Use the regression line to estimate values & understand the limitations of that prediction o Use linear transformation to convert the variable in a regression line o Calculate residuals to identify outliers

Linear regression involves finding the equation of the line of best fit on a scatter graph. REGRESSION The best fitting line is the one that minimizes the sum of the squared deviations,, where d i is the vertical distance between the i th point and the line. d1d1 d2d2 d3d3 d4d4 d5d5 d6d6 The distances d i are sometimes referred to as residuals. Note: The best fitting line should pass through the mean point,. The equation can then be used to make an estimate of one variable given the value of the other variable.

The line that minimises the sum of squared deviations is formally known as the least squares regression line of y on x. The equation of the least squares regression line of y on x is: REGRESSION and : Recall : and y = a + bx where: y-intercept gradient

A BIT MORE ABOUT THE PMCC IN RELATION TO THE LINE OF BEST FIT… The fit of the line is related to the value of r (the PMCC). If r = 1or r = -1, it is considered a “perfect fit”. Being a perfect fit is also known as collinear. Example: If r = , the fit is “excellent”. If r = 0.854, the fit is “good”.

EXAMPLE 1 You can also find these values using your STAT feature on your calculator!

EXAMPLE 1

Look up the formulae on pg. 13. Enter data in calculator & list summarised data needed: Start by finding b: So we need:

EXAMPLE 1 Look up the formulae on pg. 13. Enter data in calculator & list summarised data needed: Start by finding b: and…

EXAMPLE 1 Look up the formulae on pg. 13. Enter data in calculator & list summarised data needed: Start by finding b: Next, find a:

EXAMPLE 1 Look up the formulae on pg. 13. Enter data in calculator & list summarised data needed: Bring it all together in the form: y = a + bx

EXAMPLE 1 When foot size, x = 7, When foot size, x = 14,

NOTE: A regression equation can only confidently be used to predict values of y that correspond to x values that lie within the range of the data values available. It is unwise to extrapolate beyond the given data because it is not guaranteed that the trend will continue. EXAMPLE 1

CALCULATING THE LEAST SQUARES REGRESSION LINE IN YOUR GDC… In your GDC, press MENU STAT enter data in lists 1 and 2 CALC (F2) REG (F3) X (F1) a + bx (F2) copy down your a and b values in the form y = a + bx

EXAM QUESTION JANUARY 2013, Q1 a represents the y-intercept (or the value when time, x = 0), so we can expect a to be 15cm.

EXAM QUESTION JANUARY 2013, Q1

b = which indicates that the candle length decreases by 0.64cm per hour. As time increases, the length of the candle decreases.

EXAM QUESTION JANUARY 2013, Q1 When time, x = 50, It is impossible to have length of -1 cm, so the claim is not justified.

SCALING & RESIDUALS EXAMPLE 2

Recalculate the least squares regression line without this piece of data.

INFLUENTIAL DATA POINTS A data point may be both influential and an outlier. An influential data point has an x-value much greater (or less) than the other x-values. However, an influential data point may not be an outlier.

INDEPENDENT STUDY (CHOOSE 3 OR MORE!) o mymaths – Least Squares Regression Line o mymaths – Scaling and Residuals o Stats Textbook – Pg. 171, Ex. 6D o Stats Textbook – Pg. 175, Ex. 6E o Stats Textbook – Pg. 178, Ex. 6F o Stats Textbook – Pg. 181, Revision Exercise 6 o SET HW – CORRELATION & REGRESSION