DEPARTMENT OF STATISTICS  What are they?  When should I use them?  How do Excel and GCs handle them?  Why should I be careful with the Nulake text?

Slides:



Advertisements
Similar presentations
5.4 Correlation and Best-Fitting Lines
Advertisements

Chapter 4 The Relation between Two Variables
Chapter 10 Regression. Defining Regression Simple linear regression features one independent variable and one dependent variable, as in correlation the.
DEPARTMENT OF STATISTICS Stats Questions We Are Often Asked.
FPP 10 kind of Regression 1. Plan of attack Introduce regression model Correctly interpret intercept and slope Prediction Pit falls to avoid 2.
CORRELATON & REGRESSION
2.2 Correlation Correlation measures the direction and strength of the linear relationship between two quantitative variables.
Regression, Residuals, and Coefficient of Determination Section 3.2.
Regression and Correlation
7.1 Draw Scatter Plots & Best-Fitting Lines 7.1 HW Quiz: Friday 7.1, 7.2 Quiz: TBA 7.1, 7.2, 7.7 Test: Sept. 22 Make-up work needs to be made up by Monday.
Forecasting using trend analysis
Linear & Non-Linear Regression
Correlation and regression 1: Correlation Coefficient
Linear Trend Lines Y t = b 0 + b 1 X t Where Y t is the dependent variable being forecasted X t is the independent variable being used to explain Y. In.
12b. Regression Analysis, Part 2 CSCI N207 Data Analysis Using Spreadsheet Lingma Acheson Department of Computer and Information Science,
The Scientific Method Interpreting Data — Correlation and Regression Analysis.
© The McGraw-Hill Companies, Inc., 2000 Business and Finance College Principles of Statistics Lecture 10 aaed EL Rabai week
Biostatistics Unit 9 – Regression and Correlation.
Jon Curwin and Roger Slater, QUANTITATIVE METHODS: A SHORT COURSE ISBN © Thomson Learning 2004 Jon Curwin and Roger Slater, QUANTITATIVE.
 Graph of a set of data points  Used to evaluate the correlation between two variables.
2 pt 3 pt 4 pt 5pt 1 pt 2 pt 3 pt 4 pt 5 pt 1 pt 2pt 3 pt 4pt 5 pt 1pt 2pt 3 pt 4 pt 5 pt 1 pt 2 pt 3 pt 4pt 5 pt 1pt Slope-Intercept Form Point-Slope.
Regression. Population Covariance and Correlation.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Scatter Diagrams Objective: Draw and interpret scatter diagrams. Distinguish between linear and nonlinear relations. Use a graphing utility to find the.
Exam Review Day 6 Chapters 2 and 3 Statistics of One Variable and Statistics of Two Variable.
Creating a Residual Plot and Investigating the Correlation Coefficient.
3.3 Correlation: The Strength of a Linear Trend Estimating the Correlation Measure strength of a linear trend using: r (between -1 to 1) Positive, Negative.
Section 2.6 – Draw Scatter Plots and Best Fitting Lines A scatterplot is a graph of a set of data pairs (x, y). If y tends to increase as x increases,
Requirements of a good GRAPH. GRAPH  Title (usually “dependent” vs. “independent”)  Go Big (cover at least ½ the page in both directions) This increases.
What you will learn How to draw and analyze a scatter plot
* SCATTER PLOT – GRAPH WITH MANY ORDERS PAIRS * LINE OF BEST FIT – LINE DRAWN THROUGH DATA THAT BEST REPRESENTS IT.
Chapter 4: Correlation and Regression 4.1 – Scatter Diagrams and Linear Correlation 4.2 – Linear Regression and the Coefficient of Determinant.
Scatter Diagrams scatter plot scatter diagram A scatter plot is a graph that may be used to represent the relationship between two variables. Also referred.
2.5 Using Linear Models A scatter plot is a graph that relates two sets of data by plotting the data as ordered pairs. You can use a scatter plot to determine.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
Scatterplots and Linear Regressions Unit 8. Warm – up!! As you walk in, please pick up your calculator and begin working on your warm – up! 1. Look at.
Fall Looking Back In Chapters 7 & 8, we worked with LINEAR REGRESSION We learned how to: Create a scatterplot Describe a scatterplot Determine the.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Regression Analysis Deterministic model No chance of an error in calculating y for a given x Probabilistic model chance of an error First order linear.
Correlation and Median-Median Line Statistics Test: Oct. 20 (Wednesday)
Chapter 8 Linear Regression. Fat Versus Protein: An Example 30 items on the Burger King menu:
Introduction to Engineering Calculations Chapter 2.
Linear Best Fit Models Learn to identify patterns in scatter plots, and informally fit and use a linear model to solve problems and make predictions as.
PreCalculus 1-7 Linear Models. Our goal is to create a scatter plot to look for a mathematical correlation to this data.
1.6 Modeling Real-World Data with Linear Functions Objectives Draw and analyze scatter plots. Write a predication equation and draw best-fit lines. Use.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
1. Analyzing patterns in scatterplots 2. Correlation and linearity 3. Least-squares regression line 4. Residual plots, outliers, and influential points.
Correlation & Forecasting
Lecture 9 Sections 3.3 Objectives:
Warm Up Scatter Plot Activity.
SCATTERPLOTS, ASSOCIATION AND RELATIONSHIPS
Stats Questions We Are Often Asked
CHAPTER 10 Correlation and Regression (Objectives)
Investigating Relationships
2. Find the equation of line of regression
2.6 Draw Scatter Plots and Best-Fitting Lines
STEM Fair Graphs.
Scatter Plots and Best-Fit Lines
1.7 Nonlinear Regression.
Section 1.4 Curve Fitting with Linear Models
Regression.
Correlation and Regression
11A Correlation, 11B Measuring Correlation
7.1 Draw Scatter Plots & Best-Fitting Lines
y = mx + b Linear Regression line of best fit REMEMBER:
DSS-ESTIMATING COSTS Cost estimation is the process of estimating the relationship between costs and cost driver activities. We estimate costs for three.
Unit 2 Quantitative Interpretation of Correlation
Applying linear and median regression
Presentation transcript:

DEPARTMENT OF STATISTICS  What are they?  When should I use them?  How do Excel and GCs handle them?  Why should I be careful with the Nulake text? r and R 2 Matt Regan (et al) Department of Statistics The University of Auckland

DEPARTMENT OF STATISTICS r – little r – what is it?  r is the correlation coefficient between y and x  r measures the strength of a linear relationship  r is a multiple of the slope

DEPARTMENT OF STATISTICS r – when can it be used?  Only use r if the scatter plot is linear  Don’t use r if the scatter plot is non-linear! x y * * * * * * * * ** * * * * * * * * * * r = 0.99

DEPARTMENT OF STATISTICS r – what does it tell you?  How close the points in the scatter plot come to lying on the line r = 0.99 x y * * * * * * * * ** * * * * * * * * * * r = 0.57 x y * * * * * * * * * * * * * * * * * * * *

DEPARTMENT OF STATISTICS R 2 – big R 2 – what is it?  R 2 is the coefficient of determination  Measures how close the points in the scatter plot come to lying on the fitted line or curve x y * * * * * * * * * * * * * * * * * * * * x y * * * * * * * * ** * * * * * * * * * *

DEPARTMENT OF STATISTICS R 2 – big R 2 – when can it be used?  When the scatter plot of y versus x is linear or non-linear x y * * * * * * * * * * * * * * * * * * * * x y * * * * * * * * ** * * * * * * * * * *

DEPARTMENT OF STATISTICS R 2 – what does it tell you? xx Dotplot of the y ’s Shows the variation in the y ’s y y ˆ Dotplot of the y ’s Shows the variation in the y ’s ˆ ˆ

DEPARTMENT OF STATISTICS R 2 – what does it tell you? x We see some additional variation in the y ’s. The excess is not explained by the model. y ˆ y 2 Variation in y 's ˆ Variation in fitted values Variation in y values Variation in y 's R = = Variation in the y ’s: This amount of variation can be explained by the model ˆ

DEPARTMENT OF STATISTICS R 2 – what does it tell you?  When expressed as a percentage, R 2 is the percentage of the variation in Y that our regression model can explain  R 2 near 100%  model fits well  R 2 near 0%  model doesn’t fit well

DEPARTMENT OF STATISTICS R 2 – what does it tell you?  90% of the variation in Y is explained by our regression model. x y * * * * * * * * * * * * * * * * * * * * R 2 = 90%

DEPARTMENT OF STATISTICS R 2 – pearls of wisdom!  R 2 and r 2 have the same value ONLY when using a linear model  DON’T use R 2 to pick your model  Use your eyes!

DEPARTMENT OF STATISTICS R 2 and Excel & Graphics Calculators

DEPARTMENT OF STATISTICS Year 13 Statistics and Modelling Workbook – Nulake

DEPARTMENT OF STATISTICS 5.0 Statistical Investigation: page 224 – 261

DEPARTMENT OF STATISTICS 5.2 The Regression Line – p227 This straight line is called a regression line and we are required to calculate its equation. This is best done with a graphics calculator or spreadsheet programme on a computer. Regression Lines by Inspection You should only attempt to estimate a regression line if you do not have the technology available.

DEPARTMENT OF STATISTICS 5.2 The Regression Line – p229 Non-linear Modelling Equations It is possible to use your calculator to find other regression equations other than a straight line. These non-linear models are also built into your calculator. If they result in a coefficient closer to 1 then they are more appropriate than the straight line regression equation.

DEPARTMENT OF STATISTICS 5.4 Non-linear Regression Models – p241 Non-linear Regression Models It is possible that the best model for some data is a non- linear regression line (e.g. a curve) such as an exponential or power function. All the technology aids (calculator and spreadsheets programmes) are able to model data with different models and by inspecting the correlation coefficient, the researcher should be able to determine the best model.

DEPARTMENT OF STATISTICS 5.4 Non-linear Regression Models – p241

DEPARTMENT OF STATISTICS 5.4 Non-linear Regression Models – p241

DEPARTMENT OF STATISTICS 5.4 Non-linear Regression Models – p242

DEPARTMENT OF STATISTICS Modelling: Drawing Graphs and adding a Trendline using a Spreadsheet – page 337

DEPARTMENT OF STATISTICS Modelling: Drawing Graphs and adding a Trendline using a Spreadsheet – page 337

DEPARTMENT OF STATISTICS Correlation – What can go wrong? x1y1 x2y2 x3y3 x4y N11 Mean of x’s9.0 Mean of y’s7.5 Equation of regression liney = x Correlation coefficient (r )0.82

DEPARTMENT OF STATISTICS Correlation – What can go wrong?