1.6 Linear Regression & the Correlation Coefficient.

Slides:



Advertisements
Similar presentations
Chapter 3 Examining Relationships Lindsey Van Cleave AP Statistics September 24, 2006.
Advertisements

AP Statistics Section 3.2 C Coefficient of Determination
Lesson 10: Linear Regression and Correlation
Linear regression and correlation
Regression, Correlation. Research Theoretical empirical Usually combination of the two.
Correlation and Regression
1 Simple Linear Regression and Correlation The Model Estimating the Coefficients EXAMPLE 1: USED CAR SALES Assessing the model –T-tests –R-square.
Chapter 10 Regression. Defining Regression Simple linear regression features one independent variable and one dependent variable, as in correlation the.
2.2 Correlation Correlation measures the direction and strength of the linear relationship between two quantitative variables.
Regression and Correlation
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
REGRESSION Predict future scores on Y based on measured scores on X Predictions are based on a correlation from a sample where both X and Y were measured.
Simple Linear Regression. Introduction In Chapters 17 to 19, we examine the relationship between interval variables via a mathematical equation. The motivation.
Correlation & Regression Math 137 Fresno State Burger.
Regression, Residuals, and Coefficient of Determination Section 3.2.
Correlation & Regression
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Linear Regression.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Overview 4.2 Introduction to Correlation 4.3 Introduction to Regression.
Copyright © Cengage Learning. All rights reserved.
Correlation and regression 1: Correlation Coefficient
 Once data is collected and organized, we need to analyze the strength of the relationship and formalize it with an equation  By understanding the strength.
Scatter Plots and Linear Correlation. How do you determine if something causes something else to happen? We want to see if the dependent variable (response.
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Chapter 4 Correlation and Regression Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
Biostatistics Unit 9 – Regression and Correlation.
Linear Regression When looking for a linear relationship between two sets of data we can plot what is known as a scatter diagram. x y Looking at the graph.
Chapter 6 & 7 Linear Regression & Correlation
Chapter 3 concepts/objectives Define and describe density curves Measure position using percentiles Measure position using z-scores Describe Normal distributions.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 1 Section 13-6 Regression and Correlation.
Correlation is a statistical technique that describes the degree of relationship between two variables when you have bivariate data. A bivariate distribution.
Section 5.2: Linear Regression: Fitting a Line to Bivariate Data.
2 Variable Stats Does the amount of time spent on homework improve your grade? If you eat more junk food will you gain weight? Does the amount of rainfall.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Objective: Understanding and using linear regression Answer the following questions: (c) If one house is larger in size than another, do you think it affects.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Chapter 11 Correlation and Simple Linear Regression Statistics for Business (Econ) 1.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
Scatter Plots, Correlation and Linear Regression.
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
Economics 173 Business Statistics Lecture 10 Fall, 2001 Professor J. Petry
1 Data Analysis Linear Regression Data Analysis Linear Regression Ernesto A. Diaz Department of Mathematics Redwood High School.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.3 Predicting the Outcome.
Linear Prediction Correlation can be used to make predictions – Values on X can be used to predict values on Y – Stronger relationships between X and Y.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Simple Linear Regression SECTION 2.6 Least squares line Interpreting coefficients.
1 Simple Linear Regression and Correlation Least Squares Method The Model Estimating the Coefficients EXAMPLE 1: USED CAR SALES.
Section 1.6 Fitting Linear Functions to Data. Consider the set of points {(3,1), (4,3), (6,6), (8,12)} Plot these points on a graph –This is called a.
Unit 4 Lesson 3 (5.3) Summarizing Bivariate Data 5.3: LSRL.
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
Chapter 5 Lesson 5.2 Summarizing Bivariate Data 5.2: LSRL.
Chapters 8 Linear Regression. Correlation and Regression Correlation = linear relationship between two variables. Summarize relationship with line. Called.
Copyright © Cengage Learning. All rights reserved. 8 9 Correlation and Regression.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Describing Relationships. Least-Squares Regression  A method for finding a line that summarizes the relationship between two variables Only in a specific.
Part II Exploring Relationships Between Variables.
Regression and Correlation of Data Correlation: Correlation is a measure of the association between random variables, say X and Y. No assumption that one.
Describing Bivariate Relationships. Bivariate Relationships When exploring/describing a bivariate (x,y) relationship: Determine the Explanatory and Response.
Copyright © Cengage Learning. All rights reserved.
Correlation & Regression
SCATTERPLOTS, ASSOCIATION AND RELATIONSHIPS
Linear Trends and Correlations
Section 13.7 Linear Correlation and Regression
Correlation and Regression
1.7 Nonlinear Regression.
Least-Squares Regression
9/27/ A Least-Squares Regression.
Presentation transcript:

1.6 Linear Regression & the Correlation Coefficient

Linear regression In data analysis, often want to determine if one variable (dependent/response) affected by another (independent/explanatory) Linear correlation –changes in Y tend to be proportional to changes in X Positive correlation –Y increases at constant rate while X increases Negative correlation –Y decreases at constant rate while X increases

Line of Best Fit Straight line that passes “as close as possible” to all the points on a scatterplot What does “as close as possible” mean? Linear regression –Analytic technique for determining relationship between Y and X –Usual method is least squares fit

Least Squares Fit Look at the square of the residual –Vertical distance between actual point and line of best fit Actual point Estimated point on line of best fit } Residual Want to minimize sum of the squares

Linear Regression Formulas y = ax + b Ick! We’ll let the computer calculate this for us

Linear Regression Use least squares to get equation of line of best fit The equation is a mathematical model of the relationship y = ax + b a is the slope, how y changes as x changes b is the y-intercept –Might be meaningless outside the data set

NOTE! Slope only tells you how Y varies with X on line of best fit Says nothing about strength of correlation The stronger the correlation, the more closely the data points cluster around the line of best fit How strong is strong?

The Correlation Coefficient, r r is quantitative measure of strength of linear relationship Related to product of deviations from means -1≤ r ≤1 r = 1 –Perfect positive correlation: all the points lie directly on the line of best fit r = -1 –Perfect negative correlation

The Correlation Coefficient, r When no correlation, Y equally likely to increase or decrease as X increases Deviations are randomly positive or negative Tend to cancel each other out r close to 0 if little or no correlation between variables

How r corresponds to strength of linear correlation Correlation Coefficient, r Negative Linear Correlation StrongWeak Moderate StrongWeak Moderate Positive Linear Correlation

How to calculate r Ick! Get the computer to do it for you

Example A farmer records the following data: Open “1.6.1 Farmer example.ftm” Create a scatterplot and sketch at right Under Graph, use Least Squares Line Write the equation and the correlation coefficient Note: Fathom calculates r 2. How do you find r? What conclusion can you draw?

Solution y = r = Moderate positive linear correlation Slope: the yield goes up by tonnes per hectare for every degree increase in the mean temperature y-intercept: the yield is 1.66 tonnes per hectare at 0 ºC (does this make sense? Note: moderate correlation does not mean how fast Y varies with X x

Let’s go to the handouts!