Regression and Correlation

Slides:



Advertisements
Similar presentations
R Squared. r = r = -.79 y = x y = x if x = 15, y = ? y = (15) y = if x = 6, y = ? y = (6)
Advertisements

Copyright © 2006 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 ~ Curve Fitting ~ Least Squares Regression Chapter.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Bivariate Correlation and Regression CHAPTER Thirteen.
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Correlation and Regression
Correlation Correlation is the relationship between two quantitative variables. Correlation coefficient (r) measures the strength of the linear relationship.
© The McGraw-Hill Companies, Inc., 2000 CorrelationandRegression Further Mathematics - CORE.
Correlation and Linear Regression
Regression Regression: Mathematical method for determining the best equation that reproduces a data set Linear Regression: Regression method applied with.
Chapter Topics Types of Regression Models
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Lecture 5 Correlation and Regression
Linear Regression.
Introduction to Linear Regression and Correlation Analysis
© The McGraw-Hill Companies, Inc., 2000 Business and Finance College Principles of Statistics Lecture 10 aaed EL Rabai week
Biostatistics Unit 9 – Regression and Correlation.
Researchers, such as anthropologists, are often interested in how two measurements are related. The statistical study of the relationship between variables.
Jon Curwin and Roger Slater, QUANTITATIVE METHODS: A SHORT COURSE ISBN © Thomson Learning 2004 Jon Curwin and Roger Slater, QUANTITATIVE.
Anthony Greene1 Regression Using Correlation To Make Predictions.
© The McGraw-Hill Companies, Inc., Chapter 11 Correlation and Regression.
1.6 Linear Regression & the Correlation Coefficient.
Holt Algebra Curve Fitting with Linear Models 2-7 Curve Fitting with Linear Models Holt Algebra 2 Lesson Presentation Lesson Presentation.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
Regression Regression relationship = trend + scatter
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Correlation and Regression. Section 9.1  Correlation is a relationship between 2 variables.  Data is often represented by ordered pairs (x, y) and.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
Chapter Thirteen Copyright © 2006 John Wiley & Sons, Inc. Bivariate Correlation and Regression.
Creating a Residual Plot and Investigating the Correlation Coefficient.
Chapter 4 Summary Scatter diagrams of data pairs (x, y) are useful in helping us determine visually if there is any relation between x and y values and,
2-7 Curve Fitting with Linear Models Warm Up Lesson Presentation
Section 2.6 – Draw Scatter Plots and Best Fitting Lines A scatterplot is a graph of a set of data pairs (x, y). If y tends to increase as x increases,
Correlation The apparent relation between two variables.
Scatter Plots, Correlation and Linear Regression.
7-3 Line of Best Fit Objectives
Scatter Diagrams scatter plot scatter diagram A scatter plot is a graph that may be used to represent the relationship between two variables. Also referred.
Scatter Diagram of Bivariate Measurement Data. Bivariate Measurement Data Example of Bivariate Measurement:
2.5 Using Linear Models A scatter plot is a graph that relates two sets of data by plotting the data as ordered pairs. You can use a scatter plot to determine.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
.  Relationship between two sets of data  The word Correlation is made of Co- (meaning "together"), and Relation  Correlation is Positive when the.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
The following data represents the amount of Profit (in thousands of $) made by a trucking company dependent on gas prices. Gas $
Correlation and Regression Basic Concepts. An Example We can hypothesize that the value of a house increases as its size increases. Said differently,
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Describing Relationships. Least-Squares Regression  A method for finding a line that summarizes the relationship between two variables Only in a specific.
Correlation and Linear Regression
Regression and Correlation
CHAPTER 3 Describing Relationships
Sections Review.
Statistics 101 Chapter 3 Section 3.
Regression and Correlation
LSRL Least Squares Regression Line
The following data represents the amount of Profit (in thousands of $) made by a trucking company dependent on gas prices. Gas $
CHAPTER 10 Correlation and Regression (Objectives)
Suppose the maximum number of hours of study among students in your sample is 6. If you used the equation to predict the test score of a student who studied.
Correlation and Simple Linear Regression
1) A residual: a) is the amount of variation explained by the LSRL of y on x b) is how much an observed y-value differs from a predicted y-value c) predicts.
REGRESSION.
Correlation and Regression
Correlation and Simple Linear Regression
1.7 Nonlinear Regression.
11C Line of Best Fit By Eye, 11D Linear Regression
Lesson 2.2 Linear Regression.
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Regression and Correlation Biostatistics Unit 9 Regression and Correlation

Regression and Correlation Regression and correlation analysis studies the relationships between variables. This area of statistics was started in the 1860s by Francis Galton (1822-1911) who was also Darwin’s Cousin.

Data for Regression and Correlation Data are in the form of (x,y) pairs. A scatter plot (x-y) plot is used to display regression and correlation data. The regression line has the form y = mx + b In actual practice, two forms are used which are y = ax + b and y = a + bx.

General Regression Line y = a + bx + e a is the y-intercept b is the slope e is the error term

Calculations For each (x,y) point, the vertical distance from the point to the regression line is squared. Adding these gives the sum of squares. Regression analysis allows the experimenter to predict one value based on the value of another. A similar procedure is used in biochemistry with standard curves.

Data Data are in the form of (x,y) pairs. List L1 contains the x values and List L2 contains the y values.

Calculation of regression equation using TI-83 The Linear Regression test is used. Conclusion: The equation of the regression line is y = 4.54x – 1.57

Using the regression equation Interpolation is used to find values of points between the data points. This is a relatively safe and accurate process. Extrapolation is used to find values of points outside the range of the data. This process is more risky especially as you get further and further from the ends of the line. Be careful to make sure that the calculations give realistic results.

Significance of regression analysis It is possible to perform the linear regression t test to give a probability. In this test: b is the population regression coefficient r is the population correlation coefficient The hypotheses are: H0: b and r = 0 HA: b and r 0

Calculations and Results Calculator setup

Calculations and Results Conclusion: p < .001 (.000206)

Correlation Correlation is used to give information about the relationship between x and y. When the regression equation is calculated, the correlation results indicate the nature and strength of the relationship.

Correlation Coefficient The correlation coefficient, r, indicates the nature and strength of the relationship. Values of r range from -1 to +1. A correlation coefficient of 0 means that there is no relationship.

Correlation Coefficient Perfect negative correlation, r = -1.

Correlation Coefficient No correlation, r = 0.

Correlation Coefficient Perfect positive correlation, r = +1.

Coefficient of Determination The coefficient of determination is r2. It has values between 0 and 1. The value of r2 indicates the percentage of the relationship resulting from the factor being studied.

Graphs Scatter plot

Graphs Scatter plot with regression line

Data for calculations

Calculations Calculate the regression equation

Calculations Calculate the regression equation Result: The regression equation is y = 4.54x – 1.57

Calculations Calculate the correlation coefficient

Coefficient of Determination The coefficient of determination is r2. It indicates the percentage of the contribution that the factor makes toward the relationship between x and y. With r = .974, the coefficient of determination r2 = .948. This means that about 95% of the relationship is due to the temperature.

Residuals The distance that each point is above or below the line is called a residual. With a good relationship, the values of the residuals will be randomly scattered. If there is not a random residual plot then there is another factor or effect involved that needs attention.

Calculate the residual variance

Calculate the residual variance Result: The residual variance is 56.1366. Residual SD is 7.4924 which TI-83 gives.

Results of linear regression t test

Results of linear regression t test

fin