Correlation Measures the relative strength of the linear relationship between two variables Unit-less Ranges between –1 and 1 The closer to –1, the stronger.

Slides:



Advertisements
Similar presentations
Review of ANOVA and linear regression. Review of simple ANOVA.
Advertisements

Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
FACTOR THE FOLLOWING: Opener. 2-5 Scatter Plots and Lines of Regression 1. Bivariate Data – data with two variables 2. Scatter Plot – graph of bivariate.
Correlation 1. Correlation - degree to which variables are associated or covary. (Changes in the value of one tends to be associated with changes in the.
Linear Regression Analysis
The Line of Best Fit Linear Regression. Definition - A Line of Best or a trend line is a straight line on a Scatter plot that comes closest to all of.
Graph the linear function What is the domain and the range of f?
Regression Analysis. Scatter plots Regression analysis requires interval and ratio-level data. To see if your data fits the models of regression, it is.
Correlation and Linear Regression INCM Correlation  Correlation coefficients assess strength of linear relationship between two quantitative variables.
Correlation Correlation is used to measure strength of the relationship between two variables.
Part IV Significantly Different: Using Inferential Statistics
Linear correlation and linear regression + summary of tests
CORRELATION: Correlation analysis Correlation analysis is used to measure the strength of association (linear relationship) between two quantitative variables.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 16 Data Analysis: Testing for Associations.
Aim: Review for Exam Tomorrow. Independent VS. Dependent Variable Response Variables (DV) measures an outcome of a study Explanatory Variables (IV) explains.
Correlation and Regression. Section 9.1  Correlation is a relationship between 2 variables.  Data is often represented by ordered pairs (x, y) and.
Linear correlation and linear regression + summary of tests Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
3.3 Correlation: The Strength of a Linear Trend Estimating the Correlation Measure strength of a linear trend using: r (between -1 to 1) Positive, Negative.
Chapter 4 Summary Scatter diagrams of data pairs (x, y) are useful in helping us determine visually if there is any relation between x and y values and,
Correlation and Regression: The Need to Knows Correlation is a statistical technique: tells you if scores on variable X are related to scores on variable.
MARE 250 Dr. Jason Turner Linear Regression. Linear regression investigates and models the linear relationship between a response (Y) and predictor(s)
* SCATTER PLOT – GRAPH WITH MANY ORDERS PAIRS * LINE OF BEST FIT – LINE DRAWN THROUGH DATA THAT BEST REPRESENTS IT.
5.4 Line of Best Fit Given the following scatter plots, draw in your line of best fit and classify the type of relationship: Strong Positive Linear Strong.
Correlation.
Scatter Diagrams scatter plot scatter diagram A scatter plot is a graph that may be used to represent the relationship between two variables. Also referred.
2.5 Using Linear Models A scatter plot is a graph that relates two sets of data by plotting the data as ordered pairs. You can use a scatter plot to determine.
Scatterplots and Linear Regressions Unit 8. Warm – up!! As you walk in, please pick up your calculator and begin working on your warm – up! 1. Look at.
.  Relationship between two sets of data  The word Correlation is made of Co- (meaning "together"), and Relation  Correlation is Positive when the.
Scatter Plots & Lines of Best Fit To graph and interpret pts on a scatter plot To draw & write equations of best fit lines.
GOAL: I CAN USE TECHNOLOGY TO COMPUTE AND INTERPRET THE CORRELATION COEFFICIENT OF A LINEAR FIT. (S-ID.8) Data Analysis Correlation Coefficient.
Correlation & Linear Regression Using a TI-Nspire.
Statistics 200 Lecture #6 Thursday, September 8, 2016
Linear Regression.
Regression Analysis.
Regression Analysis AGEC 784.
Correlation & Regression
Data Analysis Module: Correlation and Regression
Chindamanee School English Program
Lines of Best Fit #1 When data show a correlation, you can estimate and draw ____________________ that approximates a trend for a set of data and use it.
Tests for Continuous Outcomes II
Scatterplots A way of displaying numeric data
Interpret Scatterplots & Use them to Make Predictions
Section 13.7 Linear Correlation and Regression
Describe the association’s Form, Direction, and Strength
CHAPTER 10 Correlation and Regression (Objectives)
Simple Linear Regression
Correlation & Linear Regression
2. Find the equation of line of regression
2.6 Draw Scatter Plots and Best-Fitting Lines
Lesson 5.3 How do you write linear equations in point-slope form?
Multiple Regression A curvilinear relationship between one variable and the values of two or more other independent variables. Y = intercept + (slope1.
Correlation and Regression
Scatter Plots of Data with Various Correlation Coefficients
The Weather Turbulence
CORRELATION ANALYSIS.
STA 291 Summer 2008 Lecture 23 Dustin Lueker.
Line of Best Fit.
Section 1.4 Curve Fitting with Linear Models
Simple Linear Regression
Correlation and Regression
y = mx + b Linear Regression line of best fit REMEMBER:
CORRELATION AND MULTIPLE REGRESSION ANALYSIS
Objective: Interpret Scatterplots & Use them to Make Predictions.
7.1 Draw Scatter Plots and Best Fitting Lines
Correlation & Trend Lines
Warsaw Summer School 2017, OSU Study Abroad Program
STA 291 Spring 2008 Lecture 23 Dustin Lueker.
Scatter Plots Learning Goals
Presentation transcript:

Correlation Measures the relative strength of the linear relationship between two variables Unit-less Ranges between –1 and 1 The closer to –1, the stronger the negative linear relationship The closer to 1, the stronger the positive linear relationship The closer to 0, the weaker any linear relationship

Scatter Plots of Data with Various Correlation Coefficients Y Y Y X X X r = -1 r = -.6 r = 0 Y Y Y X X X r = +1 r = +.3 r = 0

Linear Correlation Linear relationships Curvilinear relationships Y Y X X Y Y X X

Linear Correlation Strong relationships Weak relationships Y Y X X Y Y

Linear Correlation No relationship Y X Y X

Linear regression In correlation, the two variables are treated as equals. In regression, one variable is considered independent (=predictor) variable (X) and the other the dependent (=outcome) variable Y.

What is “Linear”? Remember this: Y=mX+B? m B

What’s Slope? A slope of 2 means that every 1-unit change in X yields a 2-unit change in Y. Prediction: If you know something about X, this knowledge helps you predict something about Y.

Dataset 1: no relationship

Dataset 2: weak relationship

Dataset 3: weak to moderate relationship

Dataset 4: moderate relationship

The “Best fit” line Regression equation: E(Yi) = 28 + 0*vit Di (in 10 nmol/L)

The “Best fit” line Note how the line is a little deceptive; it draws your eye, making the relationship appear stronger than it really is! Regression equation: E(Yi) = 26 + 0.5*vit Di (in 10 nmol/L)

The “Best fit” line Regression equation: E(Yi) = 22 + 1.0*vit Di (in 10 nmol/L)

The “Best fit” line Regression equation: E(Yi) = 20 + 1.5*vit Di (in 10 nmol/L) Note: all the lines go through the point (63, 28)!

Multiple linear regression… What if age is a confounder here? Older men have lower vitamin D Older men have poorer cognition “Adjust” for age by putting age in the model: DSST score = intercept + slope1xvitamin D + slope2 xage

2 predictors: age and vit D…

Different 3D view…

Fit a plane rather than a line… On the plane, the slope for vitamin D is the same at every age; thus, the slope for vitamin D represents the effect of vitamin D when age is held constant.

Equation of the “Best fit” plane… DSST score = 53 + 0.0039xvitamin D (in 10 nmol/L) - 0.46 xage (in years) P-value for vitamin D >>.05 P-value for age <.0001 Thus, relationship with vitamin D was due to confounding by age!

Multiple Linear Regression More than one predictor… E(y)=  + 1*X + 2 *W + 3 *Z… Each regression coefficient is the amount of change in the outcome variable that would be expected per one-unit change of the predictor, if all other variables in the model were held constant. Control for confounders; improve predictions