Correlation and Regression Statistics 2126. Introduction Means etc are of course useful We might also wonder, “how do variables go together?” IQ is a.

Slides:



Advertisements
Similar presentations
Linear Regression (C7-9 BVD). * Explanatory variable goes on x-axis * Response variable goes on y-axis * Don’t forget labels and scale * Statplot 1 st.
Advertisements

Correlation Data collected from students in Statistics classes included their heights (in inches) and weights (in pounds): Here we see a positive association.
Review ? ? ? I am examining differences in the mean between groups
Education 793 Class Notes Joint Distributions and Correlation 1 October 2003.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Relationships Between Quantitative Variables Chapter 5.
Regression Wisdom.
1 The Basics of Regression. 2 Remember back in your prior school daze some algebra? You might recall the equation for a line as being y = mx + b. Or maybe.
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
Class 5: Thurs., Sep. 23 Example of using regression to make predictions and understand the likely errors in the predictions: salaries of teachers and.
Basic Statistical Concepts
Statistics for the Social Sciences Psychology 340 Fall 2006 Relationships between variables.
Correlation A correlation exists between two variables when one of them is related to the other in some way. A scatterplot is a graph in which the paired.
Ch 2 and 9.1 Relationships Between 2 Variables
Correlation and Regression. Relationships between variables Example: Suppose that you notice that the more you study for an exam, the better your score.
Correlation and Regression Analysis
Least Squares Regression Line (LSRL)
Linear Regression Analysis
Correlation & Regression
Correlation and Regression A BRIEF overview Correlation Coefficients l Continuous IV & DV l or dichotomous variables (code as 0-1) n mean interpreted.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Linear Regression.
Relationship of two variables
Scatterplots, Association, and Correlation Copyright © 2010, 2007, 2004 Pearson Education, Inc.
Correlation and regression 1: Correlation Coefficient
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Copyright © 2011 Pearson Education, Inc. Slide 5-1 Unit 5E Correlation Coefficient.
Biostatistics Unit 9 – Regression and Correlation.
Scatterplots, Association,
Scatterplots, Associations, and Correlation
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
1.6 Linear Regression & the Correlation Coefficient.
Statistical Analysis Topic – Math skills requirements.
Correlation Association between 2 variables 1 2 Suppose we wished to graph the relationship between foot length Height
Association between 2 variables
Chapter 20 Linear Regression. What if… We believe that an important relation between two measures exists? For example, we ask 5 people about their salary.
Chapter 10 Correlation and Regression
When trying to explain some of the patterns you have observed in your species and community data, it sometimes helps to have a look at relationships between.
1 Everyday is a new beginning in life. Every moment is a time for self vigilance.
Chapter 2 Looking at Data - Relationships. Relations Among Variables Response variable - Outcome measurement (or characteristic) of a study. Also called:
Scatterplot and trendline. Scatterplot Scatterplot explores the relationship between two quantitative variables. Example:
Objective: Understanding and using linear regression Answer the following questions: (c) If one house is larger in size than another, do you think it affects.
CORRELATIONAL RESEARCH STUDIES
Describing Relationships Using Correlations. 2 More Statistical Notation Correlational analysis requires scores from two variables. X stands for the scores.
Chapter 8 Linear Regression *The Linear Model *Residuals *Best Fit Line *Correlation and the Line *Predicated Values *Regression.
CORRELATION. Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson’s coefficient of correlation.
April 1 st, Bellringer-April 1 st, 2015 Video Link Worksheet Link
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
Correlation.
Chapter 12: Correlation and Linear Regression 1.
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
Correlation – Recap Correlation provides an estimate of how well change in ‘ x ’ causes change in ‘ y ’. The relationship has a magnitude (the r value)
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Regression Chapter 14.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
What Do You See?. A scatterplot is a graphic tool used to display the relationship between two quantitative variables. How to Read a Scatterplot A scatterplot.
Chapters 8 Linear Regression. Correlation and Regression Correlation = linear relationship between two variables. Summarize relationship with line. Called.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Statistics 7 Scatterplots, Association, and Correlation.
Correlation & Linear Regression Using a TI-Nspire.
Part II Exploring Relationships Between Variables.
Two-Variable Data Analysis
Chapter 12: Correlation and Linear Regression 1.
Linear Regression Essentials Line Basics y = mx + b vs. Definitions
Statistics 200 Lecture #6 Thursday, September 8, 2016
Practice. Practice Practice Practice Practice r = X = 20 X2 = 120 Y = 19 Y2 = 123 XY = 72 N = 4 (4) 72.
Chapter 4 Correlation.
Suppose the maximum number of hours of study among students in your sample is 6. If you used the equation to predict the test score of a student who studied.
EQ: How well does the line fit the data?
The Least-Squares Line Introduction
Algebra Review The equation of a straight line y = mx + b
Review I am examining differences in the mean between groups How many independent variables? OneMore than one How many groups? Two More than two ?? ?
Presentation transcript:

Correlation and Regression Statistics 2126

Introduction Means etc are of course useful We might also wonder, “how do variables go together?” IQ is a great example It goes together with so much stuff

A scatterplot You tend to put the predictor on the x axis and the predicted on the y, though this is not a hard and fast rule A scatterplot is a pretty good EDA tool too eh Pick an appropriate scale for you axes Plot the (x,y) pairs

So what does it mean If, as one variable increases, the other variable increases we have a positive association If, as one goes up, the other goes down, we have a negative association There could be no association at all

Linear relationships BTW, I am only talking about straight line relationships Not curvilinear Say like the Yerkes Dotson Law, as far as a the stuff we will talk about, there is no relationship, yet we know there is

The strength is important too The more the points cluster around a line, the stronger the relationship is Height and weight vs height in cm vs height in inches We need something that ignores the units though, so if I did IQ and your income in real money or IQ and your income in that worthless stuff they use across the river, the numbers would be the same

The Pearson Product Moment Correlation Coefficient

Properties of r <= r <= The sign indicates ONLY the direction (think of it as going uphill or downhill) |r| indicates the strength So, r = -.77 is a stronger correlation than r =.40

Some examples

EDA is KEY

Check these out.. All of these have have the same correlation R =.7 in each case Note the problem of outliers Note the problem of two subpopulations

Remember this Correlation is not causation I said, correlation is not causation Let me say it again, correlation is not causation Birth control and the toaster method

Wouldn’t it be nice If we could predict y from x You know, like an equation Remember that in school, you would get an equation, plug in the x and get the y Well surprise surprise, there is a method like this in statistics

If we are going to predict with a line Well, we will make mistakes We will want to minimize those mistakes

There is a problem, a common problem Those prediction errors or residuals (e) sum to 0 Damn Though guess what we could do… Why square them of course So we get a line that minimizes squared residuals

The line will look like this

In general the equation of the line is….. Y hat (predicted y) Y interceptslope

This might help

So…. With a regression line you can predict y from x Just because it says that some value = a linear combination of numbers it does not mean that there is necessarily a causal link Don’t go outside the range Linear only