The Correlation Coefficient. Social Security Numbers.

Slides:



Advertisements
Similar presentations
IB Math Studies – Topic 6 Statistics.
Advertisements

Chapter 6: Exploring Data: Relationships Lesson Plan
Section 7.1 ~ Seeking Correlation
Basic Statistical Concepts Psych 231: Research Methods in Psychology.
Basic Statistical Concepts
Statistics Psych 231: Research Methods in Psychology.
SIMPLE LINEAR REGRESSION
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology.
Topics: Correlation The road map
Correlation & Regression Math 137 Fresno State Burger.
Vocabulary for Box and Whisker Plots. Box and Whisker Plot: A diagram that summarizes data using the median, the upper and lowers quartiles, and the extreme.
Ms. Albarico Often, when real-world data is plotted, the result is a linear pattern. The general direction of the data can be seen, but the data points.
Descriptive Methods in Regression and Correlation
Correlation By Dr.Muthupandi,. Correlation Correlation is a statistical technique which can show whether and how strongly pairs of variables are related.
Correlation and regression 1: Correlation Coefficient
Chapters 8 and 9: Correlations Between Data Sets Math 1680.
Covariance and correlation
Scatter Plots and Linear Correlation. How do you determine if something causes something else to happen? We want to see if the dependent variable (response.
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
Introduction to Quantitative Data Analysis (continued) Reading on Quantitative Data Analysis: Baxter and Babbie, 2004, Chapter 12.
Linear Regression When looking for a linear relationship between two sets of data we can plot what is known as a scatter diagram. x y Looking at the graph.
Chapter 6: Exploring Data: Relationships Lesson Plan Displaying Relationships: Scatterplots Making Predictions: Regression Line Correlation Least-Squares.
STATISTICS I COURSE INSTRUCTOR: TEHSEEN IMRAAN. CHAPTER 4 DESCRIBING DATA.
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 1 Section 13-6 Regression and Correlation.
MEASURES of CORRELATION. CORRELATION basically the test of measurement. Means that two variables tend to vary together The presence of one indicates the.
An easy way to remember the order of operations is to use the mnemonic device: PEMDAS. P → Parentheses (Grouping Symbols) E → Exponents (Powers) MD → Multiplication.
Lesson Scatterplots and Correlation. Knowledge Objectives Explain the difference between an explanatory variable and a response variable Explain.
STA291 Statistical Methods Lecture 10. Measuring something over time… o Like a company’s stock value: o It kind of just sits there, making your eyes glaze.
 z – Score  Percentiles  Quartiles  A standardized value  A number of standard deviations a given value, x, is above or below the mean  z = (score.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
CORRELATIONAL RESEARCH STUDIES
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Aim: What are the graphs for one and two variables data? Do Now: Use the STAT function on your calculator, enter L L
Chapter Bivariate Data (x,y) data pairs Plotted with Scatter plots x = explanatory variable; y = response Bivariate Normal Distribution – for.
Linear correlation and linear regression + summary of tests Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
LECTURE 9 Tuesday, 24 FEBRUARY STA291 Fall Administrative 4.2 Measures of Variation (Empirical Rule) 4.4 Measures of Linear Relationship Suggested.
Creating a Residual Plot and Investigating the Correlation Coefficient.
Chapter 4 Summary Scatter diagrams of data pairs (x, y) are useful in helping us determine visually if there is any relation between x and y values and,
Correlation The apparent relation between two variables.
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics.
9.1B – Computing the Correlation Coefficient by Hand
Scatter Diagram of Bivariate Measurement Data. Bivariate Measurement Data Example of Bivariate Measurement:
Chapter 4 Scatterplots and Correlation. Chapter outline Explanatory and response variables Displaying relationships: Scatterplots Interpreting scatterplots.
Data Summary Using Descriptive Measures Sections 3.1 – 3.6, 3.8
1 Data Analysis Linear Regression Data Analysis Linear Regression Ernesto A. Diaz Department of Mathematics Redwood High School.
7.1 Draw Scatter Plots and Best Fitting Lines Pg. 255 Notetaking Guide Pg. 255 Notetaking Guide.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
Math 3680 Lecture #18 Correlation. The Correlation Coefficient: Intuition.
.  Relationship between two sets of data  The word Correlation is made of Co- (meaning "together"), and Relation  Correlation is Positive when the.
LINEAR EQUATIONS & THEIR GRAPHS CHAPTER 6. INTRODUCTION We will explore in more detail rates of change and look at how the slope of a line relates to.
Unit 5 Lesson 2 Remediation Activity 2 Let’s practice some more with midpoint!
CORRELATION ANALYSIS.
Scatter Plots. Scatter plots are used when data from an experiment or test have a wide range of values. You do not connect the points in a scatter plot,
Summarizing Data Graphical Methods. Histogram Stem-Leaf Diagram Grouped Freq Table Box-whisker Plot.
1 MVS 250: V. Katch S TATISTICS Chapter 5 Correlation/Regression.
GOAL: I CAN USE TECHNOLOGY TO COMPUTE AND INTERPRET THE CORRELATION COEFFICIENT OF A LINEAR FIT. (S-ID.8) Data Analysis Correlation Coefficient.
By Christy Quattrone Click to View Types of Graphs Data Analysis, Grade 5.
Probability & Statistics Box Plots. Describing Distributions Numerically Five Number Summary and Box Plots (Box & Whisker Plots )
MM150 ~ Unit 9 Statistics ~ Part II. WHAT YOU WILL LEARN Mode, median, mean, and midrange Percentiles and quartiles Range and standard deviation z-scores.
Slide Copyright © 2009 Pearson Education, Inc. Types of Distributions Rectangular Distribution J-shaped distribution.
Stem-and-Leaf Plots …are a quick way to arrange a set of data and view its shape or distribution A key in the top corner shows how the numbers are split.
Correlation & Regression
6.2 Difference of 2 Squares Goal: To be able to recognize and completely factor the difference of two squares Remember to factor out a common factor before.
Aim: What are the graphs for one and two variables data?
Scatterplots A way of displaying numeric data
Correlation and Regression
Unit 4 Statistics Review
CORRELATION ANALYSIS.
The absolute value of each deviation.
Presentation transcript:

The Correlation Coefficient

Social Security Numbers

A Scatter Diagram

The Point of Averages Where is the center of the cloud? Take the average of the x-values and the average of the y-values; this is the point of averages. It locates the center of the cloud. Similarly, take the SD of the x-values and the SD of the y-values.

Examples

The Correlation Coefficient An association can be stronger or weaker. Remember: a strong association means that knowing one variable helps to predict the other variable to a large extend. The correlation coefficient is a numerical value expressing the strength of the association.

The Correlation Coefficient We denote the correlation coefficient by r. If r = 0, the cloud is completely formless; there is no correlation between the variables. If r = 1, all the points lie exactly on a line (not necessarily x = y) and there is perfect correlation.

Strong and Weak

The Correlation Coefficient What about negative values? The correlation coefficient is between –1 and 1, negative shows negative association, positive indicates positive association. Note that –0.90 shows the same degree of association as +0.90, only negative instead of positive.

Computing the Correlation Coefficient 1.Convert each variable to standard units. 2.The average of the products gives the correlation coefficient r. r = average of (x in standard units)  (y in standard units)

Example xy We must first convert to standard units. Find the average and the SD of the x-values: average = 4, SD = 2. Find the deviation: subtract the average from each value, and divide by the SD. Then do the same for the y-values.

Example Standard units xyxy x  yx  y

Example Finally, take the average of the products In this example, r = r = average of (x in standard units)  (y in standard units)

The SD line If there is some association, the points in the scatter diagram cluster around a line. But around which line? Generally, this is the SD line. It is the line through the point of averages. It climbs at the rate of one vertical SD for each horizontal SD. Its slope is (SD of y) / (SD of x) in case of a positive correlation, and –(SD of y) / (SD of x) in case of a negative correlation.

Five-point Summary Remember the five-point summary of a data set: minimum, lower quartile, median, upper quartile, and maximum. A five-point summary for a scatter plot is: average x-values, SD x-values, average y- values, SD y-values, and correlation coefficient r.