Class 2: Tues., Sept. 14th Correlation (2.2) Introduction to Measurement Theory: –Reliability of measurements and correlation –Example that demonstrates.

Slides:



Advertisements
Similar presentations
Scatterplots and Correlation
Advertisements

Correlation Data collected from students in Statistics classes included their heights (in inches) and weights (in pounds): Here we see a positive association.
EXAMINING RELATIONSHIPS Section 3.2 Correlation. Recall from yesterday…  A scatterplot displays form, direction, and strength of the relationship between.
Statistics 100 Lecture Set 7. Chapters 13 and 14 in this lecture set Please read these, you are responsible for all material Will be doing chapters
Chapter 8 Linear Regression © 2010 Pearson Education 1.
Describing the Relation Between Two Variables
Class 5: Thurs., Sep. 23 Example of using regression to make predictions and understand the likely errors in the predictions: salaries of teachers and.
Class 3: Thursday, Sept. 16 Reliability and Validity of Measurements Introduction to Regression Analysis Simple Linear Regression (2.3)
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 6: Correlation.
Stat 217 – Day 25 Regression. Last Time - ANOVA When?  Comparing 2 or means (one categorical and one quantitative variable) Research question  Null.
RESEARCH STATISTICS Jobayer Hossain Larry Holmes, Jr November 6, 2008 Examining Relationship of Variables.
Discovering and Describing Relationships
CHAPTER 3 Describing Relationships
Multiple Regression – Basic Relationships
Chapter 7 Correlational Research Gay, Mills, and Airasian
Aim: How do we calculate and interpret correlation coefficients with SPSS? SPSS Assignment Due Friday 2/12/10.
MAT 1000 Mathematics in Today's World. Last Time We saw how to use the mean and standard deviation of a normal distribution to determine the percentile.
Describing Relationships: Scatter Plots and Correlation ● The world is an indivisible whole (butterfly effect and chaos theory; quantum entanglement, etc.)
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
CENTRE FOR INNOVATION, RESEARCH AND COMPETENCE IN THE LEARNING ECONOMY Session 2: Basic techniques for innovation data analysis. Part I: Statistical inferences.
Scatterplots, Association, and Correlation Copyright © 2010, 2007, 2004 Pearson Education, Inc.
Covariance and correlation
September In Chapter 14: 14.1 Data 14.2 Scatterplots 14.3 Correlation 14.4 Regression.
A P STATISTICS LESSON 3 – 2 CORRELATION.
Chapter 15 Correlation and Regression
Examining Relationships Prob. And Stat. 2.2 Correlation.
Scatterplots, Associations, and Correlation
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 14 Measurement and Data Quality.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 Chapter 7 Scatterplots, Association, and Correlation.
Reliability & Validity
The Correlational Research Strategy
Psych 230 Psychological Measurement and Statistics Pedro Wolf September 23, 2009.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.1 Scatterplots.
Objectives 2.1Scatterplots  Scatterplots  Explanatory and response variables  Interpreting scatterplots  Outliers Adapted from authors’ slides © 2012.
Appraisal and Its Application to Counseling COUN 550 Saint Joseph College For Class # 3 Copyright © 2005 by R. Halstead. All rights reserved.
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
The Practice of Statistics
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Regression and Least Squares The need for a mathematical construct… Insert fig 3.8.
Relationships Scatterplots and correlation BPS chapter 4 © 2006 W.H. Freeman and Company.
4.2 Correlation The Correlation Coefficient r Properties of r 1.
Section 5.1: Correlation. Correlation Coefficient A quantitative assessment of the strength of a relationship between the x and y values in a set of (x,y)
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
Chapter 4 Scatterplots and Correlation. Chapter outline Explanatory and response variables Displaying relationships: Scatterplots Interpreting scatterplots.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
Chapter 16: Correlation. So far… We’ve focused on hypothesis testing Is the relationship we observe between x and y in our sample true generally (i.e.
LSP 120: Quantitative Reasoning and Technological Literacy Topic 1: Introduction to Quantitative Reasoning and Linear Models Lecture Notes 1.2 Prepared.
Developing a Hiring System Measuring Applicant Qualifications or Statistics Can Be Your Friend!
©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture1 Lecture 35: Chapter 13, Section 2 Two Quantitative Variables Interval.
Correlation  We can often see the strength of the relationship between two quantitative variables in a scatterplot, but be careful. The two figures here.
(Slides not created solely by me – the internet is a wonderful tool) SW388R7 Data Analysis & Compute rs II Slide 1.
Statistics 7 Scatterplots, Association, and Correlation.
5. Evaluation of measuring tools: reliability Psychometrics. 2011/12. Group A (English)
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Scatterplots Chapter 6.1 Notes.
Describing Relationships
CHAPTER 7 LINEAR RELATIONSHIPS
CHAPTER 3 Describing Relationships
SCATTERPLOTS, ASSOCIATION AND RELATIONSHIPS
Reliability & Validity
Describing Relationships
Evaluation of measuring tools: reliability
Chapter 2 Looking at Data— Relationships
Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Correlation Coefficient
3.2 Correlation Pg
Presentation transcript:

Class 2: Tues., Sept. 14th Correlation (2.2) Introduction to Measurement Theory: –Reliability of measurements and correlation –Example that demonstrates reliability is not the same as accuracy –Validity

Strength of association Strength of the association: Measure of how strong is the positive or negative association. Statistical associations are overall tendencies, not ironclad rules. If there is a strong association between two variables, then knowing one helps a lot in predicting the other. But when there is a weak association, information about one variable does not help much in guessing the other.

For which data set, (A) or (B), is X more strongly associated with Y?

Correlation Motivation: We would like a numerical measure of association. Correlation (r) : numerical measure of how close X and Y are to a straight line (the straight line that best summarizes the relationship of X and Y in scatterplot). Formula (do not need to use, we use JMP to calculate) Correlation is always between –1 and 1. Correlations near –1 indicate strong negative association (X and Y close to a downward sloping line), correlations near 0 indicate little association and correlations near 1 indicate strong positive association (X and Y close to an upward sloping line).

Computing Correlation in JMP Click Analyze, Multivariate Methods, Multivariate. Put variables into Y, Columns for which you want to compute the correlation. This produces scatterplot matrix and matrix of correlations above it. In scatterplot matrix, the ellipses contains approximately 95% of the points.

Properties of Correlation 1.Correlation makes no use of distinction between response and explanatory variables. It makes no difference what we call X and what we call Y. 2.Correlation requires both variables to be quantitative. We can’t compute correlation between religion and alcohol use. 3.The correlation is dimensionless. The correlation does not change if we change the units of measurement (e.g., change inches to feet) of X or Y. 4.The correlation also does not change if we add a constant to X or Y.

To summarize the relationship between X and Y numerically, we need to know the means and standard deviations of X and Y in addition to the correlation. Example 2.9: Competitive divers are scored on their form by a panel of judges who uses a scale from 1 to 10. We have the scores awarded by the two judges. Ivan and George, on a large number of dives. How well do they agree? We do some calculation and find that the correlation between their scores is r=0.9. But the mean of Ivan’s scores is 3 points lower than George’s mean.

Missing Data Missing data is a fact of life in most real data sets. In surveys, people often refuse or don’t bother to answer certain questions.

How JMP handles missing data for correlations In Analyze, Multivariate, the Correlations Multivariate option that comes up by default uses only units for which all variables listed in Y, Columns are recorded. To obtain pairwise correlations that use all the units for which both of the two variables being considered are recorded, click on the red triangle next to Multivariate and click Pairwise Correlations after obtaining the scatterplot matrix.

More Properties of Correlation When looking for relationships between variables, what size of correlation should we get excited about? Unfortunately, there is no absolute answer. Time series of two variables are usually highly correlated, often above In social science research on the relationship between people’s attitudes and their characteristics (e.g., income level), researchers are often excited by a correlation of Resistant statistic: Statistic that is not strongly affected by a few outliers. Correlation is not resistant.

Nonlinear relationships Correlation measures how close the observations in a scatterplot are to a straight line. Correlation is only a good measure of the association between two variables if the mean of X given Y roughly follows a straight line as Y increases.

Correlation and Nonlinear Relationships Correlation = But Strong Association Moral: Don’t use correlation to summarize association when relationship is nonlinear.

Correlation and Reliability of Measurements Measurement theory: Branch of applied statistics that attempts to describe, categorize, evaluate and improve the quality of measurements. Measurement theory for psychological attributes such as intellectual ability or personality is called psychometrics. Reliability of a measurement: The degree of consistency with which a trait or attribute is measured. A perfectly reliable measurement will produce the same value each time assuming the trait or attribute remains constant. Validity of a measurement: The degree to which a measurement measures what it purports to measure. The reliability of a measurement is often determined by the correlation between repeated measurements of the same trait/attribute.

Reliability of Pulse Measurement Correlation =

When is a Measurement Reliable Enough? It is often said that a reliability (correlation) of greater than 0.90 is high. For example, a reliability of 0.90 for educational tests has been considered adequate to assure the quality of standardized tests and large scale assessment programs.

Shoe Shopping and the Reliability Coefficient Citation: This example was developed by David Rogosa of Stanford University. Dedicated to Al Bundy A man who cares as much about good measurement as he does about his own children.

What reliability would you assign this man?

Reliability and Accuracy Try this on 1.A population of male and female shoe-shoppers who have true shoe sizes between size 5 and size 15 (e.g., the small sizes are female feet translated to the male shoe-size scale). 2.Mr. Bundy measures each shopper’s shoe size as either too large or too small with equal probability. On a good day Mr. Bundy misses the correct shoe size by one-half size too big or one-half size too small. On other days Mr. Bundy misses the correct shoe size by a full size too big or a full size too small. In each case the shoe size measurement error has mean 0 (overall and at each level of shoe size) and is uncorrelated with actual shoe size.

The accuracy of shoe fitting on the good day is poor (as most wearers would notice a half-size misfitting), and on the other days the accuracy is totally unacceptable (as a full-size misfitting would presumably be unwearable). The reliability coefficient for Al Bundy on the good day is.973 (better than any standardized test, even though accuracy is poor). The reliability coefficient for Al Bundy making errors of a full shoe size is.902 (comparable to many standardized tests, even though accuracy is unacceptable). Moral: Reliability is not the same as accuracy. We’ll discuss more about this and how to measure accuracy on Thursday.

Validity Validity of a measurement: The degree to which a measurement measures what it purports to measure. To what extent is pulse a valid measure of one’s general state of health. What time did you go to bed last night? –Is it reliable? –Is it valid as a measure of what time you actually went to bed? –Is it valid as a measure of time spent studying?

Summary Correlation: –Use: Measure of a certain type of association, how close points are to a straight line. –Caveat: Check in scatterplot that mean of Y given X is roughly a straight line as X increases. Otherwise, correlation is not a good measure of association. –Not good for measuring association that is nonlinear. Measurement Theory: –Reliability: The correlation between repeated measurements of same trait, quantifies how consistent is the measurement. Not the same as accuracy. –Validity: Does the measurement measure what it purports to measure.