Joint Distributions AND CORRELATION Coefficients (Part 3)

Slides:

Advertisements

Similar presentations

Copyright © Allyn & Bacon (2007) Statistical Analysis of Data Graziano and Raulin Research Methods: Chapter 5 This multimedia product and its contents.

Advertisements

Review ? ? ? I am examining differences in the mean between groups

Psychology: A Modular Approach to Mind and Behavior, Tenth Edition, Dennis Coon Appendix Appendix: Behavioral Statistics.

Table of Contents Exit Appendix Behavioral Statistics.

Education 793 Class Notes Joint Distributions and Correlation 1 October 2003.

Overview Correlation Regression -Definition

Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.

Chapter 6: Correlational Research Examine whether variables are related to one another (whether they vary together). Correlation coefficient: statistic.

Correlation CJ 526 Statistical Analysis in Criminal Justice.

Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.

CJ 526 Statistical Analysis in Criminal Justice

Calculating & Reporting Healthcare Statistics

Lesson Fourteen Interpreting Scores. Contents Five Questions about Test Scores 1. The general pattern of the set of scores  How do scores run or what.

Lecture 11 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D

Basic Statistical Concepts Psych 231: Research Methods in Psychology.

Regression and Correlation

Basic Statistical Concepts

Statistics Psych 231: Research Methods in Psychology.

Correlation “A statistician is someone who loves to work with numbers but doesn't have the personality to be an accountant.”

Chapter Seven The Correlation Coefficient. Copyright © Houghton Mifflin Company. All rights reserved.Chapter More Statistical Notation Correlational.

Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology.

Topics: Correlation The road map

Chapter 9 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 What is a Perfect Positive Linear Correlation? –It occurs when everyone has the.

2 Textbook Shavelson, R.J. (1996). Statistical reasoning for the behavioral sciences (3 rd Ed.). Boston: Allyn & Bacon. Supplemental Material Ruiz-Primo,

Correlation and Regression A BRIEF overview Correlation Coefficients l Continuous IV & DV l or dichotomous variables (code as 0-1) n mean interpreted.

© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.

Understanding Research Results

This Week: Testing relationships between two metric variables: Correlation Testing relationships between two nominal variables: Chi-Squared.

© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.

Data Collection & Processing Hand Grip Strength P textbook.

Simple Covariation Focus is still on ‘Understanding the Variability” With Group Difference approaches, issue has been: Can group membership (based on ‘levels.

Covariance and correlation

Bivariate Description Heibatollah Baghi, and Mastee Badii.

Chapter 15 Correlation and Regression

Statistics in Applied Science and Technology Chapter 13, Correlation and Regression Part I, Correlation (Measure of Association)

Thinking About Psychology: The Science of Mind and Behavior 2e Charles T. Blair-Broeker Randal M. Ernst.

Hypothesis of Association: Correlation

UNDERSTANDING RESEARCH RESULTS: DESCRIPTION AND CORRELATION © 2012 The McGraw-Hill Companies, Inc.

METHODS IN BEHAVIORAL RESEARCH NINTH EDITION PAUL C. COZBY Copyright © 2007 The McGraw-Hill Companies, Inc.

Basic Statistics Correlation Var Relationships Associations.

Descriptive Statistics

Chapter 6 Foundations of Educational Measurement Part 1 Jeffrey Oescher.

Figure 15-3 (p. 512) Examples of positive and negative relationships. (a) Beer sales are positively related to temperature. (b) Coffee sales are negatively.

Examining Relationships in Quantitative Research

TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.

Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.

By: Amani Albraikan.  Pearson r  Spearman rho  Linearity  Range restrictions  Outliers  Beware of spurious correlations….take care in interpretation.

DESCRIPTIVE STATISTICS © LOUIS COHEN, LAWRENCE MANION & KEITH MORRISON.

Describing Relationships Using Correlations. 2 More Statistical Notation Correlational analysis requires scores from two variables. X stands for the scores.

11/23/2015Slide 1 Using a combination of tables and plots from SPSS plus spreadsheets from Excel, we will show the linkage between correlation and linear.

Reasoning in Psychology Using Statistics Psychology

Chapter 16: Correlation. So far… We’ve focused on hypothesis testing Is the relationship we observe between x and y in our sample true generally (i.e.

Overview and interpretation

Chapter 15: Correlation. Correlations: Measuring and Describing Relationships A correlation is a statistical method used to measure and describe the relationship.

1 MVS 250: V. Katch S TATISTICS Chapter 5 Correlation/Regression.

© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.

Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.

Chapter 15 Association Between Variables Measured at the Interval-Ratio Level.

Educational Research Descriptive Statistics Chapter th edition Chapter th edition Gay and Airasian.

Chapter 12 Understanding Research Results: Description and Correlation

Statistical analysis.

Variables Dependent variable: measures an outcome of a study

Statistical analysis.

مقدمة في الإحصاء الحيوي مع تطبيقات برنامج الحزم الإحصائية SPSS

Understanding Research Results: Description and Correlation

Reasoning in Psychology Using Statistics

An Introduction to Correlational Research

Review I am examining differences in the mean between groups How many independent variables? OneMore than one How many groups? Two More than two ?? ?

Presentation transcript:

Joint Distributions AND CORRELATION Coefficients (Part 3)

Textbook Credits Textbook Shavelson, R.J. (1996). Statistical reasoning for the behavioral sciences (3rd Ed.). Boston: Allyn & Bacon. Supplemental Material Ruiz-Primo, M.A., Mitchell, M., & Shavelson, R.J. (1996). Student guide for Shavelson statistical reasoning for the behavioral sciences (3rd Ed.). Boston: Allyn & Bacon.

Overview Joint Distributions Correlation Coefficients

Joint Distributions and Correlation Coefficients Correlational studies answer the question “What is the relationship of variable X and variable Y?” or “How are scores on one measure (X) associated with scores on another measure(Y)?” First, we want to summarize the scores, and Second, examine the relationship between the scores on the two measures First step: Arrange the scores to represent them in the form of a joint distribution (the representation of a pair of scores for each subject) Second step: Summarize the relationship represented by the JD with a single number we call correlation coefficient(a descriptive statistic that represents the magnitude of the relation, 0 to |1|, and the direction of the relation, + or -).

The Psychological Belief Scale and Student Achievement Research Example The Psychological Belief Scale and Student Achievement Intuition and prior experience suggest that it is easier to learn from teachers who have the same beliefs as the students Prediction(intuition): Students with similar beliefs as their instructors will earn the highest scores on exams Exam scores should decrease as the difference in the students’ and instructors’ beliefs increases Study: 3 introductory Psych. classes at 3 different colleges with 7 students each, with variable X representing a Belief Score and Y representing an Exam Score What method to use? General: Combine the data and for all 3 classes examine one overall average X with Y? More Specific: Examine X and Y in each class separately? Are the data consistent with predictions?

The Psychological Belief Scale and Student Achievement Research Example The Psychological Belief Scale and Student Achievement Test 2 types of belief approaches: Humanistic(H) & Behavioristic(B) Example: The central focus of the study of human behavior should be The specific principles that apply to unique individuals(H) The general principles that apply to all individuals(B) Instructors and students received the belief scale beginning of course Behavioristic orientation on the belief scale indicative by high scores Humanistic orientation on the belief scale indicative by low scores

Joint Distribution: Tabular Representation Behavioristic orientation on the belief scale indicative by high scores Humanistic orientation on the belief scale indicative by low scores Achievement(exam) score: students’ total scores earned in all class exams

Joint Distribution: Tabular Representation Divided into 3 classes with 3 columns each. Take Class 1 as example: Low belief scores are associated with moderately high exam scores(subjects 1 & 2) Moderate belief scores are associated with high exam scores (subjects 3, 4, & 7) High belief scores are associated with low exam scores (subjects 5 & 6)

Relationship of Student’s Belief & Exam Scores Lines represent relationship between belief scale scores & exam scores The magnitude of students’ scores differ from one class to the next as each instructor gave a different exam So all exam scores were converted to standard scores showing how far above (+) or below (-) the class average a particular exam falls

Scatterplot of Student’s Belief & Exam Scores A graphical representation of a JD showing pairs of each subject’s scores

Scatterplots for 3 classes & Instructor’s Belief Score Comparison of Scatterplots for each of the 3 classes in the study Curvilinear Relationship Linear Relationship Suspect Outlier

Correlation Coefficients: Linear Relationships

Properties of Linear Correlation Coefficients The coefficient can take values from -1.00 to + 1.00 - A correlations of -0.95 indicates a very strong negative relationship between X & Y - A correlation of +0.95 indicates a very strong positive relationship between X & Y - A correlation of 0 indicates that there is no linear relationship between X & Y The sign indicates the direction of the relationship between 2 variables A positive relationship means: - Low scores on X go with low scores on Y - High scores on X go with high scores on Y(As X scores , Y scores ) A negative relationship means: - Low scores on X go with high scores on Y - High scores on X go with low scores on Y(As X scores , Y scores )

Determining the Correlation Coefficient Magnitude Scatterplot characteristics are indicative of slope and data clustering: - Correlation is 0 if slope is horizontal & vertical slope is undefined - The clustering of data points determines the magnitude of correlation - Tight clustering means the magnitude of the correlation coefficient is high - Lose clustering means the magnitude of the correlation coefficient is low

SAT & GPA Relationships Scatterplot characteristics are indicative of slope and data clustering: - Correlation is 0 if slope is vertical or horizontal

SAT & GPA Relationships Developing Statistics - Student’s #1 deviation score on the SAT is: - Student’s #1 deviation score on the GPA is: - Student 1 earned scores below the mean for both SAT and GPA 𝒙=𝑿− 𝑿 =𝟒𝟓𝟎−𝟓𝟕𝟕=−𝟏𝟐𝟕 𝒚=𝒀− 𝒀 =𝟐.𝟒𝟎−𝟑.𝟏𝟎=−𝟎.𝟕𝟎

SAT & GPA Minitab Results Descriptive Statistics: SAT(X), GPA(Y) Total Variable Count Mean StDev Variance Sum SAT(X) 5 577.0 126.1 15895.0 2885.0 GPA(Y) 5 3.100 0.477 0.228 15.500

Covariance of SAT & GPA Scores Measuring how two sets of deviation go together or covary - Student’s #1 covariance (cross product)is: - Note: When |x| and |y| are large  xy is large (students 1 & 5) - Note: When |x| and |y| are small  xy is small (students 2, 3, &4) - Covariance: - Pearson product-moment correlation coefficient measures the strength with X and Y - Correlation coefficient: 𝑪𝒐𝒗𝒙𝒚= 𝒙𝒚 𝑵−𝟏 = 𝟐𝟏𝟑.𝟔𝟓 𝟒 =𝟓𝟑.𝟒𝟏 𝒄𝒐𝒓𝒓𝒆𝒍𝒂𝒕𝒊𝒐𝒏 𝑿, 𝒀 =𝒓𝒙𝒚= 𝑪𝒐𝒗𝒙𝒚 𝒔𝒙𝒔𝒚 = 𝟓𝟑.𝟒𝟏 𝟏𝟐𝟔.𝟎𝟖 𝟎.𝟒𝟖 =𝟎.𝟖𝟗 𝒙𝒚= −𝟏𝟐𝟕 −𝟎.𝟕𝟎 =𝟖𝟖.𝟗𝟎

Covariance of SAT & GPA Scores Measuring how two sets of deviation go together or covary - Student’s #1 covariance (cross product)is: - Note: When |x| and |y| are large  xy is large (students 1 & 5) - Note: When |x| and |y| are small  xy is small (students 2, 3, &4) - Covariance: - Pearson product-moment correlation coefficient measures the strength with X and Y - Correlation coefficient: 𝒙𝒚= −𝟏𝟐𝟕 −𝟎.𝟕𝟎 =𝟖𝟖.𝟗𝟎 𝑪𝒐𝒗𝒙𝒚= 𝒙𝒚 𝑵−𝟏 = 𝟐𝟏𝟑.𝟔𝟓 𝟒 =𝟓𝟑.𝟒𝟏 𝒄𝒐𝒓𝒓𝒆𝒍𝒂𝒕𝒊𝒐𝒏 𝑿, 𝒀 =𝒓𝒙𝒚= 𝑪𝒐𝒗𝒙𝒚 𝒔𝒙𝒔𝒚 = 𝟓𝟑.𝟒𝟏 𝟏𝟐𝟔.𝟎𝟖 𝟎.𝟒𝟖 =𝟎.𝟖𝟗 Minitab Results Covariances: SAT(X), GPA(Y) SAT(X) GPA(Y) SAT(X) 15895.000 GPA(Y) 53.413 0.228 Correlations: SAT(X), GPA(Y) Pearson correlation of SAT(X) and GPA(Y) = 0.888

Correlation Between SAT & GPA Scores Looking at the scatterplot to validate the correlation findings - A linear relationship with a positive slope indicates a positive correlation - The absolute magnitude 0.89 provides an index of the relationship strength(-1to +1) - Points cluster closely about an imaginary line validating the relationship magnitude

Minitab Output: SAT & GPA Scores Looking at the scatterplot to validate the correlation findings - A linear relationship with a positive slope indicates a positive correlation - The absolute magnitude 0.89 provides an index of the relationship strength(-1to +1) - Points cluster closely about an imaginary line validating the relationship magnitude

Excel Output: SAT & GPA Scores

Excel Output SAT & GPA Scores

The Squared Correlation Coefficient The squared correlation coefficient is the coefficient of determination - It is the amount of variability that can be explained between X & Y Recall: The larger |rxy| is, the stronger the relationship between X & Y We previously found that: So Now we want to convert to percentage of variance - Tells us the percentage that X shares with Y in terms of variability to one another - The % of variance in Y and X that can be explained is: 𝒓𝒙𝒚= 𝑪𝒐𝒗𝒙𝒚 𝒔𝒙𝒔𝒚 = 𝟓𝟑.𝟒𝟏 𝟏𝟐𝟔.𝟎𝟖 𝟎.𝟒𝟖 =𝟎.𝟖𝟗 𝒓 𝟐 𝒙𝒚= 𝟎.𝟖𝟗 𝟐 =𝟎.𝟕𝟗𝟐𝟏 𝒓 𝟐 𝒙𝒚 ×𝟏𝟎𝟎= 𝟎.𝟕𝟗𝟐𝟏 ×𝟏𝟎𝟎=𝟕𝟗.𝟐𝟏%

Percentage of Variance Pictorial representation of the % of variance in exam scores accounted for by the variability in belief scores (computed from class 3 data) Variability in X Variability in Y

Spearman Rank Correlation Coefficient Non-linear (curvilinear) monotonic increasing or decreasing functions Monotonically decreasing f Monotonically increasing f

Spearman Rank Correlation Coefficient Example: Y is a monotonically increasing function of X

Spearman Rank Correlation Coefficient Rank ordering the data for both X & Y and graph - The converted ordered graph is now linear - We can now compute the Pearson correlation coefficient for ranks between X & Y

Spearman Rank Correlation Coefficient

Sources of Misleading Correlation Coefficients Too much confidence can lead to misleading interpretations - Restriction of the range of values on one of the variables may reduce the magnitude of the correlation coefficient

Sources of Misleading Correlation Coefficients Too much confidence can lead to misleading interpretations - Use of extreme groups may inflate the correlation coefficient

Sources of Misleading Correlation Coefficients Too much confidence can lead to misleading interpretations - Combining groups with different means on one or both variables may have an unpredictable effect on the correlation coefficient

Sources of Misleading Correlation Coefficients Too much confidence can lead to misleading interpretations - Extreme scores (Outliers) may have a marked effect on the correlation coefficient, especially if the sample size is small

Sources of Misleading Correlation Coefficients Too much confidence can lead to misleading interpretations - A curvilinear relationship between X and Y may account for a near-zero correlation coefficient No systematic relationship Curvilinearly related: Use the eta (h) ratio coefficient measurement instead of the Pearson correlation coefficient

Correlation and Causality Correlation does not imply causality Many possible interpretations of a correlation coefficient: Most common problem inferring causality from correlation: Selectivity! X: beliefs Y: Achievement Z: Knowledge gained from related courses

Practice Exercises Part 3 Practice Exercises Select a hypothetical product or a process and create some test data of your choice (plausible, no more than 10) as shown in textbook/class Show your type of experimental approach Create a detailed table of frequency distributions Display your data with different types of graphs Calculate the measures of central tendency and variability Calculate the Z-score(s) and indicate the relative position in the normal distribution. Provide any other pertinent information as a result Part 3 Practice Exercises Represent your joint distribution data in a tabular form Create a scatterplot of your data Create a covariance table (as table 6-4) and calculate the covariance Calculate the correlation of the two variables Calculate the R squared value and explain your findings as a result

Comments/Questions ?