Linear Correlation. PSYC 6130, Prof. J. Elder.


Linear Correlation

Perfect Correlation
Two variables x and y are perfectly correlated if they are related by an affine transform y = ax + b with a ≠ 0. The correlation is positive if a > 0 and negative if a < 0. As a corollary, two variables are perfectly positively correlated if and only if each pair of corresponding values has the same z-score. If the two variables are perfectly negatively correlated, corresponding z-scores are equal in magnitude but opposite in sign.
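The z-score property can be checked numerically. The following is a minimal sketch, not taken from the slides: the data values and the helper names `z_scores` and `pearson_r` are illustrative assumptions, and r is computed as the mean product of paired z-scores using population standard deviations.

```python
import math

def z_scores(values):
    """Standardize values using the population standard deviation."""
    n = len(values)
    mean = sum(values) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    return [(v - mean) / sd for v in values]

def pearson_r(x, y):
    """Pearson's r as the mean product of paired z-scores."""
    zx, zy = z_scores(x), z_scores(y)
    return sum(a * b for a, b in zip(zx, zy)) / len(x)

# Hypothetical data, chosen only for illustration.
x = [1.0, 2.0, 4.0, 7.0, 11.0]
y_pos = [3.0 * v + 2.0 for v in x]    # affine transform with a > 0
y_neg = [-0.5 * v + 10.0 for v in x]  # affine transform with a < 0

print("r (a > 0):", round(pearson_r(x, y_pos), 6))
print("r (a < 0):", round(pearson_r(x, y_neg), 6))
print("z(x):    ", [round(z, 3) for z in z_scores(x)])
print("z(y_pos):", [round(z, 3) for z in z_scores(y_pos)])
print("z(y_neg):", [round(z, 3) for z in z_scores(y_neg)])
```

With a > 0 the two z-score lists should match element by element, and with a < 0 they should match after a sign flip.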

Pearson’s r

Scatterplots

Pearson’s r only measures linear dependence
Two variables can have low correlation and still be highly dependent.
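A small sketch of this point, using made-up data rather than anything from the slides: y is completely determined by x through y = x², yet Pearson's r comes out at zero because the relationship has no linear component over a range symmetric about zero. The helper `pearson_r` is an illustrative assumption.

```python
import math

def pearson_r(x, y):
    """Pearson's r from sums of squared deviations and cross-products."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical data: y depends on x exactly, but not linearly.
x = [-3.0, -2.0, -1.0, 0.0, 1.0, 2.0, 3.0]
y = [v ** 2 for v in x]

r = pearson_r(x, y)
print("r between x and x^2:", round(r, 6))
```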

Higher-Order Models

Pearson’s r depends on the range of the variables under study
r² measures the proportion of variance in one variable accounted for by the other. If the range of variable X is restricted, X will generally account for less of the variance in Y.
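The effect of range restriction can be illustrated with a simulation. This is a sketch under stated assumptions, not the slide's own example: X is drawn from a standard normal distribution, Y is X plus independent unit-variance noise, and the "restricted" sample keeps only pairs with |X| < 0.5. The seed, sample size, and cutoff are arbitrary choices.

```python
import math
import random

def pearson_r(x, y):
    """Pearson's r from sums of squared deviations and cross-products."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

rng = random.Random(0)  # fixed seed so the sketch is repeatable
x = [rng.gauss(0.0, 1.0) for _ in range(2000)]
y = [xi + rng.gauss(0.0, 1.0) for xi in x]  # same noise level everywhere

r_full = pearson_r(x, y)

# Keep only the pairs whose X value falls in a narrow band.
kept = [(xi, yi) for xi, yi in zip(x, y) if abs(xi) < 0.5]
x_r = [p[0] for p in kept]
y_r = [p[1] for p in kept]
r_restricted = pearson_r(x_r, y_r)

print(f"full range:       r = {r_full:.2f}, r^2 = {r_full ** 2:.2f}")
print(f"restricted range: r = {r_restricted:.2f}, r^2 = {r_restricted ** 2:.2f}")
```

The underlying relationship between X and Y is identical in both samples; only the spread of X differs, which is why the restricted sample shows the smaller r.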

Pearson’s r is Sensitive to Outliers
[Figure: scatterplot with one point labelled "Outlier (Fake Student)"]
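A sketch of how a single extreme point can dominate r. The numbers below are invented for illustration and are not the data behind the slide's figure: eight weakly related pairs, plus one added pair that is extreme on both variables.

```python
import math

def pearson_r(x, y):
    """Pearson's r from sums of squared deviations and cross-products."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical scores with almost no linear relationship.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
y = [5.0, 3.0, 6.0, 2.0, 7.0, 4.0, 5.0, 4.0]
r_without = pearson_r(x, y)

# Add one hypothetical outlier that is extreme on both variables.
r_with = pearson_r(x + [20.0], y + [20.0])

print(f"r without outlier: {r_without:.2f}")
print(f"r with outlier:    {r_with:.2f}")
```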

Standard Definition of Correlation (Population)

Standard Definition of Correlation (Sample)

Alternative (Equivalent) Formula

Computational Formula
For a population: r is computed from the covariance. For a sample: r is computed from the unbiased covariance.
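The slide's equations are not preserved in this transcript, so the sketch below assumes the usual computational form: the covariance is the sum of cross-products minus N times the product of the means, divided by N for a population or by N − 1 for the unbiased sample estimate, and r is that covariance divided by the product of the matching standard deviations. The marks are hypothetical, not the course's assignment data.

```python
import math

def covariance(x, y, unbiased):
    """Computational form: (sum of XY - N * mean_x * mean_y) / N or (N - 1)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cross = sum(a * b for a, b in zip(x, y)) - n * mx * my
    return cross / (n - 1 if unbiased else n)

def std_dev(x, unbiased):
    """Standard deviation as the square root of a variable's covariance with itself."""
    return math.sqrt(covariance(x, x, unbiased))

def pearson_r(x, y, unbiased):
    """r = covariance / (sd_x * sd_y), with matching divisors throughout."""
    return covariance(x, y, unbiased) / (std_dev(x, unbiased) * std_dev(y, unbiased))

# Hypothetical marks on two assignments.
a1 = [70.0, 82.0, 65.0, 90.0, 77.0, 58.0]
a2 = [68.0, 85.0, 70.0, 88.0, 80.0, 60.0]

r_population = pearson_r(a1, a2, unbiased=False)
r_sample = pearson_r(a1, a2, unbiased=True)
print(f"population formula: r = {r_population:.4f}")
print(f"sample formula:     r = {r_sample:.4f}")
```

Both versions should give the same r, because the N or N − 1 divisor cancels between the covariance and the standard deviations as long as it is used consistently.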

Example: 6130A Assignment Marks

End of Lecture 7 Wed, Oct

Correlation and the Power of Matched Tests

Correlation and the Power of Matched t-tests
Now that we understand correlation, we can better understand the power of matched t-tests when scores in the two conditions are correlated.

Recall the formulae for the standard error in independent and matched tests
Independent t-test; matched t-test

Knowing the expected standard error, we can estimate the expected t-value
Independent t-test; matched t-test

The Power of Matched t-tests
Large positive correlations between scores in the two conditions mean a greater expected t-score for the matched design. But keep in mind that the critical value for the matched design will be somewhat larger as well, because it has fewer degrees of freedom. Which test is more powerful is decided by the exact tradeoff between these two effects.
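The tradeoff can be made concrete with a small sketch. The standard error formulas on the earlier slides are not preserved in this transcript, so the code assumes the usual equal-n forms: sqrt((s1² + s2²)/n) for the independent test and sqrt((s1² + s2² − 2·r·s1·s2)/n) for the matched test. The scores are hypothetical.

```python
import math
import statistics

# Hypothetical scores for the same 8 participants in two conditions.
before = [10.0, 12.0, 9.0, 14.0, 11.0, 13.0, 8.0, 15.0]
after = [12.0, 13.0, 11.0, 17.0, 12.0, 16.0, 9.0, 18.0]
n = len(before)

m1, m2 = statistics.mean(before), statistics.mean(after)
s1, s2 = statistics.stdev(before), statistics.stdev(after)
mean_diff = m2 - m1

# Pearson's r between the two conditions (unbiased covariance / s1 * s2).
cov = sum((a - m1) * (b - m2) for a, b in zip(before, after)) / (n - 1)
r = cov / (s1 * s2)

# Assumed standard error formulas (equal n in both conditions).
se_independent = math.sqrt((s1 ** 2 + s2 ** 2) / n)
se_matched = math.sqrt((s1 ** 2 + s2 ** 2 - 2 * r * s1 * s2) / n)

# Cross-check: the matched SE equals the SD of the difference scores / sqrt(n).
diffs = [b - a for a, b in zip(before, after)]
se_from_diffs = statistics.stdev(diffs) / math.sqrt(n)

t_independent = mean_diff / se_independent  # df = 2 * (n - 1)
t_matched = mean_diff / se_matched          # df = n - 1

print(f"r = {r:.3f}")
print(f"independent: t = {t_independent:.2f}, df = {2 * (n - 1)}")
print(f"matched:     t = {t_matched:.2f}, df = {n - 1}")
```

The matched t is larger here because r is strongly positive, but it must be compared against the critical value for n − 1 rather than 2(n − 1) degrees of freedom.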

Applying Correlation Analysis

Adjusted Correlation Coefficient

Testing Pearson’s r for Significance
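The slide's test statistic is not preserved in this transcript. A common form for testing H0: ρ = 0, assumed here, is t = r·sqrt(N − 2)/sqrt(1 − r²) with N − 2 degrees of freedom. The sketch below only computes the statistic; the function name `r_to_t` and the example values are illustrative, and the result would still need to be compared against a t table.

```python
import math

def r_to_t(r, n):
    """Return (t, df) for testing H0: rho = 0, assuming t = r*sqrt(N-2)/sqrt(1-r^2)."""
    if n < 3:
        raise ValueError("need at least 3 pairs")
    if abs(r) >= 1.0:
        raise ValueError("t is undefined for |r| = 1")
    df = n - 2
    t = r * math.sqrt(df) / math.sqrt(1.0 - r ** 2)
    return t, df

# Hypothetical example: r = 0.5 observed in a sample of 27 pairs.
t, df = r_to_t(0.5, 27)
print(f"t = {t:.3f}, df = {df}")
```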

Underlying Assumptions (For Inference)
– Independent random sampling
– Bivariate normal distribution
[Figure: bivariate normal distribution, vertical axis labelled "Probability"]

Applications of Pearson’s r
Measuring reliability and validity
– Examples:
Test-retest reliability
Split-half reliability
Inter-rater reliability
Criterion validity of self-report (correlate self-report against a behavioural measure)
Correlation between tests that are supposed to measure the same thing
Correlation between an algorithmic model and human responses in behavioural studies
Measuring relationships between variables (correlational studies)
– e.g., frequency of cannabis and alcohol use
Measuring relationships between IVs and DVs (experimental studies, when the IV is on an interval/ratio scale)
– e.g., exam performance as a function of alcohol consumption on the previous night

Power Analysis for Pearson’s r

Confidence Intervals for Pearson’s r
Pearson’s r is bounded on [-1, 1]. Consequently, the sampling distribution for r is not normal. The sampling distribution for ρ > 0 is negatively skewed. The sampling distribution for ρ < 0 is positively skewed. Thus confidence intervals are generally not symmetric.

Fisher Transform
Fisher transform (Appendix r′): a method for symmetrizing the sampling distribution of r, so that confidence intervals can be calculated using the standard normal table.
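A minimal sketch of a Fisher-transform confidence interval. The slide's own formulas are not preserved in this transcript, so the standard forms are assumed: z = arctanh(r), standard error 1/sqrt(N − 3), a normal-theory interval on z, and the inverse transform tanh to return to the r scale. The function name and example values are illustrative.

```python
import math
from statistics import NormalDist

def fisher_ci(r, n, confidence=0.95):
    """Confidence interval for rho via the Fisher transform (assumed standard form)."""
    if n <= 3:
        raise ValueError("need more than 3 pairs")
    z = math.atanh(r)                # Fisher transform of r
    se = 1.0 / math.sqrt(n - 3)      # approximate standard error of z
    z_crit = NormalDist().inv_cdf(0.5 + confidence / 2.0)
    lower = math.tanh(z - z_crit * se)  # transform the limits back to the r scale
    upper = math.tanh(z + z_crit * se)
    return lower, upper

# Hypothetical example: r = 0.6 observed in a sample of 30 pairs.
r, n = 0.6, 30
lower, upper = fisher_ci(r, n)
print(f"95% CI for rho: [{lower:.3f}, {upper:.3f}]")
print(f"distance below r: {r - lower:.3f}, distance above r: {upper - r:.3f}")
```

The interval is symmetric on the z scale but not on the r scale: for positive r it extends further below r than above it, consistent with the negative skew of the sampling distribution.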

Confidence Intervals on r

End of Lecture 8 Nov

Testing the Difference of Pearson Correlations from 2 Independent Samples
Converting the skewed r distribution to an (approximately) normal distribution allows straightforward two-sample testing.
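A sketch of the two-sample test, assuming the standard form since the slide's equation is not preserved here: each r is Fisher-transformed, and the difference is divided by sqrt(1/(N1 − 3) + 1/(N2 − 3)) to give an approximately standard normal statistic. The correlations below are hypothetical; the sample sizes are arbitrary.

```python
import math
from statistics import NormalDist

def compare_correlations(r1, n1, r2, n2):
    """Return (z, two-tailed p) for H0: rho1 = rho2, independent samples."""
    z1, z2 = math.atanh(r1), math.atanh(r2)              # Fisher transforms
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))      # SE of the difference
    z = (z1 - z2) / se
    p = 2.0 * (1.0 - NormalDist().cdf(abs(z)))
    return z, p

# Hypothetical correlations from two independent samples.
z, p = compare_correlations(0.6, 43, 0.3, 44)
print(f"z = {z:.2f}, two-tailed p = {p:.3f}")
```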

Example
N = 43, N = 44