Regression vs. Correlation Both: Two variables Continuous data Regression: Change in X causes change in Y Independent and dependent variables or Predict.

Slides:



Advertisements
Similar presentations
Regression and correlation methods
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
Hypothesis Testing Steps in Hypothesis Testing:
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Linear regression models
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Quantitative Data Analysis: Hypothesis Testing
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Nonparametric Methods Chapter 15.
Correlation & Regression Chapter 15. Correlation statistical technique that is used to measure and describe a relationship between two variables (X and.
Data Transformations. Data transformations  Data transformation can “correct” deviation from normality and uneven variance (heteroscedasticity)  Right.
Université d’Ottawa / University of Ottawa 2001 Bio 4118 Applied Biostatistics L10.1 CorrelationCorrelation The underlying principle of correlation analysis.
Multiple regression analysis
Correlation and Regression. Spearman's rank correlation An alternative to correlation that does not make so many assumptions Still measures the strength.
Lecture 4: Correlation and Regression Laura McAvinue School of Psychology Trinity College Dublin.
Regression and Correlation
Chapter Eighteen MEASURES OF ASSOCIATION
Correlation. Two variables: Which test? X Y Contingency analysis t-test Logistic regression Correlation Regression.
Chapter Topics Types of Regression Models
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
Simple Linear Regression and Correlation
EDUC 200C Section 4 – Review Melissa Kemmerle October 19, 2012.
Regression and Correlation Methods Judy Zhong Ph.D.
Equations in Simple Regression Analysis. The Variance.
Correlation and Regression
Chapter 14: Nonparametric Statistics
Simple Linear Regression Models
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Correlation.
Chapter 15 Correlation and Regression
Learning Objective Chapter 14 Correlation and Regression Analysis CHAPTER fourteen Correlation and Regression Analysis Copyright © 2000 by John Wiley &
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation Note: Homework Due Thursday.
Regression For the purposes of this class: –Does Y depend on X? –Does a change in X cause a change in Y? –Can Y be predicted from X? Y= mX + b Predicted.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
Correlation and Regression Used when we are interested in the relationship between two variables. NOT the differences between means or medians of different.
Basic Statistics Correlation Var Relationships Associations.
Nonparametric Statistics aka, distribution-free statistics makes no assumption about the underlying distribution, other than that it is continuous the.
Examining Relationships in Quantitative Research
Relationship between two variables Two quantitative variables: correlation and regression methods Two qualitative variables: contingency table methods.
6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y is.
Lesson 15 - R Chapter 15 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.
STATISTICAL ANALYSIS FOR THE MATHEMATICALLY-CHALLENGED Associate Professor Phua Kai Lit School of Medicine & Health Sciences Monash University (Sunway.
The Pearson Product-Moment Correlation Coefficient.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Session 13: Correlation (Zar, Chapter 19). (1)Regression vs. correlation Regression: R 2 is the proportion that the model explains of the variability.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
Statistics in Applied Science and Technology Chapter14. Nonparametric Methods.
Correlation & Regression Analysis
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
Nonparametric Statistics
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
Linear Correlation (12.5) In the regression analysis that we have considered so far, we assume that x is a controlled independent variable and Y is an.
Chapter 21prepared by Elizabeth Bauer, Ph.D. 1 Ranking Data –Sometimes your data is ordinal level –We can put people in order and assign them ranks Common.
Experimental Statistics - week 9
Nonparametric Statistics
Nonparametric statistics. Four levels of measurement Nominal Ordinal Interval Ratio  Nominal: the lowest level  Ordinal  Interval  Ratio: the highest.
Nonparametric Statistics Overview. Objectives Understand Difference between Parametric and Nonparametric Statistical Procedures Nonparametric methods.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
The “Big Picture” (from Heath 1995). Simple Linear Regression.
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-1 Overview Overview 10-2 Correlation 10-3 Regression-3 Regression.
Inference about the slope parameter and correlation
CHAPTER fourteen Correlation and Regression Analysis
After ANOVA If your F < F critical: Null not rejected, stop right now!! If your F > F critical: Null rejected, now figure out which of the multiple means.
Nonparametric Statistics Overview
6-1 Introduction To Empirical Models
Product moment correlation
Nonparametric Statistics
Presentation transcript:

Regression vs. Correlation Both: Two variables Continuous data Regression: Change in X causes change in Y Independent and dependent variables or Predict X based on Y Correlation: No dependence (causation) assumed Estimate the degree to which 2 variables vary together

Correlation: more on bivariate statistics No dependence (causation) assumed Can call variables XY or X 1 X 2 Are to variables independent, or do they covary

Nature of variables Purpose of investigator Y random, X fixed Both random Establish and estimate dependence of Y upon X, describe functional relationship or predict Y from X Model I regression Model II regression, with few exceptions, eg prediction Establish and estimate association (interdependence) between X & Y MeaninglessCorrelation co- efficient, significance only if, normally distributed Adapted from Sokal & Rolf pg 559

X1X1 Y(X 2 ) X1X1 Visualize Correlation positive negative Increase in X associated with increase in Y Increase in X associated with decrease in Y

X1X1 Y(X 2 ) X1X1 No correlation vertical horizontal

r =  xy Pearson product-moment correlation coefficient Summed products of deviations of x & y  x 2  y 2    (x-xbar) 2 *  (y-ybar) 2 =  [(x-xbar) *(y-ybar)] ss X * ss Y  =

Equivalent calculations (1) r =  xy (n-1) s x s y Wheres x = SD X s y = SD Y

(r 2 ) = regression SS total SS  (Yi-Ybar)2  (Ŷi-Ybar)2 = Equivalent calculations (2) r= r2 =  regression SS total SS 

Testing significance: H 0 : r (  ) = 0 Assumes that data come from bivariate normal distribution true population parameter

t = r srsr Reject null if…… t calc > t  (2),  srsr =  1-r 2 n-2 SE of r

data start; infile 'C:\Documents and Settings\cmayer3\My Documents\teaching\Biostatistics\Lectures\monitoring data for corr.csv' dlm=',' DSD; input year day site $ depth temp DO spCond turb pH Kpar secchi alk Chla; options ls=180; proc print; data one; set start; options ls=100; proc corr; var temp DO spCond turb pH Kpar secchi alk Chla; Correlations on raw data data two; set start; lnturb=log(turb); Create new variables by transformation lnsecchi=log(secchi); lgturb=log10(turb); lgsecchi=log10(secchi); sqturb=sqrt(turb); sqsecchi=sqrt(secchi); proc print; data three; set two; Correlations on transformed data proc corr; var lnturb lnsecchi; proc corr; var lgturb lgsecchi; proc corr; var sqturb sqsecchi; data four; set two; Plot raw and transformed options ls=100; proc plot; plot turb*secchi; plot lnturb*lnsecchi; plot lgturb*lgsecchi; plot sqturb*sqsecchi; run;

Pearson Correlation Coefficients Prob > |r| under H0: Rho=0 Number of Observations temp DO spCond turb pH Kpar secchi alk Chla temp DO < < spCond < <.0001 <.0001 < turb < <.0001 < pH < < Kpar <.0001 < <.0001 < secchi <.0001 < <.0001 < alk < <.0001 < Chla < <

 Sometimes called distribution free statistics because they do not require that the data fit a normal distribution  Many nonparametric procedures are based on ranked data. Data are ranked by ordering them from lowest to highest and assigning them, in order, the integer values from 1 to the sample size. Nonparametric statistics

Some Commonly Used Statistical Tests Normal theory based test Corresponding nonparametric test Purpose of test t test for independent samples Mann-Whitney U test; Wilcoxon rank- sum test Compares two independent samples Paired t test Wilcoxon matched pairs signed-rank test Examines a set of differences Pearson correlation coefficient Spearman rank correlation coefficient Assesses the linear association between two variables. One way analysis of variance (F test) Kruskal-Wallis analysis of variance by ranks Compares three or more groups Two way analysis of variance Friedman Two way analysis of variance Compares groups classified by two different factors From:

Data transformations  Data transformation can “correct” deviation from normality and uneven variance (heteroscedasticity)  See chapter 13 in Zar  Pretty much….. Whatever works, works. Some common ones are for % or proportion use asin of square root log10 for density (#/m 2 )  Right transformation can allow you to use parametric statistics