Stats Club Marnie Brennan

Slides:



Advertisements
Similar presentations
Correlation and regression
Advertisements

Inference for Regression
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Correlation and Regression
Chapter 13 Multiple Regression
Chapter 12 Multiple Regression
The Simple Regression Model
Chapter 11 Multiple Regression.
Simple Linear Regression Analysis
SIMPLE LINEAR REGRESSION
Measures of Association Deepak Khazanchi Chapter 18.
Correlation and Regression Analysis
Chapter 7 Forecasting with Simple Regression
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Quantitative Business Analysis for Decision Making Multiple Linear RegressionAnalysis.
Introduction to Linear Regression and Correlation Analysis
Inference for regression - Simple linear regression
Correlation and Linear Regression
Maths Study Centre CB Open 11am – 5pm Semester Weekdays
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Learning Objective Chapter 14 Correlation and Regression Analysis CHAPTER fourteen Correlation and Regression Analysis Copyright © 2000 by John Wiley &
Chapter 12 Examining Relationships in Quantitative Research Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Hypothesis of Association: Correlation
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Correlation & Regression
Examining Relationships in Quantitative Research
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
Correlation and Linear Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
28. Multiple regression The Practice of Statistics in the Life Sciences Second Edition.
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
STATS 10x Revision CONTENT COVERED: CHAPTERS
Chapter 11 Linear Regression and Correlation. Explanatory and Response Variables are Numeric Relationship between the mean of the response variable and.
Stats Methods at IC Lecture 3: Regression.
Chapter 13 Simple Linear Regression
Lecture #25 Tuesday, November 15, 2016 Textbook: 14.1 and 14.3
Inference for Least Squares Lines
Statistics for Managers using Microsoft Excel 3rd Edition
Non-Parametric Tests 12/1.
Non-Parametric Tests 12/1.
Inference for Regression
Non-Parametric Tests 12/6.
Chapter 11 Simple Regression
Non-Parametric Tests.
Understanding Standards Event Higher Statistics Award
CHAPTER fourteen Correlation and Regression Analysis
Slides by JOHN LOUCKS St. Edward’s University.
CHAPTER 10 Correlation and Regression (Objectives)
Essential Statistics (a.k.a: The statistical bare minimum I should take along from STAT 101)
Normality or not? Different distributions and their importance
Correlation and Regression
Central tendency and spread
Stats Club Marnie Brennan
CHAPTER 26: Inference for Regression
Multiple Regression Models
Ass. Prof. Dr. Mogeeb Mosleh
CORRELATION ANALYSIS.
Relationship between two continuous variables: correlations and linear regression both continuous. Correlation – larger values of one variable correspond.
Correlation and Regression
SIMPLE LINEAR REGRESSION
Linear Regression and Correlation
Linear Regression and Correlation
Warsaw Summer School 2017, OSU Study Abroad Program
EE, NCKU Tien-Hao Chang (Darby Chang)
Introductory Statistics
Chapter 13 Simple Linear Regression
Presentation transcript:

Stats Club Marnie Brennan Correlation Stats Club Marnie Brennan

Have you used correlation before?

References Petrie and Sabin - Medical Statistics at a Glance: Chapter 26 Good Petrie and Watson - Statistics for Veterinary and Animal Science: Chapter 10 & 12 (12.7) Good Kirkwood and Sterne – Essential Medical Statistics: Chapter 10 & 30 Thrusfield – Veterinary Epidemiology: Chapter 14 (pges 263-264)

Other reads Mathematics Learning Support Centre – Statistics: Correlation - Good http://www.lboro.ac.uk/media/wwwlboroacuk/content/mlsc/downloads/Correlation.pdf Bewick, V, Cheek, L and Ball, J (2003) Statistics review 7: Correlation and regression. Critical Care, Vol. 7, 451-459 Swinscow, TDV (1976) Statistics at square one: XVIII – Correlation. British Medical Journal, Vol. 2, 680-681. Swinscow, TDV (1976) Statistics at square one: XIX – Correlation (continued). British Medical Journal, Vol. 2, 747-748. Swinscow, TDV (1976) Statistics at square one: XIX – Correlation (concluded). British Medical Journal, Vol. 2, 802-803.

What is correlation? Use it to look at the relationship between two continuous (numerical) variables To see if you have a linear relationship between them If you interchanged the x and y axes, you would still have the same relationship To measure the degree of the relationship

How does it differ from other calculations? T-tests and ANOVAs These measure the differences between subsets within your data/between groups Regression (this will be covered in a later session) This describes the linear relationship between two variables Describes how one variable (independent) predicts the other (dependent) You cannot interchange the variables between the x and y axes

When would you use correlation? To see if there is a relationship between two variables, and how strong it is As a prequel to linear regression If variables are highly correlated (or collinear), this will effect how they interact in a linear regression calculation Therefore, you need to know whether variables are correlated or not before you do a linear regression Other reasons??

When wouldn’t you use correlation! To compare two different methods of measurement on the same thing (reliability) E.g. How many neutrophils are in a blood sample? Compare using an IDEXX machine versus counting on a blood smear To compare the same method of measurement but used multiple times (reproducibility) E.g. How many neutrophils are in a blood sample? Preparing two blood smears from the same sample, and comparing the results Kappa (and associated) analysis – covered in a previous session Categorical data

First step – descriptive stats Scatter plot What does the relationship look like?

Shape and what does it mean? Does it increase or decrease as the values get higher? Does the relationship look linear? How ‘steep’ is the slope of the shape? Are there any outliers? Taken from Petrie and Sabin

Scatter plot examples

Demonstration if required….

Demonstration if required…. GenStat Graphics, 2D Scatter Plot (Y variate, X variate) Run

Measurement for correlation Correlation coefficient Numerical representation of the degree of association between the variables Between -1 and +1 Positive – the relationship is increasing with increasing values Negative – the relationship is decreasing with decreasing values It is dimensionless (spooky….) No units of measurement

What are the assumptions that have to be satisfied? There is a linear relationship between the variables There are no outliers present There are no subgroups within the data that affect the relationship e.g. sex Multiple data from the same subjects Taken from Petrie and Sabin

Represented as r (rho) Calculation of correlation coefficient Pearson’s coefficient – r If our null hypothesis is that there is no relationship e.g. the correlation coefficient is zero At least one variable has to be normally distributed Taken from Petrie and Sabin

P-values and confidence intervals As standard with p-values (you can calculate confidence intervals – not normally included) You state your cut off e.g. if p<0.05, reject the null hypothesis and conclude that there is a relationship A significant result is not necessarily CAUSAL There is an ASSOCIATION

Minitab

Minitab – Pearsons

SPSS

SPSS - Pearsons

GenStat GenStat Stat, summary, correlations

What if our assumptions cannot be met? Use Spearman’s rank correlation coefficient (or Kendall’s coefficient) Rank your variables from lowest to highest, and then calculate the correlation coefficient Still between -1 and 1 Remove outliers Rule of thumb – if the outliers are outside +/-2 standard deviations away from the group mean, you can remove them Transform data into a linear relationship????

Minitab – Spearman’s

Minitab – Spearman’s

SPSS – Spearman’s

SPSS – Spearman’s

GenStat GenStat Stat, summary, correlations Rank??

How does this fit with what you do or have seen/experienced?

Summary Make sure you are using correlation in the correct way Eyeball first with a scatter plot – look for strength, slope, relationship, outliers If it doesn’t fit the assumptions, use a non-parametric equivalent e.g. Spearman’s

Next time… Linear regression……