Correlation @UWE_JT9 @dave_lush Scientific Practice Correlation @UWE_JT9 @dave_lush.

Slides:



Advertisements
Similar presentations
Simple Linear Regression 1. 2 I want to start this section with a story. Imagine we take everyone in the class and line them up from shortest to tallest.
Advertisements

Correlation & Regression Chapter 10. Outline Section 10-1Introduction Section 10-2Scatter Plots Section 10-3Correlation Section 10-4Regression Section.
 Once you know the correlation coefficient for your sample, you might want to determine whether this correlation occurred by chance.  Or does the relationship.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Designing Experiments In designing experiments we: Manipulate the independent.
Cal State Northridge  320 Andrew Ainsworth PhD Correlation.
Correlation A correlation exists between two variables when one of them is related to the other in some way. A scatterplot is a graph in which the paired.
Correlations Determining Relationships: Chapter 5:
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Chapter 9 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 What is a Perfect Positive Linear Correlation? –It occurs when everyone has the.
Relationships Among Variables
Statistical Analysis Statistical Analysis
Correlation. Definition: the degree of relationship between two or more variables. For example, smoking and lung cancer are correlated: – if we look at.
The Scientific Method Interpreting Data — Correlation and Regression Analysis.
Statistical Significance R.Raveendran. Heart rate (bpm) Mean ± SEM n In men ± In women ± The difference between means.
Statistical Analysis Mean, Standard deviation, Standard deviation of the sample means, t-test.
Statistical Analysis Topic – Math skills requirements.
Correlation and Regression Used when we are interested in the relationship between two variables. NOT the differences between means or medians of different.
When trying to explain some of the patterns you have observed in your species and community data, it sometimes helps to have a look at relationships between.
Psych 230 Psychological Measurement and Statistics Pedro Wolf September 23, 2009.
Correlation Analysis. Correlation Analysis: Introduction Management questions frequently revolve around the study of relationships between two or more.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Inferential Statistics Introduction. If both variables are categorical, build tables... Convention: Each value of the independent (causal) variable has.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Lecturer’s desk Physics- atmospheric Sciences (PAS) - Room 201 s c r e e n Row A Row B Row C Row D Row E Row F Row G Row H Row A
Chapter 13 Understanding research results: statistical inference.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Other tests of significance. Independent variables: continuous Dependent variable: continuous Correlation: Relationship between variables Regression:
Comparing Multiple Groups:
Comparing Multiple Factors:
Scatter Plots and Correlation
GS/PPAL Section N Research Methods and Information Systems
Statistical analysis.
Regression and Correlation
Learning Objectives: 1. Understand the use of significance levels. 2
Chi-square: Comparing Observed and
Regression Scientific
Using the t-distribution
Modify—use bio. IB book  IB Biology Topic 1: Statistical Analysis
Statistical analysis.
Chi-Square X2.
Practice & Communication of Science
PCB 3043L - General Ecology Data Analysis.
Correlation – Regression
Non-parametric Tests: When Things
Basic Statistics Overview
Understanding Standards Event Higher Statistics Award
POSC 202A: Lecture Lecture: Substantive Significance, Relationship between Variables 1.
Spearman’s rho Chi-square (χ2)
Scientific Practice Correlation.
Inferential Statistics
Correlation and Simple Linear Regression
Comparing Multiple Groups: Analysis of Variance ANOVA (1-way)
Descriptive Analysis and Presentation of Bivariate Data
Kin 304 Inferential Statistics
Correlation and Regression
Inferential Statistics
The Chi-Square Distribution and Test for Independence
Chi-square: Comparing Observed and Expected Counts
Review: What influences confidence intervals?
Statistical Analysis Error Bars
Correlation and Simple Linear Regression
Scientific Practice Regression.
Association, correlation and regression in biomedical research
Chapter 12 Power Analysis.
Inferential Statistics
Chapter Thirteen McGraw-Hill/Irwin
Inferential testing.
Presentation transcript:

Correlation @UWE_JT9 @dave_lush Scientific Practice Correlation @UWE_JT9 @dave_lush

Where We Are/Where We Are Going We have looked at experiments in which we actively do something to see if we can influence something else eg the effect of a drug (or drugs) on BP a manipulative experiment these are analysed using tests such as t-tests or ANOVA But what if we are interested in the influence of height on BP? we can’t go around ‘manipulating’ people’s heights! instead, we use natural variation to explore the relationship called correlational

Correlation An apparent association between two variables as one variable changes, so does the other could be in the same direction, eg height and weight could be opposite, eg BP and life expectancy Relationships can be demonstrated graphically is there a linear ‘line of best fit’ I could draw? The degree of association can be quantified in terms of its correlation coefficient, r aka Pearson Product Moment Correlation Coefficient 0  1 (-1  +1 when direction taken into account) strength of ‘link’ between the two (elastic  rigid) r is independent of the units used (eg feet/inches)

Correlation Correlation coefficient, r = -1

Correlation Correlation coefficient, r = 1

Correlation Correlation coefficient, r = 0.992

Correlation Correlation coefficient, r = 0.954

Correlation Correlation coefficient, r = 0

Correlation & Its Pitfalls Correlation between height and weight in male children By convention, x-axis used for independent variable, y-axis for dependent variable ie can’t help thinking of causation… weight  height? if anything… height  weight reverse causation

Correlation & Its Pitfalls This is a bit better, as the idea that increased height causes increased weight makes more sense (generally, taller bodies need more cells!) BUT… correlation does not imply causation third factors

Correlation & Its Pitfalls Third Factors Eg there is a (positive) correlation between the number of fire engines at the scene of a fire and the damage done that does not mean that the fire engines cause the damage! the fire is the third factor; the size of the fire determines… the amount of damage done and the number of fire engines called out

Correlation & Its Pitfalls Third Factors can be used to nefarious ends… Eg there is a correlation between smoking an ill health tobacco lobby often cites that association does not imply causation could be that other factors cause both ill health and the increased tendency to smoke eg stress tobacco industry excels at exploiting the idea that there are no certainties in science!

Interpreting r Even if we generate random data, we will always get a non-zero value for r 21 random x/y pairs generated in Excel r= -0.227

Interpreting r So how do we tell ‘random’ associations from ‘real’ ones? how do we determine the significance of r? The usual 4 steps apply… 1) Generate the Null Hypothesis there is no association between the two variables ie r is zero so the value we get is just ‘bad luck’ 2) Generate the test statistic easy! r is the test statistic!

Interpreting r 3) Work out the probability, p this is the probability of getting an r value as big as we did even though the underlying population of all readings has an r of zero (ie the Null Hypo)

Interpreting r 3) Work out the probability, p there is a critical value of r that needs to be exceeded depends on df, which is N-2 the greater the N, the lower is the critical r depends on level of probability required (eg p=0.05) the smaller the p, the higher is the critical r use a table…

Interpreting r Our 21 random data pairs  df 21-2 = 19 Critical r (at 0.05, two-tailed) is 0.433 Our r = 0.227 (ignore sign) So p > 0.05

Interpreting r 4) Interpret the probability there is a greater than 5% chance that the Null Hypothesis is true so we can’t reject the Null Hypo so we accept that there is no significant association between the two variables

Summary Correlation is a units-independent estimation of the degree of association between two variables it estimates their tendency to change together Correlation coefficient range is -1  +1 ignoring sign, it is 0  1 no correlation  maximum correlation ‘out of step’  ‘marching together’ Significance of r can be tested is r significantly different to zero? Correlation does not imply causation reverse causation and third factors