Spearman’s Rank For relationship data.

Slides:



Advertisements
Similar presentations
Mann Whitney U For comparison data. Using Mann Whitney U Non-parametric i.e. no assumptions are made about data fitting a normal distribution Is used.
Advertisements

Spearman’s Rank For relationship data. Non-parametric i.e. no assumptions are made about data fitting a normal distribution You must have more than 5.
2013/12/10.  The Kendall’s tau correlation is another non- parametric correlation coefficient  Let x 1, …, x n be a sample for random variable x and.
Hypothesis Testing Steps in Hypothesis Testing:
KRUSKAL-WALIS ANOVA BY RANK (Nonparametric test)
Lesson Inferences Between Two Variables. Objectives Perform Spearman’s rank-correlation test.
Lecture 10 Non Parametric Testing STAT 3120 Statistical Methods I.
PSY 307 – Statistics for the Behavioral Sciences
Chi Square Test Dealing with categorical dependant variable.
1 PARAMETRIC VERSUS NONPARAMETRIC STATISTICS Heibatollah Baghi, and Mastee Badii.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Chapter 15 Nonparametric Statistics
Lecture 5 Correlation and Regression
Topics: Significance Testing of Correlation Coefficients Inference about a population correlation coefficient: –Testing H 0 :  xy = 0 or some specific.
Means Tests Hypothesis Testing Assumptions Testing (Normality)
Basic Statistics. Basics Of Measurement Sampling Distribution of the Mean: The set of all possible means of samples of a given size taken from a population.
Chapter 15 Correlation and Regression
14 Elements of Nonparametric Statistics
Hypothesis of Association: Correlation
Data analysis – Spearman’s Rank 1.Know what Spearman’s rank is and how to use it 2.Be able to produce a Spearman’s rank correlation graph for your results.
Nonparametric Statistics aka, distribution-free statistics makes no assumption about the underlying distribution, other than that it is continuous the.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Simple linear regression Tron Anders Moger
Nonparametric statistics. Four levels of measurement Nominal Ordinal Interval Ratio  Nominal: the lowest level  Ordinal  Interval  Ratio: the highest.
Statistics for A2 Biology Standard deviation Student’s t-test Chi squared Spearman’s rank.
Research Methods: 2 M.Sc. Physiotherapy/Podiatry/Pain Correlation and Regression.
Testing for a difference
Nonparametric Statistics
Data measurement, probability and Spearman’s Rho
Regression and Correlation
Spearman’s Rho Correlation
CORRELATION.
Correlation I have two variables, practically „equal“ (traditionally marked as X and Y) – I ask, if they are independent and if they are „correlated“,
10.2 Regression If the value of the correlation coefficient is significant, the next step is to determine the equation of the regression line which is.
Non-Parametric Tests 12/1.
Non-Parametric Tests 12/1.
Correlation – Regression
Non-Parametric Tests 12/6.
Non-parametric Tests: When Things
Non-Parametric Tests.
Hypothesis Testing Using the Chi Square (χ2) Distribution
Spearman’s Rank Correlation Test
Inferential Statistics
Spearman’s rho Chi-square (χ2)
10: Leisure at an International Scale: Sport
Inferential Statistics
Correlation and Simple Linear Regression
Non-parametric Tests: When Things Aren’t ‘Normal’
Formation of relationships Matching Hypothesis
Correlation and Regression
Spearman Rank Order Correlation Example
Nonparametric Statistics
Logistic Regression --> used to describe the relationship between
Using Statistical techniques in Geography
Correlation and Simple Linear Regression
Non-parametric tests, part A:
Using Data to Analyze Trends: Spearman’s Rank
Statistics.
Simple Linear Regression and Correlation
Inferences Between Two Variables
SIMPLE LINEAR REGRESSION
Nonparametric Statistics
Chapter 18: The Chi-Square Statistic
Spearman’s Rank Correlation Coefficient
Warsaw Summer School 2017, OSU Study Abroad Program
Graphs and Chi Square.
Correlation and Simple Linear Regression
Correlation and Simple Linear Regression
Presentation transcript:

Spearman’s Rank For relationship data

Using Spearman’s Rank Non-parametric i.e. no assumptions are made about data fitting a normal distribution You must have more than 5 pairs of data (10+ better) Measures the strength and direction of the relationship between two variables

The value for rs (spearman rank) will be between +1 and -1 Positive correlation rs = +1 The value for rs (spearman rank) will be between +1 and -1 +1 indicates a perfect positive correlation -1 indicates a perfect negative correlation 0 indicates no correlation at all 0.5 Velocity (m/s) 1 2 3 Distance Downstream (km) 60 40 Discharge (cumecs) No correlation rs = 0 20 1 2 3 Number of passing dog walkers 30 20 Bedload particle size (cm) 10 Negative correlation rs = -1 1 2 3 Distance Downstream (km)

The Equation rs = 1 - 6  d2 n (n2 - 1)

The Equation rs = 1 - 6  d2 n (n2 - 1) Where: rs = Spearman Rank Correlation Coefficient  d2 = Sum of the squared differences between ranks n = Number of pairs of observations in the sample

Method 1. Establish the Null Hypothesis H0 (this is always the negative form. i.e. there is no significant correlation between the variables) and the alternative hypothesis (H1). H0 - There is no significant correlation between variable X and variable Y H1 - There is a significant correlation between variable X and variable Y

Distance from source (km) 2. Copy your data into the table below as variable x and variable y and label the data sets Distance from source (km) Rank R1 PO4 ppm R2 d (R1 - R2) d2 (variable x) (variable y) 50 2 4 40 6 20 8 10

3. Rank the individual data sets in sets in increasing order as separate sets of data (i.e. Give the lowest data value the lowest rank) Distance from source (km) Rank R1 PO4 ppm R2 (variable x) (variable y) 50 2 4 40 6 20 8 10 Distance from source (km) Rank R1 PO4 ppm R2 (variable x) (variable y) 1 50 2 4 3 40 6 20 8 5 10 Take each variable in turn Lowest value gets a rank of 1 When you have data values that are the same, they must have the same rank If the next two data values were not the same we would be assigning ranks 5 and 6 5 + 6 = 11 so we will divide this rank equally between the data values (there are 2 data values so we divide 11 by 2) 11 / 2 = 5.5 so both the data values are assigned a rank of 5.5 The same thing is done for all data values that are the same

Distance from source (km) The assigned ranks should be recorded in the table Distance from source (km) Rank R1 PO4 ppm R2 d (R1 - R2) d2 (variable x) (variable y) 1 50 5.5 2 4 3 40 6 20 8 5 10

Distance from source (km) 4. Calculate the difference between each pair of ranks R1-R2 (if done correctly the differences should equal zero) Take each variable in turn and record the differences in column d Distance from source (km) Rank R1 PO4 ppm R2 d (R1 - R2) d2 (variable x) (variable y) 1 50 5.5 2 4 3 40 6 20 8 5 10

Distance from source (km) 5. Square the differences in column d Record in column d2 Distance from source (km) Rank R1 PO4 ppm R2 d (R1 - R2) d2 (variable x) (variable y) 1 50 5.5 -4.5 2 -3.5 4 3 40 -1 6 20 8 5 10

Distance from source (km) 6. Calculate Sum of d2 Add up all the values in the d2 column Distance from source (km) Rank R1 PO4 ppm R2 d (R1 - R2) d2 (variable x) (variable y) 1 50 5.5 -4.5 21.25 2 -3.5 12.25 4 3 40 -1 6 20 8 5 10 9 25

Substitute the numbers calculated for the symbols in the equation 6. Calculate the rs value 6 x 69.5 = 417 d2 = 69.5 rs = 1 - 6  d2 n (n2 - 1) 6 x 35 = 210 62 -1 = 35 n =6 Substitute the numbers calculated for the symbols in the equation Work out each part in turn e.g. 1. Work out 6 x  d2 2. Work out n2 3. Work out n2 – 1 4. Work out n x answer to step 3 5. Work out answer to step one divided by the answer to step 4 6. Work out 1 – the answer to step 5 1 - 1.986 = 0.986 417/ 210 = 1.986

Is your rs value positive or negative? If it is a positive number then you have a positive correlation If it is a negative number then you have a negative correlation (You do not need to worry about + and – for the next bit!)

Compare your rs value against the table of critical values If rs is greater than or equal to the critical value, then there is a significant correlation and the null hypothesis can be rejected

Critical values for Spearman’s Rank Correlation Coefficient Significance level Number of pairs of measurements (n) p = 0.05 (95%) (+ or -) p = 0.01 (99%) 5 1.000 6 0.886 7 0.786 0.929 8 0.738 0.881 9 0.683 0.833 10 0.648 0.818 11 0.623 0.794 12 0.591 0.780 13 0.566 0.745 14 0.545 0.716 15 0.525 0.689 16 0.507 0.666 17 0.490 0.645 18 0.476 0.625 19 0.462 0.608 20 0.450

Check the P-0.05 (95%) confidence level first This means we are 95% confident our results were not due to chance If we have a significant correlation at 95% we can go back and check if we have a significant correlation at 99% as well (so we can be 99% confident our results were not due to chance)

Is our rs value smaller or larger than our critical value from the critical value table? If the rs value is greater than or equal to the critical value then the null hypothesis can be rejected – There is a significant correlation If the rs value is NOT greater than or equal to the critical value then the null hypothesis cannot be rejected – There is no significant correlation

Use the following data to calculate rs independently Light (Lux) Rank R1 Hedera helix leaf area cm2 R2 d (R1 - R2) d2 (variable x) (variable y) 1165 22.8 980 24.7 26.8 700 37.5 760 500 495 41.3 366 44.6 348 78.3 298 58.0

Key questions Is there a significant correlation? Which data value/s would you consider to be anomalous and why? Which graph would you use to present this data?