Data analysis – Spearman’s Rank 1.Know what Spearman’s rank is and how to use it 2.Be able to produce a Spearman’s rank correlation graph for your results.

Slides:



Advertisements
Similar presentations
Aim: How and why does vegetation change across a psammosere at Llobregat delta, Barcelona? Data Presentation Techniques Discuss data presentation techniques.
Advertisements

Spearman’s Rank Correlation Coefficient
Linear regression and correlation
Chi-Square Test A fundamental problem is genetics is determining whether the experimentally determined data fits the results expected from theory (i.e.
Correlation and regression Dr. Ghada Abo-Zaid
Correlation and Regression
Hypothesis Testing IV Chi Square.
How and why to use Spearman’s Rank… If you have done scattergraphs, Spearman’s Rank offers you the opportunity to use a statistical test to get a value.
CORRELATON & REGRESSION
1 More Regression Information. 2 3 On the previous slide I have an Excel regression output. The example is the pizza sales we saw before. The first thing.
PPA 415 – Research Methods in Public Administration Lecture 9 – Bivariate Association.
Correlation A correlation exists between two variables when one of them is related to the other in some way. A scatterplot is a graph in which the paired.
Social Research Methods
Spearman’s Rank Statistical Analysis to show Relationship Strength.
Simple ideas of correlation Correlation refers to a connection between two sets of data. We will also be able to quantify the strength of that relationship.
Correlation and Linear Regression Chapter 13 Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
STATISTICS ELEMENTARY C.M. Pascual
Linear Regression.
Statistical Analysis I have all this data. Now what does it mean?
Linear Regression and Correlation
Statistical Analysis to show Relationship Strength.
Covariance and correlation
Correlation.
Sections 9-1 and 9-2 Overview Correlation. PAIRED DATA Is there a relationship? If so, what is the equation? Use that equation for prediction. In this.
Is there a relationship between the lengths of body parts ?
1 Chapter 9. Section 9-1 and 9-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-1 Review and Preview.
9.1 Correlation Key Concepts: –Scatter Plots –Correlation –Sample Correlation Coefficient, r –Hypothesis Testing for the Population Correlation Coefficient,
Linear Regression When looking for a linear relationship between two sets of data we can plot what is known as a scatter diagram. x y Looking at the graph.
Statistical Analysis A Quick Overview. The Scientific Method Establishing a hypothesis (idea) Collecting evidence (often in the form of numerical data)
Statistical Analysis Mean, Standard deviation, Standard deviation of the sample means, t-test.
WELCOME TO THETOPPERSWAY.COM.
Hypothesis of Association: Correlation
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
1 rules of engagement no computer or no power → no lesson no SPSS → no lesson no homework done → no lesson GE 5 Tutorial 5.
Investigating the Relationship between Scores
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 13 Linear Regression and Correlation.
Correlation Analysis. Correlation Analysis: Introduction Management questions frequently revolve around the study of relationships between two or more.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Describing Relationships Using Correlations. 2 More Statistical Notation Correlational analysis requires scores from two variables. X stands for the scores.
Sampling  When we want to study populations.  We don’t need to count the whole population.  We take a sample that will REPRESENT the whole population.
11/23/2015Slide 1 Using a combination of tables and plots from SPSS plus spreadsheets from Excel, we will show the linkage between correlation and linear.
More statistics notes!. Syllabus notes! (The number corresponds to the actual IB numbered syllabus.) Put the number down from the syllabus and then paraphrase.
Correlation They go together like salt and pepper… like oil and vinegar… like bread and butter… etc.
Chapter 7 Calculation of Pearson Coefficient of Correlation, r and testing its significance.
Outline of Today’s Discussion 1.Introduction to Correlation 2.An Alternative Formula for the Correlation Coefficient 3.Coefficient of Determination.
Organizing and Analyzing Data. Types of statistical analysis DESCRIPTIVE STATISTICS: Organizes data measures of central tendency mean, median, mode measures.
1 MVS 250: V. Katch S TATISTICS Chapter 5 Correlation/Regression.
Correlation Assumptions: You can plot a scatter graph You know what positive, negative and no correlation look like on a scatter graph.
+ Mortality. + Starter for 10…. In pairs write on a post it note: One statistic that we use to measure mortality On another post it note write down: A.
Principles of Biostatistics Chapter 17 Correlation 宇传华 网上免费统计资源(八)
Advanced Higher STATISTICS Spearman’s Rank (Spearman’s rank correlation coefficient) Lesson Objectives 1. Explain why it is used. 2. List the advantages.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 3 Investigating the Relationship of Scores.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Linear Regression Essentials Line Basics y = mx + b vs. Definitions
Review and Preview and Correlation
Spearman’s Rho Correlation
CORRELATION.
Chapter 5 STATISTICS (PART 4).
Elementary Statistics
Spearman’s Rank Correlation Test
Social Research Methods
10: Leisure at an International Scale: Sport
Correlation and Regression
Statistical Analysis Error Bars
Using Statistical techniques in Geography
STATISTICS Topic 1 IB Biology Miss Werba.
Using Data to Analyze Trends: Spearman’s Rank
Spearman’s Rank For relationship data.
Presentation transcript:

Data analysis – Spearman’s Rank 1.Know what Spearman’s rank is and how to use it 2.Be able to produce a Spearman’s rank correlation graph for your results.

The Spearman's Rank Correlation Coefficient is used to discover the strength of a link between two sets of data.

This example looks at the strength of the link between the price of a convenience item (a 50cl bottle of water) and distance from the Contemporary Art Museum in El Ravel, Barcelona

The scatter graph shows the possibility of a negative correlation between the two variables. The Spearman's rank correlation technique should be used to see if there is indeed a correlation, and to test the strength of the relationship.

Spearman’s Rank correlation coefficient A correlation can easily be drawn as a scatter graph, but the most precise way to compare several pairs of data is to use a statistical test - this establishes whether the correlation is really significant or if it could have been the result of chance alone. Spearman’s Rank correlation coefficient is a technique which can be used to summarise the strength and direction (negative or positive) of a relationship between two variables. The result will always be between 1 and minus 1.

Example hypothesis: the further away from Upper Bath Street the fewer pedestrians there will be. DistanceNo. of pedestrians Rank DistanceRank pedestrians Difference (d) Difference (d²) Total1477.5

Method - calculating the coefficient Distance from Upper Bath Street No. of pedestrians Rank Distance Rank pedestrians Difference (d) Difference (d²) 1. Log all your data2. Rank all your data 3. Minus the second data set from the first and then square your results 4. Carry out the Spearman’s Rank correlation co-efficient, as per the formula Ranking is achieved by giving the ranking '1' to the biggest number in a column, '2' to the second biggest value and so on. The smallest value in the column will get the lowest ranking. This should be done for both sets of measurements.

Tied scores are given the mean (average) rank. For example, the three tied scores of 1 Euro in the example below are ranked fifth in order of price, but occupy three positions (fifth, sixth and seventh) in a ranking hierarchy of ten. The mean rank in this case is calculated as (5+6+7) ÷ 3 = 6.

Convenience Store Distance from CAM (m) RankPrice of 50cl bottle (€) RankDifference between the ranks (d) d² d² =

Write a hypothesis and rank the data here.

Find the difference in the ranks (d): This is the difference between the ranks of the two values on each row of the table. The rank of the second value (price) is subtracted from the rank of the first (distance from the museum).

Convenience Store Distance from CAM (m) Ran k Price of 50cl bottle (€) RankDifference between the ranks (d) d² d² =

Square the differences (d²) to remove negative values.

Convenience Store Distance from CAM (m) RankPrice of 50cl bottle (€) RankDifference between the ranks (d) d² d² =

and then sum them ( d²).

Convenience Store Distance from CAM (m) Ran k Price of 50cl bottle (€) RankDifference between the ranks (d) d² d² = 285.5

When written in mathematical notation the Spearman Rank formula looks like this :

Calculate the coefficient (r²) using the formula below. The answer will always be between 1.0 (a perfect positive correlation) and -1.0 (a perfect negative correlation).

Find the value of all the d² values by adding up all the values in the Difference² column. In our example this is Multiplying this by 6 gives 1713.

The value n is the number of sites at which you took measurements. This, in our example is 10. Substituting these values into n³ - n we get

We now have the formula:

which gives; 1.73

a value for R²

What does this R² value of mean? The closer r is to +1 or -1, the stronger the likely correlation. A perfect positive correlation is +1 and a perfect negative correlation is -1. The R² value of suggests a fairly strong negative relationship.

A further technique is now required to test the significance of the relationship.

Work out the 'degrees of freedom' you need to use. This is the number of pairs in your sample minus 2 (n-2). In the example it is 8 (10 - 2). Now plot your result on the table.

If it is below the line marked 5%, then it is possible your result was the product of chance and you must reject the hypothesis.

If it is above the 0.1% significance level, then we can be 99.9% confident the correlation has not occurred by chance.

If it is above 1%, but below 0.1%, you can say you are 99% confident.

If it is above 5%, but below 1%, you can say you are 95% confident (i.e. statistically there is a 5% likelihood the result occurred by chance).

In the example, the value 0.73 gives a significance level of slightly less than 5%. That means that the probability of the relationship you have found being a chance event is about 5 in a 100. You are 95% certain that your hypothesis is correct.

The fact two variables correlate cannot prove anything - only further research can actually prove that one thing affects the other. Data reliability is related to the size of the sample. The more data you collect, the more reliable your result.

Your IA You will now need to create a new hypothesis to include within the results section of your IA to show how 2 pieces of data might be linked. Which ones would you expect there to be a correlation with?

Example of student’s work…

Your IA I cannot help you do the Spearman’s Correlation calculations on your own data set. There are the handouts and the notes you have from today’s lesson…if you follow these closely you should be able to work it out.