Choosing Your Test Spearman’s? Chi-squared? Mann-Whitney?

Slides:



Advertisements
Similar presentations
Categorical and discrete data. Non-parametric tests.
Advertisements

Correlation and Linear Regression.
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Nonparametric Methods Chapter 15.
Ordinal Data. Ordinal Tests Non-parametric tests Non-parametric tests No assumptions about the shape of the distribution No assumptions about the shape.
statistics NONPARAMETRIC TEST
Chapter 14 Analysis of Categorical Data
Non-parametric equivalents to the t-test Sam Cromie.
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Introduction to Probability and Statistics Linear Regression and Correlation.
Inferences About Process Quality
Student’s t statistic Use Test for equality of two means
Lecture 11 Chapter 6. Correlation and Linear Regression.
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
Nonparametrics and goodness of fit Petter Mostad
Chapter 15 Nonparametric Statistics
Non-Parametric Methods Professor of Epidemiology and Biostatistics
Nonparametric or Distribution-free Tests
Non-parametric Dr Azmi Mohd Tamil.
TOPIC 1 STATISTICAL ANALYSIS
Chi-squared Goodness of fit. What does it do? Tests whether data you’ve collected are in line with national or regional statistics.  Are there similar.
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
Measure your handspan and foot length in cm to nearest mm We will record them as Bivariate data below: Now we need to plot them in what kind of graph?
Non-Parametric Methods Professor of Epidemiology and Biostatistics
Nonparametrics and goodness-of-fit
Identify the I.V. and D.V. variables as you read the story: Two researchers were talking shop over coffee one morning. The topic was intensive early training.
STEM Fair Graphs & Statistical Analysis. Objectives: – Today I will be able to: Construct an appropriate graph for my STEM fair data Evaluate the statistical.
Copyright © 2012 Pearson Education. Chapter 23 Nonparametric Methods.
Correlation is a statistical technique that describes the degree of relationship between two variables when you have bivariate data. A bivariate distribution.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
Correlation – Pearson’s. What does it do? Measures straight-line correlation – how close plotted points are to a straight line Takes values between –1.
Statistical analysis Prepared and gathered by Alireza Yousefy(Ph.D)
Chapter 14 Nonparametric Tests Part III: Additional Hypothesis Tests Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social & Behavioral.
School of Information - The University of Texas at Austin LIS 397.1, Introduction to Research in Library and Information Science LIS Introduction.
By: Amani Albraikan.  Pearson r  Spearman rho  Linearity  Range restrictions  Outliers  Beware of spurious correlations….take care in interpretation.
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
STATISTICAL ANALYSIS FOR THE MATHEMATICALLY-CHALLENGED Associate Professor Phua Kai Lit School of Medicine & Health Sciences Monash University (Sunway.
Chapter 13 CHI-SQUARE AND NONPARAMETRIC PROCEDURES.
Copyright © 2010 Pearson Education, Inc. Slide
Sampling  When we want to study populations.  We don’t need to count the whole population.  We take a sample that will REPRESENT the whole population.
T-test - paired Testing for a difference. What does it do? Tests for a difference in means Compares two cases (eg soil moisture content north & south.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 Regression & Correlation (1) 1.A relationship between 2 variables X and Y 2.The relationship seen as a straight line 3.Two problems 4.How can we tell.
Simple linear regression Tron Anders Moger
Correlation. Correlation is a measure of the strength of the relation between two or more variables. Any correlation coefficient has two parts – Valence:
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Tuesday PM  Presentation of AM results  What are nonparametric tests?  Nonparametric tests for central tendency Mann-Whitney U test (aka Wilcoxon rank-sum.
Biostatistics Nonparametric Statistics Class 8 March 14, 2000.
+ Data Analysis Chemistry GT 9/18/14. + Drill The crown that King Hiero of Syracuse gave to Archimedes to analyze had a volume of 575 mL and a mass of.
MAKING MEANING OUT OF DATA Statistics for IB-SL Biology.
Chapter 21prepared by Elizabeth Bauer, Ph.D. 1 Ranking Data –Sometimes your data is ordinal level –We can put people in order and assign them ranks Common.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Nonparametric Statistics.
PART 2 SPSS (the Statistical Package for the Social Sciences)
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
Wilcoxon Signed Rank Testing for a difference R+ RR
Chi-squared Association Index. What does it do? Looks for “links” between two factors  Do dandelions and plantains tend to grow together?  Does the.
Correlation – Spearman’s. What does it do? Measures rank correlation – whether highest value in the 1 st data set corresponds to highest in the 2 nd set.
Hypothesis Testing Procedures Many More Tests Exist!
King Faisal University جامعة الملك فيصل Deanship of E-Learning and Distance Education عمادة التعلم الإلكتروني والتعليم عن بعد [ ] 1 جامعة الملك فيصل عمادة.
Nonparametric statistics. Four levels of measurement Nominal Ordinal Interval Ratio  Nominal: the lowest level  Ordinal  Interval  Ratio: the highest.
Dr.Rehab F.M. Gwada. Measures of Central Tendency the average or a typical, middle observed value of a variable in a data set. There are three commonly.
Determining and Interpreting Associations between Variables Cross-Tabs Chi-Square Correlation.
Data measurement, probability and Spearman’s Rho
Testing for a difference
How regression works The Right Questions about Statistics:
Testing for a difference
Inferential Statistics
STEM Fair Graphs & Statistical Analysis
Chi-squared Association Index
Nonparametric Statistics
Presentation transcript:

Choosing Your Test Spearman’s? Chi-squared? Mann-Whitney?

Choosing the correct technique Choose your technique BEFORE collecting your data! Your choice will depend on:  What you want to test  What sort of data you can collect Once you’ve chosen your test, it will tell you how much data you must collect

What do you want to test? For Correlation (eg between pollution levels and distance from city centre For AssociationFor Association (eg is there an association between snail shell colour and choice of habitat) Whether Local Data are in accordance with General FiguresWhether Local Data are in accordance with General Figures (eg are there the same proportions of cold, warm and hot days in a local city as in the county as a whole) For a Difference Between Two Populations (eg difference in numbers of a particular invertebrate between short & long grass) Click here Click here for a flow chart on choosing tests

Correlation can be positive, negative or zero  Positive correlation: as one variable increases, so does the other.  Eg nitrate concentration and algal growth  Negative correlation: as one variable increases, the other decreases.  Eg pollution levels and distance from the city centre  No correlation: one variable increasing has no consistent effect on the other Testing for Correlation

Types of correlation Two kinds of correlation:  Straight Line correlation This measures how close your data are to a straight line graph. Data must be continuous  Rank correlation This measures whether things are in the same order, but they don’t have to be in a straight line. Data do not have to be continuous

Choosing A Correlation Coefficient Straight line correlation - Pearson’s Product Moment Correlation Coefficient  Best to use if data actually are near a straight line  If you have lots of data, easier to calculate than Spearman’s (use a spreadsheet)  Lets you work out the equation of a “best-fit” line Rank correlation - Spearman’s Rank Correlation Coefficient  Data do not need to be close to a straight line  Needs absolute minimum of 4 data pairs – but more is better  Not valid if you have too many “ties”

Testing for Association Chi-Squared Association Index This lets you investigate whether there’s any association between two factors – are they linked?  Eg: Are snail shell colour and habitat preference associated?  If they are associated, it means that if we know a snail has a dark shell, it’s more likely to choose a particular habitat To do this test, we need:  Numbers of people/items etc in categories (in the example, it would be numbers of snails of each shell colour living in each habitat)  An average of at least 5 in each category

Testing Whether Local Data fit General Figures Chi-Squared Goodness-of-Fit This lets you compare local data with general – eg county, national or global – figures.  Eg does a city have significantly more warm days than the county it is in? To do this test, we need:  Numbers of people/items etc in categories (in the example, it would be numbers of days of various temperatures)  An average of at least 5 in each category  National/global/county data, to compare your data to

Testing For A Difference Difference between numbers of items in two or more categoriesDifference between numbers of items in two or more categories (eg numbers of invertebrates at 3 locations) Difference between averages (eg average area of lichen at two locations)

Difference between numbers of items in 2+ categories Chi-squared – testing for a difference Tests whether, for example, you get the same number of plants of a given species in different locations We need:  Numbers of people/ things in categories  An average of 5 in each category Use this rather than an “average” test if:  You have more than two categories (eg 3 locations)  You have just two categories, and it’s easier just to work with numbers of people/things

Difference Between Averages To choose the right test for averages, you must ask:  Are the data paired or not? Eg: Matched positions on either side of hedge Corresponding positions on two shores  Are the data likely to be normally distributed?  Only continuous data (eg lengths, widths etc) can be normal  Can check visually whether normal by diagram

Which Test for Averages? Not normal Not paired Mann-Whitney U-test Not normal Paired Wilcoxon Signed Rank Normal Not paired Unpaired t-test Normal Paired Paired t-test You could actually do Mann-Whitney for all of these. But it is easier to get a significant result using the others. All need at least 5 values in each sample.

Choosing A Test Flowchart