Intro to Parametric & Nonparametric Statistics “Kinds” of statistics often called nonparametric statistics Defining parametric and nonparametric statistics.

Slides:



Advertisements
Similar presentations
Parametric & Nonparametric Models for Tests of Association Models we will consider X 2 Tests for qualitative variables Parametric tests Pearson’s correlation.
Advertisements

CHOOSING A STATISTICAL TEST © LOUIS COHEN, LAWRENCE MANION & KEITH MORRISON.
Nonparametric Statistics Timothy C. Bates
Intro to Parametric & Nonparametric Statistics Kinds of variables & why we care Kinds & definitions of nonparametric statistics Where parametric stats.
PSY 307 – Statistics for the Behavioral Sciences Chapter 20 – Tests for Ranked Data, Choosing Statistical Tests.
Ordinal Data. Ordinal Tests Non-parametric tests Non-parametric tests No assumptions about the shape of the distribution No assumptions about the shape.
INTRODUCTION TO NON-PARAMETRIC ANALYSES CHI SQUARE ANALYSIS.
Basic Statistical Review
Decision Tree Type of Data Qualitative (Categorical) Type of Categorization One Categorical Variable Chi-Square – Goodness-of-Fit Two Categorical Variables.
Differences Between Group Means
Final Review Session.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 17: Nonparametric Tests & Course Summary.
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Univariate Parametric & Nonparametric Statistics & Statistical Tests Kinds of variables & why we care Univariate stats –qualitative variables –parametric.
Parametric & Nonparametric Models for Within-Groups Comparisons overview X 2 tests parametric & nonparametric stats Mann-Whitney U-test Kruskal-Wallis.
Parametric & Nonparametric Models for Univariate Tests, Tests of Association & Between Groups Comparisons Models we will consider Univariate Statistics.
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
Intro to Parametric & Nonparametric Statistics Kinds & definitions of nonparametric statistics Where parametric stats come from Consequences of parametric.
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Intro to Parametric Statistics, Assumptions & Degrees of Freedom Some terms we will need Normal Distributions Degrees of freedom Z-values of individual.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Nonparametrics and goodness of fit Petter Mostad
Inferential Statistics
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT
Leedy and Ormrod Ch. 11 Gray Ch. 14
STATISTICAL TECHNIQUES FOR research ROMMEL S. DE GRACIA ROMMEL S. DE GRACIA SEPS for PLANNING & RESEARCH SEPS for PLANNING & RESEARCH.
AM Recitation 2/10/11.
Statistics & Statistical Tests: Assumptions & Conclusions Kinds of degrees of freedom Kinds of Distributions Kinds of Statistics & assumptions required.
Copyright © 2008 by Pearson Education, Inc. Upper Saddle River, New Jersey All rights reserved. John W. Creswell Educational Research: Planning,
Descriptive Statistics e.g.,frequencies, percentiles, mean, median, mode, ranges, inter-quartile ranges, sds, Zs Describe data Inferential Statistics e.g.,
Analysis of Results A lecture delivered to participants In Research Methodology And Use of Technologies In Research On 16 th Nov. (G.V.M. College of Edu.Sonepat)
Chapter 15 Data Analysis: Testing for Significant Differences.
A Repertoire of Hypothesis Tests  z-test – for use with normal distributions and large samples.  t-test – for use with small samples and when the pop.
9 Mar 2007 EMBnet Course – Introduction to Statistics for Biologists Nonparametric tests, Bootstrapping
Nonparametric Statistical Methods: Overview and Examples ETM 568 ISE 468 Spring 2015 Dr. Joan Burtner.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
Statistical analysis Prepared and gathered by Alireza Yousefy(Ph.D)
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 26.
12: Basic Data Analysis for Quantitative Research.
Lesson 15 - R Chapter 15 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.
STATISTICAL ANALYSIS FOR THE MATHEMATICALLY-CHALLENGED Associate Professor Phua Kai Lit School of Medicine & Health Sciences Monash University (Sunway.
Experimental Design and Statistics. Scientific Method
CHI SQUARE TESTS.
Chapter 13 CHI-SQUARE AND NONPARAMETRIC PROCEDURES.
Experimental Psychology PSY 433 Appendix B Statistics.
Parametric & Nonparametric Univariate Statistics, Univariate Significance Tests & Tests of Association Statistics & models we will consider Univariate.
Academic Research Academic Research Dr Kishor Bhanushali M
Experimental Research Methods in Language Learning Chapter 10 Inferential Statistics.
ANALYSIS PLAN: STATISTICAL PROCEDURES
Statistics in Applied Science and Technology Chapter14. Nonparametric Methods.
DTC Quantitative Methods Bivariate Analysis: t-tests and Analysis of Variance (ANOVA) Thursday 14 th February 2013.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
1 Chapter 15 Data Analysis: Basic Questions © 2005 Thomson/South-Western.
Biostatistics Nonparametric Statistics Class 8 March 14, 2000.
Chapter 21prepared by Elizabeth Bauer, Ph.D. 1 Ranking Data –Sometimes your data is ordinal level –We can put people in order and assign them ranks Common.
Nonparametric Tests with Ordinal Data Chapter 18.
Chapter Eleven Performing the One-Sample t-Test and Testing Correlation.
University of Warwick, Department of Sociology, 2012/13 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Analysing Means II: Nonparametric techniques.
Parametric & Nonparametric Models for Between Groups & Within-Groups Comparisons X 2 tests BG Parametric & Nonparametric Tests Mann-Whitney U-test Kruskal-Wallis.
 Kolmogor-Smirnov test  Mann-Whitney U test  Wilcoxon test  Kruskal-Wallis  Friedman test  Cochran Q test.
Nonparametric statistics. Four levels of measurement Nominal Ordinal Interval Ratio  Nominal: the lowest level  Ordinal  Interval  Ratio: the highest.
Chapter 15 Analyzing Quantitative Data. Levels of Measurement Nominal measurement Involves assigning numbers to classify characteristics into categories.
Nonparametric Statistics Overview. Objectives Understand Difference between Parametric and Nonparametric Statistical Procedures Nonparametric methods.
1 Underlying population distribution is continuous. No other assumptions. Data need not be quantitative, but may be categorical or rank data. Very quick.
I. ANOVA revisited & reviewed
Statistical tests for quantitative variables
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Analysing Means II: Nonparametric techniques.
Part Three. Data Analysis
Parametric versus Nonparametric (Chi-square)
Presentation transcript:

Intro to Parametric & Nonparametric Statistics “Kinds” of statistics often called nonparametric statistics Defining parametric and nonparametric statistics Common reasons for using nonparametric statistics Common reasons for not using nonparametric statistics Models we’ll cover in here Using ranks instead of values to compute statistics

Defining nonparametric statistics.. There are two “kinds” of statistics commonly referred to as “nonparametric”... Statistics for quantitative variables w/out making “assumptions about the form of the underlying data distribution univariate -- median & IQR -- 1-sample test of median bivariate -- analogs of the correlations, t-tests & ANOVAs you know Statistics for qualitative variables univariate -- mode & #categories -- goodness-of-fit X² bivariate -- Pearson’s Contingency Table X² Have to be careful!!  for example X² tests are actually parametric (they assume an underlying normal distribution – more later)

Defining nonparametric statistics... Nonparametric statistics (also called “distribution free statistics”) are those that can describe some attribute of a population, test hypotheses about that attribute, its relationship with some other attribute, or differences on that attribute across populations or across time, that require no assumptions about the form of the population data distribution(s).

Now, about that last part… … that require no assumptions about the form of the population data distribution(s). This is where things get a little dicey - hang in there with me… Most of the statistics you know have a fairly simple “computational formula”. As examples...

Here are formulas for two familiar parametric statistics: The mean...M =  X/ N The standard S =  ( X - M ) 2 deviation...   N But where to these formulas “come from” ??? As you’ve heard many times, “computing the mean and standard deviation assumes the data are drawn from a population that is normally distributed.” What does this really mean ???

formula for the normal distribution: e - ( x -  )² / 2  ² ƒ(x) =   2π For a given mean (  ) and standard deviation (  ), plug in any value of x to receive the proportional frequency of that normal distribution with that value. The computational formula for the mean and std are derived from this formula.

Since the computational formula for the mean as the description of the center of the distribution is based upon the assumption that the normal distribution formula describes the population data distribution, if the data are not normally distributed then the formula for the mean doesn’t provide a description of the center of the population distribution (which, of course, is being represented by the sample distribution). Same goes for all the formulae that you know !! Mean,std, Pearson’s corr, Z-tests, t-tests, F-tests, X 2 tests, etc….. The utility of the results from each is dependent upon the “fit” of the data to the measurement (interval) and distributional (normal) assumptions of these statistical models.

Common reasons/situations FOR using Nonparametric stats & a caveat to consider Data are not normally distributed r, Z, t, F and related statistics are rather “robust” to many violations of these assumptions Data are not measured on an interval scale. Most psychological data are measured “somewhere between” ordinal and interval levels of measurement. The good news is that the “regular stats” are pretty robust to this influence, since the rank order information is the most influential (especially for correlation-type analyses). Sample size is too small for “regular stats” Do we really want to make important decisions based on a sample that is so small that we change the statistical models we use?

Common reasons/situations AGAINST using Nonparametric stats & a caveat to consider Robustness of parametric statistics to most violated assumptions Difficult to know if the violations or a particular data set are “enough” to produce bias in the parametric statistics. One approach is to show convergence between parametric and nonparametric analyses of the data. Poorer power/sensitivity of nonpar statistics (make Type II errors) Parametric stats are only more powerful when the assumpt- ions upon which they are based are well-met. If assumptions are violated then nonpar statistics are more powerful. Mostly limited to uni- and bivariate analyses Most research questions are bivariate. If the bivariate results of parametric and nonparametric analyses converge, then there may be increased confidence in the parametric multivariate results.

continued… Not an integrated family of models, like GLM There are only 2 families -- tests based on summed ranks and tests using  2 (including tests of medians), most of which converge to Z-tests in their “large sample” versions. H0:s not parallel with those of parametric tests This argument applies best to comparisons of “groups” using quantitative DVs. For these types of data, although the null is that the distributions are equivalent (rather than that the centers are similarly positioned  H0: for t-test and ANOVA), if the spread and symmetry of the distributions are similar (as is often the case & the assumption of t-test and ANOVA), then the centers (medians instead of means) are what is being compared by the significance tests. In other words, the H0:s are similar when the two sets of analyses make the same assumptions.

Statistics We Will Consider Parametric Nonparametric DV Categorical Interval/ND Ordinal/~ND univariate gof X 2 1-grp t-test 1-grp mdn test association X 2 Pearson’s Spearman’s 2 bg X 2 t- / F-test M-W K-W Mdn k bg X 2 F-test K-W Mdn 2wg McNem & Wil’s t- / F-test Wil’s Fried’s kwg Cochran’s F-test Fried’s M-W -- Mann-Whitney U-TestWil’s -- Wilcoxin’s Test K-W -- Kruskal-Wallis TestFried’s -- Friedman’s F-test Mdf -- Median TestMcNem -- McNemar’s X 2

Working with “Ranks” instead of “Values” all of the nonparametric statistics for use with quantitative variables work with the ranks of the variables, rather than the values themselves. S# score rank Converting values to ranks… smallest value gets the smallest rank highest rank = number of cases tied values get the mean of the involved ranks cases 1 & 3 are tied for 3rd & 4th ranks, so both get a rank of 3.5 Why convert values to ranks? Because distributions of ranks are “better behaved” than are distributions of values (unless there are many ties).