Wilcoxon’s Rank-Sum Test (two independent samples), n1 + n2 ≤ 25: Same Distributions

Presentation transcript:

Wilcoxon’s Rank-Sum Test (two independent samples), n1 + n2 ≤ 25: Same Distributions
Table columns: Runs (Labor data), Naïve Bayes Acc (n1), Ranks, Naïve Bayes Acc (n2), Ranks.
Summary rows: Sample Size (9 and 7), Mean, Rank Sum (W).
H0: mean(Acc1) = mean(Acc2). Result: accept H0.
Critical values V (Wilcoxon table) are listed by significance and test type: 0.05 two-tailed, 0.01 two-tailed, 0.05 one-tailed, 0.01 one-tailed.

Wilcoxon’s Rank-Sum Test (two independent samples), n1 + n2 ≤ 25: Different Distributions
Table columns: Runs (Labor data), Naïve Bayes Acc (n1), Ranks, J48 Acc (n2), Ranks.
Summary rows: Sample Size (9 and 7), Mean, Rank Sum (W).
H0: mean(Acc1) = mean(Acc2). Result: reject H0.
Critical values V (Wilcoxon table) are listed by significance and test type: 0.05 two-tailed, 0.01 two-tailed, 0.05 one-tailed, 0.01 one-tailed.
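The ranking step behind both small-sample slides can be sketched as follows. This is a minimal illustration, not the presenter's code: the accuracy values are hypothetical (they are not the Labor-data results), and the helper names `average_ranks` and `rank_sums` are made up for the example. Tied values receive the average of the ranks they span.

```python
def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def rank_sums(sample1, sample2):
    """Rank the pooled observations and return each sample's rank sum."""
    ranks = average_ranks(list(sample1) + list(sample2))
    w1 = sum(ranks[:len(sample1)])
    w2 = sum(ranks[len(sample1):])
    return w1, w2


# Hypothetical accuracies, NOT the values from the slides.
acc_a = [0.91, 0.88, 0.93, 0.90]
acc_b = [0.85, 0.88, 0.84]
w_a, w_b = rank_sums(acc_a, acc_b)
print(w_a, w_b)
```

For small samples the rank sum is then compared with the critical value V from a Wilcoxon table at the chosen significance level; the table lookup is not reproduced here.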

Wilcoxon’s Rank-Sum Test (two independent samples), n1 + n2 > 25: Different Distributions
Adult data. Table columns: n1: Naïve Bayes Acc (rank) per run; n2: J48 Acc (rank) per run.
Accuracy (rank) entries: (1.0) (2.0) (3.0) (4.0) (5.0) (6.0) 83.1 (7.0) (8.0) (9.0) (10.0) (11.0) (12.0) (13.0) (14.0) (15.0) (16.0) 83.4 (17.0) (18.0) (19.5) (21.0) (22.0) (23.0) (24.0) (25.0) (26.0) (27.0) (28.0) (29.0) (30.0) 85.7 (31.0) (32.0) (33.0) (34.0) (35.0) (36.5) (38.0) (39.0) (40.0) (41.0) (42.0) (43.0) (44.5) (46.5) 86.1 (48.5) (50.5) 86.2 (52.0) (53.0) (54.0) (55.0) (56.0) (57.0) (58.0) (59.0) 86.7 (60.0)
Summary rows: Sample Size (30 per classifier), Mean, Rank Sum (W).
Mean(W) = n1(n1 + n2 + 1)/2 = 915, STD(W) = sqrt(n1 n2 (n1 + n2 + 1)/12) ≈ 67.64.
Z = (W − Mean(W)) / STD(W). The Z statistic lies beyond the critical value (|z| = 1.96 at alpha = 0.05, two-tailed), so reject H0: mean(Acc1) = mean(Acc2).
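For larger samples the table lookup is replaced by a normal approximation. The sketch below shows that computation under stated assumptions: the rank sum `w1 = 600.0` is a hypothetical placeholder (the slide's W value is not available here), the function name `rank_sum_z` is invented for the example, and no tie correction is applied to the standard deviation.

```python
import math


def rank_sum_z(w1, n1, n2):
    """Normal approximation for the rank sum of sample 1 (no tie correction)."""
    mean_w = n1 * (n1 + n2 + 1) / 2
    std_w = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    z = (w1 - mean_w) / std_w
    return mean_w, std_w, z


# n1 = n2 = 30 as on the slide; w1 is a hypothetical rank sum.
mean_w, std_w, z = rank_sum_z(w1=600.0, n1=30, n2=30)
print(mean_w, round(std_w, 2), round(z, 2))

# Two-tailed decision at alpha = 0.05.
reject_h0 = abs(z) > 1.96
```

With n1 = n2 = 30 the function reproduces Mean(W) = 915 from the slide, which is a useful sanity check on the formula.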

Wilcoxon’s Matched-Pairs Signed-Ranks Test (for paired scores), n ≤ 50
Data example. Table columns: Classifier 1 scores (A), Classifier 2 scores (B), A−B, |A−B|, Rank(|A−B|), Signed Rank(|A−B|).
Signed-rank column entries shown: −3, remove, remove, +1, −2. Pairs marked "remove" are dropped before ranking.
Sum of signed ranks: W+ = +86, W− = −19. Select W = 19, the smaller sum in absolute value (reject H0).
H0: mean(signed_rank(|A−B|)) = 0.
Critical values V (Wilcoxon table) are listed by significance and test type: 0.05 two-tailed, 0.01 two-tailed, 0.05 one-tailed, 0.01 one-tailed.
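The signed-rank bookkeeping on this slide can be sketched as below. The paired scores are hypothetical (not the slide's data), the helper names are made up for the example, and the sketch assumes that the removed pairs are those with a zero difference. It returns W+ and W− as magnitudes, whereas the slide writes W− with a negative sign.

```python
def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def signed_rank_sums(a, b):
    """Return (W+, W-, n) for paired scores, dropping zero differences."""
    diffs = [x - y for x, y in zip(a, b) if x != y]
    ranks = average_ranks([abs(d) for d in diffs])
    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    w_minus = sum(r for r, d in zip(ranks, diffs) if d < 0)
    return w_plus, w_minus, len(diffs)


# Hypothetical paired scores; the third pair ties and is removed.
scores_a = [85, 80, 78, 90, 70, 88]
scores_b = [80, 82, 78, 84, 71, 85]
w_plus, w_minus, n = signed_rank_sums(scores_a, scores_b)
w = min(w_plus, w_minus)  # statistic compared with the table value
print(w_plus, w_minus, n, w)
```

A quick consistency check is that W+ and W− together equal n(n + 1)/2 for the n pairs that remain after removal.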

Wilcoxon’s Matched-Pairs Signed-Ranks Test (for paired scores), n > 50
Randomly split the Adult data set 50/50 into training and testing sets, 100 times.
For each training/testing split, run Naïve Bayes and J48 and record their accuracy values as a pair, then compute the difference in accuracy for that pair.
Determine the signed ranks of the differences for each pair (as in the previous example; the data is omitted due to space constraints).
We get W+ = 0 and W− = 5050 (J48 always produces the higher accuracy), N = 100.
We get mean(W) = N(N + 1)/4 = 2525 and STD(W) = sqrt(N(N + 1)(2N + 1)/24) ≈ 290.84.
Z = (0 − 2525)/290.84 ≈ −8.68, so |Z| > 1.96 (at alpha = 0.05) and H0 is rejected.
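The large-sample figures on this slide follow from the standard normal approximation for the signed-rank statistic, which the short sketch below evaluates for N = 100 and W+ = 0. The function name `signed_rank_z` is an assumption made for the example, and no tie correction is applied.

```python
import math


def signed_rank_z(w, n):
    """Normal approximation for the signed-rank statistic (no tie correction)."""
    mean_w = n * (n + 1) / 4
    std_w = math.sqrt(n * (n + 1) * (2 * n + 1) / 24)
    z = (w - mean_w) / std_w
    return mean_w, std_w, z


# Values from the slide: W+ = 0 over N = 100 pairs.
mean_w, std_w, z = signed_rank_z(w=0, n=100)
print(mean_w, round(std_w, 2), round(z, 2))

# Two-tailed decision at alpha = 0.05.
reject_h0 = abs(z) > 1.96
```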

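What is the Effect Size? (The effect of using Laplace smoothing on the accuracy of J48)
Table columns: Runs on Adult data, Accuracy of J48 (no Laplace), Accuracy of J48 (Laplace). Summary rows: Mean, Standard Deviation.
Pooled variance: SP^2 = (9 * SD1^2 + 9 * SD2^2) / 18 = 0.0365
Pooled standard deviation: SP = sqrt(0.0365) ≈ 0.191
d = (86.05 − 86.04) / 0.191 ≈ 0.052
This is less than 0.2, so d indicates a very small effect or no effect.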
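The effect-size arithmetic can be checked with a few lines of Python. This sketch uses only the figures that appear on the slide (pooled variance 0.0365, means 86.05 and 86.04). The function names are invented for the example, and the standard deviations passed to `pooled_sd` in the last lines are hypothetical, since the per-classifier values are not available here.

```python
import math


def pooled_sd(sd1, n1, sd2, n2):
    """Pooled standard deviation of two samples."""
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / (n1 + n2 - 2)
    return math.sqrt(pooled_var)


def effect_size_d(mean1, mean2, sp):
    """Standardized mean difference d = (mean1 - mean2) / sp."""
    return (mean1 - mean2) / sp


# Figures from the slide.
sp = math.sqrt(0.0365)
d = effect_size_d(86.05, 86.04, sp)
print(round(sp, 3), round(d, 3))

# Hypothetical check: equal SDs with 10 runs each pool to the same SD.
sp_equal = pooled_sd(0.2, 10, 0.2, 10)
```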

One-Way ANOVA (J48 on three domains)
Data columns: Runs, J48 Acc Adult, J48 Acc Pima, J48 Acc Credit.
ANOVA table columns: Source of Variability, Sum of Squares, Degrees of Freedom, Mean Squares, F Statistic = MS_G/MS_E, Prob. > F (p-value).
ANOVA table rows: Groups (p-value on the order of E-14), Error, Total.
Results: high F and very low p, so the groups are significantly different (see plot).

One-Way ANOVA (J48 on three domains)
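The F statistic in the one-way table is the ratio of the between-group mean square to the within-group (error) mean square. The sketch below computes it from scratch on small hypothetical data (not the J48 accuracies); the function name and the returned dictionary keys are assumptions made for the example.

```python
def one_way_anova(groups):
    """Return sums of squares, degrees of freedom and F = MS_G / MS_E."""
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    group_means = [sum(g) / len(g) for g in groups]

    # Between-group and within-group sums of squares.
    ss_groups = sum(len(g) * (m - grand_mean) ** 2
                    for g, m in zip(groups, group_means))
    ss_error = sum((x - m) ** 2
                   for g, m in zip(groups, group_means) for x in g)

    df_groups = len(groups) - 1
    df_error = n_total - len(groups)
    f_stat = (ss_groups / df_groups) / (ss_error / df_error)
    return {
        "ss_groups": ss_groups, "ss_error": ss_error,
        "df_groups": df_groups, "df_error": df_error, "F": f_stat,
    }


# Hypothetical scores for three groups.
table = one_way_anova([[1, 2, 3], [2, 3, 4], [5, 6, 7]])
print(table)
```

The p-value is the upper-tail probability of the F distribution with (df_groups, df_error) degrees of freedom; if SciPy is available, `scipy.stats.f.sf` is one way to evaluate it.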

Two-Way ANOVA (J48 & N.B. on 3 domains)
Data columns: Classifier, Runs, Acc Adult, Acc Pima, Acc Credit. Classifier rows: J48 (A), NB (B).
ANOVA table columns: Source of Variability, Sum of Squares, Degrees of Freedom, Mean Squares, F Statistic = MS_G/MS_E, Prob. > F (p-value).
ANOVA table rows: Columns H0A (p-value on the order of E-10), Rows H0B, Interactions H0AB (p-value on the order of E-05), Error, Total.
Results: the p-values are low, so Columns (H0A) and Interactions (H0AB) are significantly different, but Rows (H0B) are the least different.
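The three F statistics in the two-way table (rows, columns, interaction) can be sketched for a balanced design with replication as follows. The data is a hypothetical 2 x 2 layout with two replicates per cell, not the classifier accuracies, and the function name and dictionary keys are assumptions made for the example.

```python
def mean(xs):
    return sum(xs) / len(xs)


def two_way_anova(cells):
    """cells[i][j] lists the replicate scores for row i, column j (balanced)."""
    r, c, n = len(cells), len(cells[0]), len(cells[0][0])
    cell_means = [[mean(cell) for cell in row] for row in cells]
    row_means = [mean(row) for row in cell_means]
    col_means = [mean([cell_means[i][j] for i in range(r)]) for j in range(c)]
    grand = mean(row_means)

    ss_rows = c * n * sum((m - grand) ** 2 for m in row_means)
    ss_cols = r * n * sum((m - grand) ** 2 for m in col_means)
    ss_cells = n * sum((m - grand) ** 2 for row in cell_means for m in row)
    ss_inter = ss_cells - ss_rows - ss_cols
    ss_error = sum((x - cell_means[i][j]) ** 2
                   for i in range(r) for j in range(c) for x in cells[i][j])

    # Each effect's mean square is divided by the error mean square.
    ms_error = ss_error / (r * c * (n - 1))
    return {
        "F_rows": (ss_rows / (r - 1)) / ms_error,
        "F_columns": (ss_cols / (c - 1)) / ms_error,
        "F_interaction": (ss_inter / ((r - 1) * (c - 1))) / ms_error,
    }


# Hypothetical data: 2 rows x 2 columns, 2 replicates per cell.
cells = [
    [[1, 3], [5, 7]],
    [[2, 4], [10, 12]],
]
result = two_way_anova(cells)
print(result)
```

In the slide's layout the rows would be the two classifiers and the columns the three domains, with the runs as replicates in each cell.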