ANOVA: Multiple Comparisons & Analysis of Variance

One population, two populations, ... Previously: inference (confidence intervals, hypothesis testing) for the mean of one group/one population, and inference (confidence intervals, hypothesis testing) to compare the means of two groups/two populations. To review, briefly look at a few of those one- and two-mean inference procedures/situations.

Ho: μ = 1 Ha: μ > 1 where μ = mean heat conductivity transmitted per square meter of surface per degree Celsius difference on the two sides of the glass Is there evidence that the conductivity of this type of glass is greater than 1? Carry out an appropriate test.

Does logging significantly change the mean number of species in a plot after 8 years? Give appropriate statistical evidence to support your conclusion. Assume both populations are Normally distributed. We want to test Ho: μU = μL OR μU – μL = 0 Ha: μU ≠ μL OR μU – μL ≠ 0 where μU & μL are the mean number of species in unlogged and logged plots, respectively.

Is there good evidence that red wine drinkers’ mean polyphenol levels were different from white wine drinkers’ mean polyphenol levels? Assume both populations are approximately Normal. We want to test: Ho: μR = μW or μR – μW = 0 Ha: μR ≠ μW or μR – μW ≠ 0 where μR & μW are the mean percent change in polyphenols for men who drink red and white wine, respectively.

Nothing magical about the numbers one or two... Sometimes there is a need to compare three, four, five, or more groups with each other. ANOVA (Analysis of Variance) is a method for doing that; tests whether there is an association between a categorical variable that identifies different groups, and a numerical variable. The phrase “Analysis of Variance” can be misleading; the procedure really looks at means/compares means.

Go to Math 140 data... Copy and paste GPA & Favorite Social Media data into StatCrunch. “Clean up” the data. Create side-by-side box plots (graph, box plots, select ‘overall college GPA data,’ then group by ‘favorite social media data;’ check boxes ‘use fences,’ ‘draw boxes horizontally,’ & ‘markers mean;’ compute).

Go to Math 140 data... Is there a difference in mean GPA for Twitter users vs. Snapchat users vs. Instagram users vs. Facebook users vs. other? Is there a significant difference? OR Is the difference in means just due to sampling variability? Compare means; compare spreads.

More Math 140 data... Let’s look at ‘age in years’ & ‘transportation used to get to COC’ data. Is there a difference in mean ages among the seven different categories? Is the difference just due to sampling variability or is there truly a difference in mean ages among the seven different types of transportation? Compare means; compare spreads.

ANOVA is for... Comparing 3 or 4 or 5 or more groups to each other. If we just have 2 groups to compare to each other, like comparing mean GPAs by gender, we can use a 2-sample t-test. Compare male mean GPAs to female mean GPAs; that’s what 2-sample t-tests are meant to do. ANOVA is for comparing multiple groups, like mean GPA among Twitter users vs. Facebook users vs. Instagram users vs. etc.

We could... run a separate two-sample test for every pair of groups:
Ho: Twitter User GPA = Facebook User GPA; Ha: Twitter User GPA ≠ Facebook User GPA
Ho: Snapchat User GPA = Facebook User GPA; Ha: Snapchat User GPA ≠ Facebook User GPA
Ho: Twitter User GPA = Instagram User GPA; Ha: Twitter User GPA ≠ Instagram User GPA
Ho: Snapchat User GPA = Instagram User GPA; Ha: Snapchat User GPA ≠ Instagram User GPA
Ho: Twitter User GPA = Other User GPA; Ha: Twitter User GPA ≠ Other User GPA
Ho: Snapchat User GPA = Other User GPA; Ha: Snapchat User GPA ≠ Other User GPA
Ho: Twitter User GPA = Snapchat User GPA; Ha: Twitter User GPA ≠ Snapchat User GPA
Ho: Other User GPA = Facebook User GPA; Ha: Other User GPA ≠ Facebook User GPA
And two more I can’t fit on here... Instagram/Other & Instagram/Facebook... Ten different hypothesis tests... This is called multiple comparison... Comparing multiple pairs of means.
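As a quick check on the count of ten, the pairs can be enumerated in Python. This is only an illustrative sketch: the group labels come from the slide, while the variable names are made up for the example.

```python
from itertools import combinations

# Group labels taken from the slide; the variable names are illustrative.
groups = ["Twitter", "Snapchat", "Instagram", "Facebook", "Other"]

# Each unordered pair of groups would need its own two-sample test.
pairs = list(combinations(groups, 2))

for a, b in pairs:
    print(f"Ho: {a} User GPA = {b} User GPA   Ha: {a} User GPA != {b} User GPA")

print(len(pairs), "pairwise tests")
```

With k groups there are k(k - 1)/2 pairs, so five groups give 5 × 4 / 2 = 10 tests.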

Remember α... Rejection zone (when conducting a hypothesis test); significance level; usually 5% (0.05). α is also the probability of committing a type I error (rejecting the null hypothesis when it really is true). The basic problem with multiple comparisons is that even though the probability of something going wrong (making an incorrect decision; committing an error) on one occasion (comparing 2 things only) is small (5%), if we keep repeating the experiment, eventually something will go wrong.

Big chances to make big mistakes... Essentially, by doing multiple tests, we are creating more opportunities to mistakenly reject the null hypothesis. The more tests we do, the greater the probability that we will mistakenly reject the null hypothesis at least once. For our ten hypothesis tests, each with α = 0.05, the overall significance level (the probability that we conclude that at least one mean is different from another, when the truth is that all means are equal) is about 40%! Yikes!
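The "about 40%" figure can be reproduced with a short calculation. The sketch below assumes the ten tests are independent, which the slides do not state and which is only an approximation for pairwise tests on shared groups; the function name is made up for the example.

```python
def familywise_error_rate(alpha, num_tests):
    """Chance of at least one type I error across num_tests tests.

    Assumption (not from the slides): the tests are independent, so the
    chance of making no error at all is (1 - alpha) ** num_tests.
    """
    return 1 - (1 - alpha) ** num_tests


overall = familywise_error_rate(alpha=0.05, num_tests=10)
print(round(overall, 3))  # 0.401
```

Even a single test at α = 0.05 has a 5% chance of a false rejection; stacking ten of them pushes that chance to roughly 40% under this assumption.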

So, ANOVA to the rescue... ANOVA tests whether a categorical variable is associated with a numerical variable. This is the same as testing whether the mean value of a numerical variable is different in different groups. ANOVA looks at the variation within each group and between all groups, then creates a ratio comparing these numbers, called the F-statistic: F = (variation between groups) / (variation within groups)

ANOVA looks at variation within & between Look at variation within each group Look at variation between all groups

Like all other procedures... We have conditions that must be checked and met: Random Sample & Independent Measurements; Independent Groups; Same Variance.
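One way to screen the "same variance" condition is to compare sample standard deviations across groups. The sketch below uses a rule of thumb found in many introductory texts (largest SD no more than about twice the smallest); that threshold is an assumption and is not stated in these slides, and the GPA values are invented for illustration.

```python
from statistics import stdev


def sd_ratio(groups):
    """Ratio of the largest to the smallest sample standard deviation."""
    sds = [stdev(values) for values in groups.values()]
    return max(sds) / min(sds)


# Hypothetical GPA samples, made up for this example only.
samples = {
    "Twitter": [3.1, 2.8, 3.5, 3.0],
    "Facebook": [2.9, 3.3, 3.6, 2.5],
    "Instagram": [3.2, 3.4, 2.7, 3.0],
}

ratio = sd_ratio(samples)
# Assumed rule of thumb: a ratio under about 2 suggests similar spreads.
print(f"largest SD / smallest SD = {ratio:.2f}", "(ok)" if ratio < 2 else "(check)")
```

Side-by-side box plots, as used earlier in these notes, give the same information visually.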

Let’s do an example of ANOVA... With our GPA & favorite social media data... Test the hypothesis that COC students with different favorite social media have different GPAs (i.e., do students have higher (or lower) GPAs depending on their favorite social media?). Assume all conditions have been checked and met. Ho: μTwitter = μSnapchat = μOther = μInstagram = μFacebook Ha: At least one population mean is different, where μ(favorite social media) is the true, unknown population mean GPA of all COC students whose favorite social media is the one indicated. StatCrunch, stat, ANOVA, one-way, values in a single column, response overall GPA, factors social media, compute.

Let’s do an example of ANOVA... With our GPA & favorite social media data... Ho: μTwitter = μSnapchat = μOther = μInstagram = μFacebook Ha: At least one population mean is different. Fail to reject Ho. With a p-value of almost 0.80 and an alpha level of 5%, we do not have sufficient evidence to conclude that at least one population mean is different (i.e., we do not have enough evidence to conclude that COC students with different favorite social media have different mean GPAs).

Study hours by major... Three independent random samples of full-time college students were asked how many hours per week they studied outside of class. Their responses and their majors are shown in the Excel spreadsheet found on my website (data sets). Test the hypothesis that the mean number of hours studying varies by major. Assume all conditions have been checked and met. Ho: μMath = μSocial Science = μEnglish Ha: At least one population mean is different, where μmajor is the true, unknown population mean study time for all full-time college students within the given major. SS: Sum of Squares (total amount of variation). Total: sum of treatment (explained; variation between groups) and error (unexplained; variation within groups). MS: SS / df. F-Stat: ratio of MS between to MS within.
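The quantities in the ANOVA table (SS, df, MS, F-Stat) can be worked out by hand to see where the software output comes from. The sketch below is a minimal illustration in plain Python. The study-hour values are invented and are not the data from the spreadsheet, and the function name is hypothetical. It stops at the F-statistic and does not compute a p-value.

```python
from statistics import mean


def one_way_anova(groups):
    """Return the pieces of a one-way ANOVA table for {name: [values]}."""
    all_values = [x for values in groups.values() for x in values]
    grand_mean = mean(all_values)
    k = len(groups)       # number of groups
    n = len(all_values)   # total number of observations

    # Treatment (between): how far each group mean is from the grand mean.
    ss_between = sum(len(v) * (mean(v) - grand_mean) ** 2 for v in groups.values())
    # Error (within): how far each observation is from its own group mean.
    ss_within = sum((x - mean(v)) ** 2 for v in groups.values() for x in v)

    df_between, df_within = k - 1, n - k
    ms_between = ss_between / df_between   # MS = SS / df
    ms_within = ss_within / df_within
    return {
        "SS between": ss_between, "SS within": ss_within,
        "SS total": ss_between + ss_within,
        "df between": df_between, "df within": df_within,
        "MS between": ms_between, "MS within": ms_within,
        "F": ms_between / ms_within,
    }


# Made-up weekly study hours, for illustration only.
hours = {
    "Math": [15, 18, 20, 17],
    "Social Science": [10, 12, 9, 11],
    "English": [12, 14, 13, 15],
}
for label, value in one_way_anova(hours).items():
    print(f"{label}: {value:.2f}")
```

A large F means the group means differ by more than the within-group scatter would suggest; the p-value reported by StatCrunch comes from comparing F to an F-distribution with the two df values.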

Study hours by major... Ho: μMath = μSocial Science = μEnglish Ha: At least one population mean is different. Cut/paste data into StatCrunch. Stat, ANOVA, one-way, select columns, Math, Social Science, English, compute.

Study hours by major... Ho: μMath = μSocial Science = μEnglish Ha: At least one population mean is different. Reject Ho. With a p-value of almost 0 and an alpha level of 5%, we have sufficient evidence to conclude that at least one population mean is different (i.e., we have enough evidence to conclude that the mean number of hours studying varies by major for all full-time college students).

Your turn to choose some data... With a partner, go to Math 140 data. Choose one numeric set of data and one categorical set of data (that has more than 2 categories... so you wouldn’t choose gender, or a yes/no set of data; the categorical set must have at least 3 options in it); choose the two sets of data that you believe may have a relationship. Example: I thought that favorite social media used might be related to GPA, i.e., Twitter is your favorite? I think you will have a high GPA; Instagram your favorite? I think you will have a lower GPA.

Your turn to choose some data... Go through your data and ‘clean it up,’ as it might be ‘messy;’ justify any/all ‘cleaning’ you do. State your null and alternative hypotheses; define parameters. Assume all conditions have been checked and met. Run the ANOVA procedure. Provide a complete interpretation. Questions? Refer to the examples worked in these notes.