Comparing Means for Several Populations When we wish to test for differences in means for only 1 or 2 populations, we use one- or two-sample t inference.

Slides:



Advertisements
Similar presentations
The t Test for Two Independent Samples
Advertisements

Introductory Mathematics & Statistics for Business
STATISTICS POINT ESTIMATION Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
Overview of Lecture Partitioning Evaluating the Null Hypothesis ANOVA
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
Multiple-choice question
1 Contact details Colin Gray Room S16 (occasionally) address: Telephone: (27) 2233 Dont hesitate to get in touch.
Chapter 7 Sampling and Sampling Distributions
You will need Your text Your calculator
Chapter 10: The t Test For Two Independent Samples
Pooled Variance t Test Tests means of 2 independent populations having equal variances Parametric test procedure Assumptions – Both populations are normally.
Chi-Square and Analysis of Variance (ANOVA)
Hypothesis Tests: Two Independent Samples
Chapter 15 ANOVA.
Module 16: One-sample t-tests and Confidence Intervals
McGraw-Hill, Bluman, 7th ed., Chapter 9
Please enter data on page 477 in your calculator.
Statistical Inferences Based on Two Samples
© The McGraw-Hill Companies, Inc., Chapter 10 Testing the Difference between Means and Variances.
Analysis of Variance Chapter 12 . McGraw-Hill/Irwin
Chapter Thirteen The One-Way Analysis of Variance.
Chapter 14 ANOVA 1.
Ch 14 實習(2).
Chapter 8 Estimation Understandable Statistics Ninth Edition
Experimental Design and Analysis of Variance
Lecture 11 One-way analysis of variance (Chapter 15.2)
Simple Linear Regression Analysis
Chapter 14 Nonparametric Statistics
Multiple Regression and Model Building
Chapter 13 Comparing Two Populations: Independent Samples.
BPS - 5th Ed. Chapter 241 One-Way Analysis of Variance: Comparing Several Means.
CHAPTER 25: One-Way Analysis of Variance Comparing Several Means
Design of Experiments and Analysis of Variance
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
Statistics Are Fun! Analysis of Variance
Chapter 3 Analysis of Variance
Inferences About Process Quality
Chapter 9: Introduction to the t statistic
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
Chapter 11 HYPOTHESIS TESTING USING THE ONE-WAY ANALYSIS OF VARIANCE.
Copyright © 2004 Pearson Education, Inc.
INTRODUCTION TO ANALYSIS OF VARIANCE (ANOVA). COURSE CONTENT WHAT IS ANOVA DIFFERENT TYPES OF ANOVA ANOVA THEORY WORKED EXAMPLE IN EXCEL –GENERATING THE.
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
Chapter 13 - ANOVA. ANOVA Be able to explain in general terms and using an example what a one-way ANOVA is (370). Know the purpose of the one-way ANOVA.
CHAPTER 4 Analysis of Variance One-way ANOVA
Previous Lecture: Phylogenetics. Analysis of Variance This Lecture Judy Zhong Ph.D.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 ANALYSIS OF VARIANCE (ANOVA) Heibatollah Baghi, and Mastee Badii.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Chapter 12 Introduction to Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Eighth Edition by Frederick.
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
McGraw-Hill, Bluman, 7th ed., Chapter 12
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Overview and One-Way ANOVA.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
The 2 nd to last topic this year!!.  ANOVA Testing is similar to a “two sample t- test except” that it compares more than two samples to one another.
Chapter 15 Analysis of Variance. The article “Could Mean Platelet Volume be a Predictive Marker for Acute Myocardial Infarction?” (Medical Science Monitor,
Chapter 11 Created by Bethany Stubbe and Stephan Kogitz.
Chapter 10 Two-Sample Tests and One-Way ANOVA.
Lecture Slides Elementary Statistics Twelfth Edition
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
i) Two way ANOVA without replication
Comparing Three or More Means
Basic Practice of Statistics - 5th Edition
Statistics Analysis of Variance.
Econ 3790: Business and Economic Statistics
ANalysis Of VAriance Lecture 1 Sections: 12.1 – 12.2
Presentation transcript:

Comparing Means for Several Populations When we wish to test for differences in means for only 1 or 2 populations, we use one- or two-sample t inference. Testing for differences in more than 2 populations, or at several different levels (values) of a variable involves a different approach. This is called Analysis of Variance, or ANOVA. ANOVA partitions the total sum of squares into two parts: 1.within treatment variability 2.between treatment variability

Comparing Means for Several Populations Example: Test 5 types of concrete for differences in moisture absorption. The 5 types of concrete are the five levels of the treatment. Within Variability – this seeks to quantify the variability in absorption for one particular type of concrete. Between Variability – this seeks to quantify the differences between the types of concrete. ANOVA seeks to answer the question “Are the differences between the 5 sample means what is expected purely from random variation alone?”

Definitions An experimental unit is an object, or subject, that produces a sample measurement. The experimental conditions that define the different populations in a completely randomized design are called treatments. Testing for differences in the treatments is equivalent to testing for differences in the population means.

Practice on Definitions See page 399 section 10.1 exercises.

Graphical demonstration: Employing two types of variability

Graphical demonstration: Employing two types of variability Treatment 1Treatment 2 Treatment Treatment 1Treatment 2Treatment The sample means are the same as before, but the larger within-sample variability makes it harder to draw a conclusion about the population means. A small variability within the samples makes it easier to draw a conclusion about the population means.

Assumptions for ANOVA 1. The samples are independent –Selection of objects from any one population is unrelated to the selection of objects from any of the other populations. Selections are random. –Examples Different groups of people (no person in more than one group) Different types of music Different concentrations of chemicals Different models of automobiles

Assumptions for ANOVA 2. Each population has the same standard deviation,  But the values of the population standard deviations is not known before testing.

Assumptions for ANOVA 3. Each sample has a mean that can be calculated. This mean is somehow representative of the population mean for its population.

Assumptions for ANOVA 4. Each population is normally distributed –Quantitative data: sample size is at least 30 –However, we will assume normally distributed populations for all the problems we work.

Assumptions for ANOVA The following assumptions are required for a 1-way ANOVA: The k populations are independent. Each population has common standard deviation, . Each population has a mean,  i for i = 1, 2, …, k. Each population is normally distributed. So we now are testing whether all the treatment means are equal. H 0 :  1 =  2 = … =  k H a : At least two of the population means are not equal

Test Statistic If the null hypothesis is true, we expect the k sample means to have reasonably similar values. In other words, if the population means are equal, we would expect the variability among the sample means to be relatively small. Variability among the sample means is one of the things we will be testing for.

Test Statistic If the null hypothesis is true, we do not expect the population means to be exactly the same, because there is a chance factor in our choice of sample experimental units. We need to take into account the variability due to chance among the sample means.

Test Statistic This method is called “analysis of variance” of ANOVA because we are comparing two sources of variance: the variance among the sample means and the variation expected by chance among the sample means when the null hypothesis is true.

Test Statistic Our test statistic is called F. F = Variability among the sample means Variability expected by chance

Degrees of freedom For a sample, (or group) (k) df = n – 1 Total df = total number of units in the experiment – 1 Error df = Total df – Group df –Or Error df = N - k

Minitab We will use Minitab to do our calculations. A typical Minitab display is on the next slide.

ANOVA Table: Tensile Strength for 6 Machines Analysis of Variance for Tensile-Strength Source DF SS MS F P Machine Error Total SSMachine = 5.34 (sample mean variability), k = 6 machines SSError = (variability due to chance) Notice how much larger the “chance” variability is than the other. There is little to no evidence that the machines differ in mean tensile-strength. Look at that HUGE p-value!

Another Minitab Example Example 102 page 369 Sociologist and GPA college students

One-way ANOVA: GPA versus Group Source DF SS MS F P Group Error Total S = R-Sq = 19.96% R-Sq(adj) = 13.29% Individual 95% CIs For Mean Based on Pooled StDev Level N Mean StDev Lower Middle ( * ) Poor ( * ) Upper Middle ( * ) Well-to-do ( * ) Pooled StDev =

Manual Calculation The formula for calculating F using the Mean Square Treatment is given on page 375.

Manual Calculation To determine the p value when the f value is known, we need to use a table. Table 5 is on pages VII, VIII, IX in the table appendix. In general, Table 5 will provide only approximate p-values. To find precise values, technology is needed.

ANOVA – What is expected from you? Be able to complete each of the following exercises: State the two hypotheses. What is the observed value of the test statistic? (F = ?) Is this valid? We will typically “assume” the method is ok. What is the p-value? State a conclusion. Using a table for comparisons, locate what mean(s) are significantly different if you accepted the alternative hypothesis. (Sect 10.3)

Analysis of Variance results: Responses stored in Score. Factors stored in Hair Color. Factor means Hair ColornMeanStd. Error Dark Blond Dark Brunette Light Blond Light Brunette ANOVA table SourcedfSSMSF-StatP-value Treatments Error Total

Example Page 423 # 1 One-way ANOVA: Score versus Hair Color Source DF SS MS F P Hair Color Error Total H 0 :  light_blond =  dark_blond = … =  dark_brunette H a : At least two population means are different. Accept Ha if p-value < 0.05 F = 5.44p-value = At the 0.05 level of significance, there is sufficient evidence to conclude that there is a difference among mean pain thresholds for people possessing these four hair colors.

10.3 Which means are different? Multiple Comparisons When an analysis of variance F-test indicates a significant difference among population means, (accept H a ), the next question is which means are different.

Which means are different? We need to test each of the following pairs of hypotheses. Pair 1: H o : μ 1 -μ 2 =0 H a : μ 1 -μ 2 ≠0 Pair 2: H o : μ 1 -μ 3 =0 H a : μ 1 -μ 3 ≠0 Pair 3: H o : μ 2 -μ 3 =0 H a : μ 2 -μ 3 ≠0

Which means are different? To test each pair of hypothesis, we are only testing two means for a difference between them. This is the two-sample t-statistic that we used in section 9-2. However, we will substitute MSE(Mean Square Error) for s 2 See page 416 for entire equation.

Which mean is different? We can use StatCrunch to calculate the value of t and the p-value for each of the comparisons. We can then draw our conclusions based on the p-value for each pair (is it less than α? If so we accept the alternative hypothesis), and summarize our findings in a chart. This is how the revised section in the book does it. See example 10.4 p 418

Let’s look further at the example on hair coloring.

Multiple Comparisons Pairp-valuet-valueInterpretation LB v DB NS LB v LBr NS LB v DBr LB > DBr DB v LBr NS DB v DBr NS LBr v DBr LBr > DBr Let’s look further at the example on hair coloring

Summary Ex 10.5 summarizes ideas from Chapter 10. See p 421

When should we use the multiple comparison method? The sample data are obtained from the k populations using a completely randomized design An analysis of variance F-test indicates that there are some differences among the k population means. The objective is to determine which of the k population means differ. It is usually of interest to determine which mean might be the largest (or smallest).