Chapter 12: Analysis of Variance. Chapter Goals Test a hypothesis about several means. Consider the analysis of variance technique (ANOVA). Restrict the.

Slides:



Advertisements
Similar presentations
Statistical Techniques I
Advertisements

Statistics Review – Part II Topics: – Hypothesis Testing – Paired Tests – Tests of variability 1.
Chapter 12 ANALYSIS OF VARIANCE.
© 2010 Pearson Prentice Hall. All rights reserved Single Factor ANOVA.
1 1 Slide © 2009, Econ-2030 Applied Statistics-Dr Tadesse Chapter 10: Comparisons Involving Means n Introduction to Analysis of Variance n Analysis of.
Independent Sample T-test Formula
Lecture 10 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
Analysis of Variance Chapter Introduction Analysis of variance compares two or more populations of interval data. Specifically, we are interested.
Statistics Are Fun! Analysis of Variance
Chapter 9: Inferences Involving One Population Student’s t, df = 5 Student’s t, df = 15 Student’s t, df = 25.
Lecture 9: One Way ANOVA Between Subjects
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
Analysis of Variance Introduction to Business Statistics, 5e Kvanli/Guynes/Pavur (c)2000 South-Western College Publishing.
AM Recitation 2/10/11.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
CHAPTER 3 Analysis of Variance (ANOVA) PART 1
1 1 Slide 統計學 Spring 2004 授課教師:統計系余清祥 日期: 2004 年 3 月 30 日 第八週:變異數分析與實驗設計.
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 14 Analysis.
1 Chapter 11 Analysis of Variance Introduction 11.2 One-Factor Analysis of Variance 11.3 Two-Factor Analysis of Variance: Introduction and Parameter.
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
1 1 Slide Analysis of Variance Chapter 13 BA 303.
Chapter 13: Linear Correlation and Regression Analysis
Analysis of Variance Chapter 12 Introduction Analysis of variance compares two or more populations of interval data. Specifically, we are interested.
Chapter 9: Inferences Involving One Population Student’s t, df = 5 Student’s t, df = 15 Student’s t, df = 25.
Analysis of Variance ST 511 Introduction n Analysis of variance compares two or more populations of quantitative data. n Specifically, we are interested.
Chapter 11: Applications of Chi-Square. Chapter Goals Investigate two tests: multinomial experiment, and the contingency table. Compare experimental results.
© Copyright McGraw-Hill CHAPTER 12 Analysis of Variance (ANOVA)
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
One-Way Analysis of Variance … to compare 2 or population means.
Copyright © 2004 Pearson Education, Inc.
Chapter 10: Inferences Involving Two Populations.
INTRODUCTION TO ANALYSIS OF VARIANCE (ANOVA). COURSE CONTENT WHAT IS ANOVA DIFFERENT TYPES OF ANOVA ANOVA THEORY WORKED EXAMPLE IN EXCEL –GENERATING THE.
1 Analysis of Variance Chapter 14 2 Introduction Analysis of variance helps compare two or more populations of quantitative data. Specifically, we are.
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.1 One-Way ANOVA: Comparing.
1 Objective Compare of two population variances using two samples from each population. Hypothesis Tests and Confidence Intervals of two variances use.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Logic and Vocabulary of Hypothesis Tests Chapter 13.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Chapter 12 Introduction to Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Eighth Edition by Frederick.
© Copyright McGraw-Hill 2004
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
1 ES9 Chapter 24 ~ Linear Correlation & Regression Analysis Weight Waist Size.
Introduction to ANOVA Research Designs for ANOVAs Type I Error and Multiple Hypothesis Tests The Logic of ANOVA ANOVA vocabulary, notation, and formulas.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Overview and One-Way ANOVA.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Chapter 10 Section 5 Chi-squared Test for a Variance or Standard Deviation.
Formula for Linear Regression y = bx + a Y variable plotted on vertical axis. X variable plotted on horizontal axis. Slope or the change in y for every.
1/54 Statistics Analysis of Variance. 2/54 Statistics in practice Introduction to Analysis of Variance Analysis of Variance: Testing for the Equality.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
CHAPTER 3 Analysis of Variance (ANOVA) PART 2 =TWO- WAY ANOVA WITHOUT REPLICATION.
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 11 Multinomial Experiments and Contingency Tables 11-1 Overview 11-2 Multinomial Experiments:
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
Chapter 12 Introduction to Analysis of Variance
Chapter 15 Analysis of Variance. The article “Could Mean Platelet Volume be a Predictive Marker for Acute Myocardial Infarction?” (Medical Science Monitor,
Chapter 13 f distribution and 0ne-way anova
CHAPTER 3 Analysis of Variance (ANOVA) PART 1
Chapter 9: Inferences Involving One Population
CHAPTER 3 Analysis of Variance (ANOVA)
Chapter 10 Two-Sample Tests and One-Way ANOVA.
Econ 3790: Business and Economic Statistics
Chapter 14 Repeated Measures
Chapter 10 – Part II Analysis of Variance
ANalysis Of VAriance Lecture 1 Sections: 12.1 – 12.2
Presentation transcript:

Chapter 12: Analysis of Variance

Chapter Goals Test a hypothesis about several means. Consider the analysis of variance technique (ANOVA). Restrict the discussion to single-factor ANOVA.

12.1: Introduction to the Analysis of Variance Technique Compare several means simultaneously. The analysis of variance technique allows us to test the null hypothesis that all means are equal against the alternative hypothesis that at least one mean value is different, with a specified value of .

Example: A study was conducted to determine if the drying time for a certain paint is affected by the type of applicator used. The data in the table below represents the drying time (in minutes) for 3 different applicators when the paint was applied to standard wallboard. Is there any evidence to suggest the type of applicator has a significant effect on the paint drying time at the 0.05 level? Note: 1.The type of applicator is a level. 2.The data values from repeated samplings are called replicates.

Sample Results:

Note: 1.The drying time is measured by the mean value. is the mean drying time for level i, i = 1, 2, 3. 2.There is a certain amount of variation among the means. 3.Some variation can be expected, even if all three population means are equal. 4.Consider the question: “Is the variation among the sample means due to chance, or it is due to the effect of applicator on drying time?” 5.You might consider a dotplot of the data to see if the graphs suggests a difference among the levels?

Solution: 1.The Set-up: a.Population parameter of concern: The mean at each level of the test factor. Here, the mean drying time for each applicator. b.The null and the alternative hypothesis: H 0 :  1 =  2 =  3 The mean drying time is the same for each applicator. H a :  i  j for some i j Not all drying time means are equal.

2.The Test Criteria: a.Assumptions: The data was randomly collected and all observations are independent. The effects due to chance and untested factors are assumed to be normally distributed. b.Test statistic: F test statistic (see below). c.Level of significance:  = The Sample Evidence: a.Sample information: Data listed in the given table. b.Calculate the value of the test statistic: The F statistic is a ratio of two variances. Separate the variance in the entire data set into two parts.

Partition the Total Sum of Squares: Consider the numerator of the fraction used to define the sample variance: The numerator of this fraction is called the sum of squares, or total sum of squares. Notation:

Calculations:

An ANOVA table is often used to record the sums of squares and to organize the rest of the calculations. Format for the ANOVA Table:

Degrees of freedom, df, associated with each of the three sources of variation: 1.df(factor): one less than the number of levels (columns), c, for which the factor is tested. df(factor) = c  1 2.df(total): one less than the total number of observations, n. df(total) = n  1 n = k 1 + k 2 + k df(error): sum of the degrees of freedom for all levels tested. Each column has k i  1 degrees of freedom. df(error) = (k 1  1) + (k 2  1) + (k 3  1) +... = n  c

Calculations: df(factor) = df(applicator) = c  1 = 3  1 = 2 df(total) = n  1 = 19  1 = 18 df(error) = n  c = 19  3 = 16 Note: The sums of squares and the degrees of freedom must check. SS(factor) + SS(error) = SS(total) df(factor) + df(error) = df(total)

Mean Square: The mean square for the factor being tested and for the error is obtained by dividing the sum-of-square value by the corresponding number of degrees of freedom. Calculations:

The Complete ANOVA Table: The Test Statistic: Numerator degrees of freedom = df(factor) Denominator degrees of freedom = df(error)

4.The Probability Distribution (Classical Approach): a.Critical value: F(2, 16, 0.05) = 3.63 b.F* is in the critical region. 4.The Probability Distribution (p-Value Approach): a.The p-value: Table 9: < P < 0.05; By computer: P = b.The p-value is smaller than the level of significance, . 5.The Results: a.Decision: Reject H 0. b.Conclusion: There is evidence to suggest the three population means are not all the same. The type of applicator has a significant effect on the paint drying time.

12.2: The Logic Behind ANOVA Many experiments are conducted to determine the effect that different levels of some test factor have on a response variable. Single-factor ANOVA: obtain independent random samples at each of several levels of the factor being tested. Draw a conclusion concerning the effect that the levels of the test factors have on the response variable.

The Logic of the Analysis of Variance Technique: 1.In order to compare the means of the levels of the test factor, a measure of the variation between the levels (columns), the MS(factor), is compared to a measure of the variation within the levels, MS(error). 2.If the MS(factor) is significantly larger than the MS(error), then the means for each of the factor levels are not all the same. This implies the factor being tested has a significant effect on the response variable. 3.If the MS(factor) is not significantly larger than the MS(error), we cannot reject the null hypothesis that all means are equal.

Example: Do the box-and-whisker plots below show sufficient evidence to indicate a difference in the three population means?

Solution: 1.The box-and-whisker plots show the relationship among the three samples. 2.The plots suggest the three sample means are different from each other. 3.This suggests the population means are different. 4.There is relatively little within-sample variation, but a relatively large amount of between-sample variation.

Example: Do the box-and-whisker plots below show sufficient evidence to indicate a difference in the three population means?

Solution: 1.The box-and-whisker plots show the relationship among the four samples. 2.The plots suggest the four sample means are not different from each other. 3.There is relatively little between-sample variation, but a relatively large amount of within-sample variation. The data values within each sample cover a relatively wide range of values.

Assumptions: 1.Goal: to investigate the effect of various levels of a factor on a response variable. a.We would like to know which level is most advantageous. b.Probably want to reject H 0 in favor of H a. c.A follow-up study might determine the “best” level of the factor. 2.a.The effects due to chance and due to untested factors are normally distributed. b.The variance is constant throughout the experiment. 3.a.All observations are independent. b.The data is gathered (or tests are conducted) in a randomized order to ensure independence.

12.3: Applications of Single- Factor ANOVA Consider the notation used in ANOVA. Each observation has two subscripts: first indicates the column number (test factor level); second identifies the replicate (row) number. The column totals: C i The grand total (sum of all x’s): T

Notation used in ANOVA:

Mathematical Model for Single-Factor ANOVA: 1.  : mean value for all the data without respect to the test factor. 2.F c : effect of factor (level) c on the response variable. 3.  k(c) : experiment error that occurs among the k replicates in each of the c columns.

Example: A study was conducted to determine the effectiveness of various drugs on post-operative pain. The purpose of the experiment was to decide if there is any difference in length of pain relief due to drug. Eighty patients with similar operations were selected at random and split into four groups. Each patient was given one of four drugs and checked regularly. The length of pain relief (in hours) was recorded for each patient. At the 0.05 level of significance, is there any evidence to reject the claim that the four drugs are equally effective? Note: 1.The data is omitted here. 2.The ANOVA table is given in a later slide.

Solution: 1.The Set-up: a.Population parameter of interest: The mean time of pain relief for each factor (drug). b.The null and alternative hypothesis: H 0 :  1 =  2 =  3 =  4 H a : the means are not all equal. 2.The Hypothesis Test Criteria: a.Assumptions: The patients were randomly assigned to drug and their times are independent of each other. The effects due to chance and untested factors are assumed to be normally distributed. b.Test statistic: F* with df(numerator) = df(factor) = 3 and df(denominator) = df(error) = 80  4 = 76 c.Level of significance:  = 0.05

3.The Sample Evidence: a.Sample information: The ANOVA table: b.Calculate the value of the test statistic:

4.The Probability Distribution (Classical Approach): a.Critical value: F(3, 76, 0.05)  2.72 b.F* is in the critical region. 4.The Probability Distribution (p-Value Approach): a.The p-value: P = P(F* > 7.95, with df n = 3, df d = 76) < 0.01 By computer: P .0001 b.The p-value is smaller than the level of significance, . 5.The Results: a.Decision: Reject H 0. b.Conclusion: There is evidence to suggest that not all drugs have the same effect on length of pain relief.