Chapter 9: The Analysis of Variance (ANOVA)

Slides:



Advertisements
Similar presentations
Analysis of Variance (ANOVA) ANOVA methods are widely used for comparing 2 or more population means from populations that are approximately normal in distribution.
Advertisements

i) Two way ANOVA without replication
Chapter 14: One-Way Analysis of Variance (ANOVA)
Analysis of Variance Compares means to determine if the population distributions are not similar Uses means and confidence intervals much like a t-test.
© 2010 Pearson Prentice Hall. All rights reserved The Complete Randomized Block Design.
Studying for the Final
Lecture 10 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
Analysis of Variance Chapter Introduction Analysis of variance compares two or more populations of interval data. Specifically, we are interested.
Comparing Means.
Lecture 13 Multiple comparisons for one-way ANOVA (Chapter 15.7)
Analysis of Variance Chapter 15 - continued Two-Factor Analysis of Variance - Example 15.3 –Suppose in Example 15.1, two factors are to be examined:
Chapter 12: One-Way ANalysis Of Variance (ANOVA) 1.
The Analysis of Variance
Inferences About Process Quality
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. Business Statistics: Communicating with Numbers By Sanjiv Jaggia.
Chapter 12: Analysis of Variance
ANOVA Chapter 12.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
QNT 531 Advanced Problems in Statistics and Research Methods
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
Multiple Comparisons.
Chapter 11 HYPOTHESIS TESTING USING THE ONE-WAY ANALYSIS OF VARIANCE.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Chapter 10 Analysis of Variance.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap th Lesson Analysis of Variance.
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
5-5 Inference on the Ratio of Variances of Two Normal Populations The F Distribution We wish to test the hypotheses: The development of a test procedure.
TOPIC 11 Analysis of Variance. Draw Sample Populations μ 1 = μ 2 = μ 3 = μ 4 = ….. μ n Evidence to accept/reject our claim Sample mean each group, grand.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Analysis of Variance Statistics for Managers Using Microsoft.
I. Statistical Tests: A Repetive Review A.Why do we use them? Namely: we need to make inferences from incomplete information or uncertainty þBut we want.
Chapter 19 Analysis of Variance (ANOVA). ANOVA How to test a null hypothesis that the means of more than two populations are equal. H 0 :  1 =  2 =
1 Analysis of Variance Chapter 14 2 Introduction Analysis of variance helps compare two or more populations of quantitative data. Specifically, we are.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 14 Experimental.
Lecture 9-1 Analysis of Variance
CHAPTER 4 Analysis of Variance One-way ANOVA
Analysis of Variance (One Factor). ANOVA Analysis of Variance Tests whether differences exist among population means categorized by only one factor or.
One-way ANOVA: - Comparing the means IPS chapter 12.2 © 2006 W.H. Freeman and Company.
Chapter 10: Analysis of Variance: Comparing More Than Two Means.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Chapter 11: The ANalysis Of Variance (ANOVA) 1.
Chapter 11: The ANalysis Of Variance (ANOVA) 1.
Chapter 12 Introduction to Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Eighth Edition by Frederick.
1 Experimental Statistics - week 9 Chapter 17: Models with Random Effects Chapter 18: Repeated Measures.
Random samples of size n 1, n 2, …,n k are drawn from k populations with means  1,  2,…,  k and with common variance  2. Let x ij be the j-th measurement.
Psy 230 Jeopardy Related Samples t-test ANOVA shorthand ANOVA concepts Post hoc testsSurprise $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Chapter 11 Analysis of Variance. 11.1: The Completely Randomized Design: One-Way Analysis of Variance vocabulary –completely randomized –groups –factors.
Chap 11-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 11 Analysis of Variance.
Chapter 11: The ANalysis Of Variance (ANOVA)
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 10, Slide 1 Two-Sample Tests and One-Way ANOVA Chapter 10.
One-way ANOVA Example Analysis of Variance Hypotheses Model & Assumptions Analysis of Variance Multiple Comparisons Checking Assumptions.
AP Statistics. Chap 13-1 Chapter 13 Estimation and Hypothesis Testing for Two Population Parameters.
Introduction to ANOVA Research Designs for ANOVAs Type I Error and Multiple Hypothesis Tests The Logic of ANOVA ANOVA vocabulary, notation, and formulas.
1 An example of a more complex design (a four level nested anova) 0 %, 20% and 40% of a tree’s roots were cut with the purpose to study the influence.
ANalysis Of VAriance can be used to test for the equality of three or more population means. H 0 :  1  =  2  =  3  = ... =  k H a : Not all population.
Formula for Linear Regression y = bx + a Y variable plotted on vertical axis. X variable plotted on horizontal axis. Slope or the change in y for every.
SUMMARY EQT 271 MADAM SITI AISYAH ZAKARIA SEMESTER /2015.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Experimental Design and Analysis of Variance Chapter 11.
Stats/Methods II JEOPARDY. Jeopardy Estimation ANOVA shorthand ANOVA concepts Post hoc testsSurprise $100 $200$200 $300 $500 $400 $300 $400 $300 $400.
CHAPTER 3 Analysis of Variance (ANOVA) PART 2 =TWO- WAY ANOVA WITHOUT REPLICATION.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
Chapter 12 Introduction to Analysis of Variance
CHAPTER 3 Analysis of Variance (ANOVA) PART 3 = TWO-WAY ANOVA WITH REPLICATION (FACTORIAL EXPERIMENT)
Comparing Multiple Groups:
CHAPTER 4 Analysis of Variance (ANOVA)
i) Two way ANOVA without replication
Comparing Three or More Means
Comparing Multiple Groups: Analysis of Variance ANOVA (1-way)
Comparing Three or More Means
Chapter 11: The ANalysis Of Variance (ANOVA)
Experimental Statistics - Week 4 (Lab)
Presentation transcript:

Chapter 9: The Analysis of Variance (ANOVA) http://www.luchsinger-mathematics.ch/Var_Reduction.jpg

ANOVA: Examples Do four different brands of gasoline have different effects on automobile efficiency? Does the type of sugar solution (glucose, sucrose, fructose, mixture) have an effect on bacterial growth? 3) Does the resulting color density of a fabric depend on the amount of dye used?

ANOVA: Graphical

ANOVA test statistic

ANOVA test

F Distribution http://www.vosesoftware.com/ModelRiskHelp/index.htm#Distributions/ Continuous_distributions/F_distribution.htm

P-value for an upper-tailed F test shaded area=P-value = 0.05

Table IV F critical values

ANOVA Table: Formulas Source df SS MS (Mean Square) F Model (Between) k – 1 Error (Within) n – k Total n – 1

ANOVA Hypothesis test: Summary H0: μ1 = μ2 =  = μk Ha: At least two μ’s are different Test statistic: P-value: area under the upper tail with an F distribution with (dfm, dfe) degrees of freedom

ANOVA: Example A random sample of 15 healthy young men are split randomly into 3 groups of 5. They receive 0, 20, and 40 mg of the drug Paxil for one week. Then their serotonin levels are measured to determine whether Paxil affects serotonin levels. The data is on the next slide. Does Paxil affect serotonin levels in healthy young men?

ANOVA: Example (cont). Dose 0 mg 20 mg 40 mg 48.62 58.60 68.59 49.85 72.52 78.28 64.22 66.72 82.77 62.81 80.12 76.53 62.51 68.44 72.33

ANOVA: Example (cont) 1. Let 1 be the mean serotonin level for men receiving 0 mg of Paxil. Let 2 be the mean serotonin level for men receiving 20 mg of Paxil. Let 3 be the mean serotonin level for men receiving 40 mg of Paxil.

ANOVA: Example (cont) 2. H0: 1 = 2 = 3; mean serotonin levels are the same at all 3 dosage levels [or, mean serotonin levels are unaffected by Paxil dose] HA: The mean serotonin levels of the three groups are not all equal. [or, serotonin levels are affected by Paxil does]

ANOVA: Example (cont). Dose 0 mg 20 mg 40 mg 48.62 58.60 68.59 49.85 72.52 78.28 64.22 66.72 82.77 62.81 80.12 76.53 62.51 68.44 72.33 overall ni 5 15 x̅i 57.60 69.28 75.70 67.53 si 7.678 7.895 5.460 (ni-1)si2 235.78 249.32 119.24 604.34 ni(x̅i - x̅̅)2 492.56 15.36 333.96 841.88

ANOVA: Example (cont) Source df SS MS 4. F-Ratio 5. P-Value Model 2 841.88 420.94 Error 12 604.34 50.36 Total 14 1446.23

Example: ANOVA (cont) 7. This does give strong support (0.001 < P < 0.01) to the claim that there is a difference in serotonin levels among the groups of men taking 0, 20, and 40 mg of Paxil. This does give strong support (0.001 < P < 0.01) to the claim that Paxil intake affects serotonin levels in young men.

Example: ANOVA Effect plot

Problem with Multiple t tests

Overall Risk of Type I Error in Using Repeated t Tests at  = 0.05

Table IX: Tukey  = 0.05

Table IX: Tukey  = 0.01

Example: Tukey A biologist wished to study the effects of ethanol on sleep time. A sample of 20 rants matched for age and other characteristics, was selected, and each rat was given an oral injection having a particular concentration of ethanol per body weight. The rapid eye movement (REM) sleep time for each rat was then recorded for a 24-hour period with the results in the table on the next slide. Does the data indicate that the true average REM sleep time depends of the concentration of ethanol?

Example: Tukey (cont) Treatment (conc. of ethanol) x̅i si 0 (control) 88.6 73.2 91.4 68.0 75.2 79.28 10.18 1 g/kg 63.0 53.9 69.2 50.1 71.5 61.54 9.34 2 g/kg 44.9 59.5 40.2 56.3 38.7 47.92 9.46 4 g/kg 31.0 39.6 45.3 25.2 22.7 32.76 9.56 Source df SS MS F-Ratio P-Value Model 3 5882.36 1960.79 21.09 0.0001 Error 16 1487.40 92.96 Total 19 7369.76

Example: Tukey (cont) i - j x̅i - x̅j Difference? 0 – 1 17.74 yes 0 – 2 31.36 0 – 4 46.52 1 – 2 13.62 1 – 4 28.78 2 – 4 15.16 x̅4 x̅2 x̅1 x̅0 32.76 47.92 61.54 79.28

OUTPUT: Tukey cldiff The GLM Procedure Tukey's Studentized Range (HSD) Test for sleep NOTE: This test controls the Type I experimentwise error rate. Alpha 0.05 Error Degrees of Freedom 16 Error Mean Square 92.9625 Critical Value of Studentized Range 4.04609 Minimum Significant Difference 17.446 Comparisons significant at the 0.05 level are indicated by ***. Difference amount Between Simultaneous 95% Comparison Means Confidence Limits 1 - 2 17.740 0.294 35.186 *** 1 - 3 31.360 13.914 48.806 *** 1 - 4 46.520 29.074 63.966 *** 2 - 1 -17.740 -35.186 -0.294 *** 2 - 3 13.620 -3.826 31.066 2 - 4 28.780 11.334 46.226 *** 3 - 1 -31.360 -48.806 -13.914 *** 3 - 2 -13.620 -31.066 3.826 3 - 4 15.160 -2.286 32.606 4 - 1 -46.520 -63.966 -29.074 *** 4 - 2 -28.780 -46.226 -11.334 *** 4 - 3 -15.160 -32.606 2.286

OUTPUT: Tukey lines The GLM Procedure Tukey's Studentized Range (HSD) Test for sleep NOTE: This test controls the Type I experimentwise error rate, but it generally has a higher Type II error rate than REGWQ. Alpha 0.05 Error Degrees of Freedom 16 Error Mean Square 92.9625 Critical Value of Studentized Range 4.04609 Minimum Significant Difference 17.446 Means with the same letter are not significantly different. Tukey Grouping Mean N amount A 79.280 5 1 B 61.540 5 2 B C B 47.920 5 3 C C 32.760 5 4

Critical values for Dunnett’s Method

Example: Dunnett A biologist wished to study the effects of ethanol on sleep time. A sample of 20 rants matched for age and other characteristics, was selected, and each rat was given an oral injection having a particular concentration of ethanol per body weight. The rapid eye movement (REM) sleep time for each rat was then recorded for a 24-hour period with the results in the table on the next slide. Does the data indicate that the true average REM sleep time is different for the ingestion of any alcohol?

Example: Dunnett (cont) Treatment (conc. of ethanol) x̅i si 0 (control) 88.6 73.2 91.4 68.0 75.2 79.28 10.18 1 g/kg 63.0 53.9 69.2 50.1 71.5 61.54 9.34 2 g/kg 44.9 59.5 40.2 56.3 38.7 47.92 9.46 4 g/kg 31.0 39.6 45.3 25.2 22.7 32.76 9.56 Source df SS MS F-Ratio P-Value Model 3 5882.36 1960.79 21.09 0.0001 Error 16 1487.40 92.96 Total 19 7369.76

Example: Dunnett (cont) i - j x̅i - x̅j Difference? 0 – 1 17.74 yes 0 – 2 31.36 0 – 4 46.52

OUTPUT: Dunnett cldiff The GLM Procedure Dunnett's t Tests for sleep NOTE: This test controls the Type I experimentwise error for comparisons of all treatments against a control. Alpha 0.05 Error Degrees of Freedom 16 Error Mean Square 92.9625 Critical Value of Dunnett's t 2.59232 Minimum Significant Difference 15.808 Comparisons significant at the 0.05 level are indicated by ***. Difference amount Between Simultaneous 95% Comparison Means Confidence Limits 2 - 1 -17.740 -33.548 -1.932 *** 3 - 1 -31.360 -47.168 -15.552 *** 4 - 1 -46.520 -62.328 -30.712 ***

Randomized Block Experiments Researchers are interested in the effect that acid has on growth rate of alfalfa plants. To control sunlight, the randomized block procedure is used.

ANOVA Table: RCB Source df SS MS F Factor A (treatment) a – 1 Factor B (block) b - 1 Error (Within) (a-1)(b-1) SST – SSA - SSB Total n –1 = ab– 1

Example: RBD An experiment was designed to study the performance of four different detergents in cleaning clothes. The clothes were measured on cleanness (higher= cleaner) after being stained with three different common stains and cleaned with the four detergents. We want to know which if any detergent is better. Stain 1 Stain 2 Stain 3 Detergent 1 45 43 51 Detergent 2 47 46 52 Detergent 3 48 50 55 Detergent 4 42 37 49

Example: RBD (cont) 1. Let 1 be the mean cleaning value for detergent 1. Let 2 be the mean cleaning value for detergent 2. Let 3 be the mean cleaning value for detergent 3. Let 4 be the mean cleaning value for detergent .

Example: RBD (cont) 2. H0: 1 = 2 = 3 = 4; mean cleaning levels for the 4 detergents are the same. HA: At least two of the detergents have different cleaning levels.

Example: RBD (cont) Source df SS MS F P Value Factor A (soap) 3 110.92 36.97 Factor B (stain) 2 135.17 67.58 Error 6 18.83 3.14 Total 11 264.92 7. This data does give strong support (P = 0.0063) to the claim that there is difference in how well the four detergents clean stains.

Example: RBD 2 An experiment was designed to study the performance of four different detergents in cleaning clothes. The clothes were measured on cleanness (higher= cleaner) after being stained with three different common stains and cleaned with the four detergents. We want to know if the stains (blocks) have an effect. Stain 1 Stain 2 Stain 3 Detergent 1 45 43 51 Detergent 2 47 46 52 Detergent 3 48 50 55 Detergent 4 42 37 49

Example: RBD 2 (cont) 1. Let 1 be the mean cleaning value for stain 1. Let 2 be the mean cleaning value for stain 2. Let 3 be the mean cleaning value for stain 3.

Example: RBD 2 (cont) 2. H0: 1 = 2 = 3 = 4; mean cleaning levels for the 4 detergents are the same. HA: At least two of the detergents have different cleaning levels.

Example: RBD 2 (cont) Source df SS MS F P Value Factor A (soap) 3 110.92 36.97 Factor B (stain) 2 135.17 67.58 Error 6 18.83 3.14 Total 11 264.92 7. This data does give strong support (P = 0.0018) to the claim that there is difference in how well the detergents clean the different stains.

Example: RBD (cont) i - j x̅i - x̅j Difference? 1 – 2 2.000 1 – 3 4.667 1 – 4 3.667 2 – 3 -2.667 2 – 4 5.667 yes 3 – 4 8.333 x̅4 x̅1 x̅2 x̅3 42.67 46.33 48.33 51.00