DSCI 346 Yamasaki Lecture 4 ANalysis Of Variance.

Slides:



Advertisements
Similar presentations
Copyright 2004 David J. Lilja1 Comparing Two Alternatives Use confidence intervals for Before-and-after comparisons Noncorresponding measurements.
Advertisements

Chapter 10 Hypothesis Testing Using Analysis of Variance (ANOVA)
Chapter 11 Analysis of Variance
Chapter 12 ANALYSIS OF VARIANCE.
Analysis and Interpretation Inferential Statistics ANOVA
© 2010 Pearson Prentice Hall. All rights reserved Single Factor ANOVA.
1 1 Slide © 2009, Econ-2030 Applied Statistics-Dr Tadesse Chapter 10: Comparisons Involving Means n Introduction to Analysis of Variance n Analysis of.
Statistics for Managers Using Microsoft® Excel 5th Edition
Independent Sample T-test Formula
Chapter 10 Hypothesis Testing III (ANOVA). Basic Logic  ANOVA can be used in situations where the researcher is interested in the differences in sample.
Chapter 11 Analysis of Variance
Analysis of Variance: Inferences about 2 or More Means
Statistics Are Fun! Analysis of Variance
Chapter 17 Analysis of Variance
1 Pertemuan 10 Analisis Ragam (Varians) - 1 Matakuliah: I0262 – Statistik Probabilitas Tahun: 2007 Versi: Revisi.
Lecture 12 One-way Analysis of Variance (Chapter 15.2)
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 15 Analysis of Variance.
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
Chapter 12: Analysis of Variance
F-Test ( ANOVA ) & Two-Way ANOVA
1 1 Slide 統計學 Spring 2004 授課教師:統計系余清祥 日期: 2004 年 3 月 30 日 第八週:變異數分析與實驗設計.
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 14 Analysis.
Analysis of Variance Nutan S. Mishra Department of Mathematics and Statistics University of South Alabama.
QNT 531 Advanced Problems in Statistics and Research Methods
12-1 Chapter Twelve McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved.
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
INFERENTIAL STATISTICS: Analysis Of Variance ANOVA
1 1 Slide Analysis of Variance Chapter 13 BA 303.
© 2003 Prentice-Hall, Inc.Chap 11-1 Analysis of Variance IE 340/440 PROCESS IMPROVEMENT THROUGH PLANNED EXPERIMENTATION Dr. Xueping Li University of Tennessee.
© Buddy Freeman, 2015 H 0 : H 1 : α = Decision Rule: If then do not reject H 0, otherwise reject H 0. Test Statistic: Decision: Conclusion: We have found.
12-1 Chapter Twelve McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
© 2002 Prentice-Hall, Inc.Chap 9-1 Statistics for Managers Using Microsoft Excel 3 rd Edition Chapter 9 Analysis of Variance.
One-Factor Analysis of Variance A method to compare two or more (normal) population means.
Chapter 10 Analysis of Variance.
ANOVA (Analysis of Variance) by Aziza Munir
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
TOPIC 11 Analysis of Variance. Draw Sample Populations μ 1 = μ 2 = μ 3 = μ 4 = ….. μ n Evidence to accept/reject our claim Sample mean each group, grand.
Chapter 19 Analysis of Variance (ANOVA). ANOVA How to test a null hypothesis that the means of more than two populations are equal. H 0 :  1 =  2 =
One-Way Analysis of Variance
One-Way ANOVA ANOVA = Analysis of Variance This is a technique used to analyze the results of an experiment when you have more than two groups.
Lecture 9-1 Analysis of Variance
Chapter 17 Comparing Multiple Population Means: One-factor ANOVA.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
ONE WAY ANALYSIS OF VARIANCE ANOVA o It is used to investigate the effect of one factor which occurs at h levels (≥3). Example: Suppose that we wish to.
ANOVA P OST ANOVA TEST 541 PHL By… Asma Al-Oneazi Supervised by… Dr. Amal Fatani King Saud University Pharmacy College Pharmacology Department.
12-1 Chapter Twelve McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Chapter 4 Analysis of Variance
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
Hypothesis test flow chart frequency data Measurement scale number of variables 1 basic χ 2 test (19.5) Table I χ 2 test for independence (19.9) Table.
CHAPTER 12 ANALYSIS OF VARIANCE Prem Mann, Introductory Statistics, 7/E Copyright © 2010 John Wiley & Sons. All right reserved.
CHAPTER 10 ANOVA - One way ANOVa.
1/54 Statistics Analysis of Variance. 2/54 Statistics in practice Introduction to Analysis of Variance Analysis of Variance: Testing for the Equality.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
Chapter 13 Analysis of Variance (ANOVA). ANOVA can be used to test for differences between three or more means. The hypotheses for an ANOVA are always:
Chapter 11 Analysis of Variance
Sampling distribution of
Statistics for Managers Using Microsoft Excel 3rd Edition
i) Two way ANOVA without replication
Statistics Analysis of Variance.
Statistics for Business and Economics (13e)
Econ 3790: Business and Economic Statistics
Chapter 11 Analysis of Variance
Hypothesis Tests for a Standard Deviation
Chapter 15 Analysis of Variance
Chapter 10 – Part II Analysis of Variance
Presentation transcript:

DSCI 346 Yamasaki Lecture 4 ANalysis Of Variance

So far we have looked at 1 and 2 sample problems. We now generalize to more than 2 groups. Group 1 Group 2Group 3 x1x1 x2x2 x3x3 x Notation: x ij = jth observation in ith group DSCI Lect 4 (23 pages)2

So total variability is broken down into two parts: 1)Variability due to different means (SSB) 2)Variability due to variation about each mean (SSW) DSCI Lect 4 (23 pages)3

Basic assumptions: 1) Normality 2) Equal variance for all groups 3) Independent data points Recall: DSCI Lect 4 (23 pages)4

Consider the following situation: H 0 :       ...    H A : not all equal If H 0 is true, then x i ’s should be “close” to each other. So SSB =  n i (x i – x) 2 should be “relatively small If there are a lot of groups, then we will be adding up a lot of numbers and the sum will have a tendency to get large just because it is made up of more deviations even if each deviation is small, similar to the chi square situation. So to take this into account, divide SSB by k-1 SSTB (k-1) = MSB (Mean Square Between) DSCI Lect 4 (23 pages)5

If H 0 is true, then MSB is relatively small; when H 0 is false, then MSB gets larger. As before, the $1,000,000 question is “How large is too large?” Consider OH NO! Not another table! Yes, another table; but it beats the living daylights out of the alternative of trying to calculate the distribution by hand. DSCI Lect 4 (23 pages)6

Note: What we have is essentially, the ratio of two chi square variables, each with its own degrees of freedom. The numerator degrees of freedom( 1 ) is k-1, where k is the number of groups. The denominator degrees of freedom ( 2 ) is N-k where N is the total number of observations in all the groups combined. In these tables, rows correspond to the denominator degrees of freedom, the columns correspond to the numerator degrees of freedom, and the tables correspond to different significance levels. Note that the numerator df and the denominator df are not interchangable F 3,2,05 = = F 2.3,.05 = 9.55 DSCI Lect 4 (23 pages)7

Example: The table below shows the results of a study on the average dollar value for orders made by individuals at the four locations of a sandwich company LocationSample SizeMeanVariance Location 18$ Location 28$ Location 38$ Location 48$ Do the data provide sufficient evidence to indicate differences in the mean dollar values? Test using  =.05 DSCI Lect 4 (23 pages)8

Typical way of reporting (ANOVA table) SourceSSdfMSF BetweenSSBk-1 MSB MSB/MSW (a.k.a. among) WithinSSWN-k MSW (a.k.a. error) Total SST N-1 DSCI Lect 4 (23 pages)9

SourceSSdfMSF Between Within Total From F table F 3,28,.05 = 2.95 H 0 :  1 =  2 =  3 =  4 H A : not all equal  =.05 Since F=1.03 is not larger than 2.95 we cannot reject null hypothesis. Conclude insufficient evidence to indicate a difference in average dollar values for orders. DSCI Lect 4 (23 pages)10

Notes: 1) Since everything is squared (like chi square) there is no such thing as a one tailed test. 2) If k=2, then this test is equivalent to 3) Even though theory is based on normality, in practice normality isn’t that important 4) If we reject H 0, we may want to know specifically which means are different. There are several approaches available, but not in this class. 5) This technique can be generalized to include more than one grouping variable, or non independent groups. DSCI Lect 4 (23 pages)11

DSCI Lect 4 (23 pages) Multiple Comparisons: What to do if reject H 0 Suppose we have rejected the null hypothesis and now we want to know which means are different. Comparing Pairs of Population Means using Tukey-Cramer Multiple Comparisons for One-Way ANOVA H 0 :  i =  j H A :  i ≠  j Example Group 1Group 2Group 3 Sample mean Sample size MSW = F test rejected that all population means equal 12

DSCI Lect 4 (23 pages) 13

DSCI Lect 4 (23 pages) |x i – x j | 1 vs Yes 1 vs Yes 2 vs No Significant? 14

What we just introduced is known as One-Way ANOVA (a generalization of the two independent population means problem. In the two population means problem we talked about pairing (non-independent) as a way to control for the impact of other variables. The concept of non-independence can also arise in the ANOVA situation, in which case they are known as blocks. Consider the following example: A bank is comparing the average house appraisal value between three appraisal companies. If a random sample from each company is chosen, problems could arise due to different characteristics of the houses chosen. To control for this, the bank will have each appraisal company appraise the same houses; the houses are the blocks DSCI Lect 4 (23 pages)15

DSCI Lect 4 (23 pages) Randomized Complete Block ANOVA Randomized Complete Block ANOVA is similar to One-Way ANOVA, but has an additional sum of squares, SSBL, to compute and the way the pieces are obtained is a little different Here SST = SSB + SSBL + SSE, where SSE (error) takes the place of SSW and is obtained by first calculating SST Where X j is the average of the jth block and k is the number of levels of the blocks; just like we calculated SSB Just like s 2. Note you can calculate SST for one-way in the same fashion 16

DSCI Lect 4 (23 pages) Example: Comparing three appraisal companies using the same five houses House(Block)Co1Co2Com3Block Average Ave

ANOVA table SourceSSdfMSF BetweenSSBk-1MSBMSB/MSW BlocksSSBLb-1MBBLMSBL/MSW ErrorSSW(k-1)(b-1)MSE Total SST N-1 DSCI Lect 4 (23 pages)18

ANOVA table SourceSSdfMS F Between Blocks Error Total F 2,8,05 = Since >4.459, sufficient evidence to conclude not all average appraisals are the same. DSCI Lect 4 (23 pages)19

DSCI Lect 4 (23 pages) Comparing Pairs of Population Means using Tukey-Cramer Multiple Comparisons for Randomized Blocks H 0 :  i =  j H A :  i ≠  j Group 1Group 2Group 3 Sample mean Sample size 5 55 MSE = F test rejected that all population means equal 20

DSCI Lect 4 (23 pages) 21

DSCI Lect 4 (23 pages) |x i – x j | 1 vs Yes 1 vs No 2 vs Yes Significant? 22

DSCI Lect 4 (23 pages) One last use of F Table Remember we talked about the small sample t-test for the difference of two means only being valid if the two population variances are equal? We use the F tables to test the hypothesis of equality. F = s 1 2 /s 2 2, where s 1 2 >s 2 2 and use the corresponding df’s 23