QNT 531 Advanced Problems in Statistics and Research Methods

Slides:



Advertisements
Similar presentations
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 10 The Analysis of Variance.
Advertisements

Chapter 11 Analysis of Variance
Analysis of Variance (ANOVA) ANOVA can be used to test for the equality of three or more population means We want to use the sample results to test the.
Design of Experiments and Analysis of Variance
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Statistics for Managers Using Microsoft® Excel 5th Edition
Part I – MULTIVARIATE ANALYSIS
Chapter 11 Analysis of Variance
ANOVA Determining Which Means Differ in Single Factor Models Determining Which Means Differ in Single Factor Models.
Statistics for Business and Economics
Statistics for Managers Using Microsoft® Excel 5th Edition
Analysis of Variance Chapter 15 - continued Two-Factor Analysis of Variance - Example 15.3 –Suppose in Example 15.1, two factors are to be examined:
8. ANALYSIS OF VARIANCE 8.1 Elements of a Designed Experiment
Copyright ©2011 Pearson Education 11-1 Chapter 11 Analysis of Variance Statistics for Managers using Microsoft Excel 6 th Global Edition.
Go to Table of ContentTable of Content Analysis of Variance: Randomized Blocks Farrokh Alemi Ph.D. Kashif Haqqi M.D.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Analysis of Variance Statistics for Managers Using Microsoft.
Chap 10-1 Analysis of Variance. Chap 10-2 Overview Analysis of Variance (ANOVA) F-test Tukey- Kramer test One-Way ANOVA Two-Way ANOVA Interaction Effects.
Chapter 12: Analysis of Variance
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide 統計學 Spring 2004 授課教師:統計系余清祥 日期: 2004 年 3 月 30 日 第八週:變異數分析與實驗設計.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 14 Analysis.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Statistics for Business and Economics Chapter 8 Design of Experiments and Analysis of Variance.
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 13 Experimental Design and Analysis of Variance nIntroduction to Experimental Design.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Comparing Three or More Means 13.
© 2003 Prentice-Hall, Inc.Chap 11-1 Analysis of Variance IE 340/440 PROCESS IMPROVEMENT THROUGH PLANNED EXPERIMENTATION Dr. Xueping Li University of Tennessee.
© 2002 Prentice-Hall, Inc.Chap 9-1 Statistics for Managers Using Microsoft Excel 3 rd Edition Chapter 9 Analysis of Variance.
Analysis of Variance ST 511 Introduction n Analysis of variance compares two or more populations of quantitative data. n Specifically, we are interested.
© Copyright McGraw-Hill CHAPTER 12 Analysis of Variance (ANOVA)
CHAPTER 12 Analysis of Variance Tests
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Experimental Design and Analysis of Variance Chapter 10.
Chapter 10 Analysis of Variance.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
TOPIC 11 Analysis of Variance. Draw Sample Populations μ 1 = μ 2 = μ 3 = μ 4 = ….. μ n Evidence to accept/reject our claim Sample mean each group, grand.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Analysis of Variance Statistics for Managers Using Microsoft.
1 Analysis of Variance Chapter 14 2 Introduction Analysis of variance helps compare two or more populations of quantitative data. Specifically, we are.
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 14 Experimental.
Chapter 10: Analysis of Variance: Comparing More Than Two Means.
1 ANALYSIS OF VARIANCE (ANOVA) Heibatollah Baghi, and Mastee Badii.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Chapter 11 Analysis of Variance. 11.1: The Completely Randomized Design: One-Way Analysis of Variance vocabulary –completely randomized –groups –factors.
Chapter 4 Analysis of Variance
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
Chap 11-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 11 Analysis of Variance.
1/54 Statistics Analysis of Variance. 2/54 Statistics in practice Introduction to Analysis of Variance Analysis of Variance: Testing for the Equality.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Experimental Design and Analysis of Variance Chapter 11.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
ENGR 610 Applied Statistics Fall Week 8 Marshall University CITE Jack Smith.
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 10 Introduction to the Analysis.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 Pertemuan 19 Analisis Varians Klasifikasi Satu Arah Matakuliah: I Statistika Tahun: 2008 Versi: Revisi.
Factorial Experiments
ANOVA Econ201 HSTS212.
CHAPTER 13 Design and Analysis of Single-Factor Experiments:
i) Two way ANOVA without replication
Comparing Three or More Means
Statistics Analysis of Variance.
Chapter 10: Analysis of Variance: Comparing More Than Two Means
Statistics for Business and Economics (13e)
Econ 3790: Business and Economic Statistics
Chapter 11 Analysis of Variance
Chapter 10 – Part II Analysis of Variance
STATISTICS INFORMED DECISIONS USING DATA
Presentation transcript:

QNT 531 Advanced Problems in Statistics and Research Methods WORKSHOP 2 By Dr. Serhat Eren University OF PHOENIX Dr. Serhat Eren University of Phoenix

ANALYSIS OF VARIANCE AND EXPERIMENTAL DESIGN SECTION 2 ANALYSIS OF VARIANCE AND EXPERIMENTAL DESIGN

SECTION 2 SECTION OBJECTIVES An Introduction to Analysis of Variance Analysis of Variance: Testing for the Equality of k population means Multiple comparison procedures An introduction to Experimental Design Completely Randomized Designs Randomized Block Design Factorial Experiment

SECTION 2 ANALYSIS OF DATA FROM ONE-WAY DESIGNS One-Way Designs: The Basics A factor is a variable that can be used to differentiate one group or population from another. It is a variable that may be related to he variable of interest. A level is one of several possible values or settings that the factor can assume. The response variable is a quantitative variable that you are measuring or observing.

SECTION 2 ANALYSIS OF DATA FROM ONE-WAY DESIGNS These are all examples of one-way or completely randomized designs. An experiment has a one-way or completely randomized design if there are several different levels of one factor being studied and the objects or people being observed/ measured are randomly assigned to one of the levels of the factor.

SECTION 2 ANALYSIS OF DATA FROM ONE-WAY DESIGNS The term one-way refers to the fact that the groups differ with regard to the one factor being studied. The term completely randomized refers to the fact that individual observations are assigned to the groups in a random manner.

SECTION 2 ANALYSIS OF DATA FROM ONE-WAY DESIGNS Understanding the Total Variation Analysis of variance (ANOYA) is the technique used to analyze the variation in the data to determine if more than two population means are equal. A treatment is a particular setting or combination of settings of the factor(s)

SECTION 2 ANALYSIS OF DATA FROM ONE-WAY DESIGNS The grand mean or the overall mean is the sample average of all the observations in the experiment. It is labeled (x-bar-bar). Now we can rewrite the variance calculations as follows:

SECTION 2 ANALYSIS OF DATA FROM ONE-WAY DESIGNS The total variation or sum of squares total (SST) is a measure of the variability in the entire data set considered as a whole. SST is calculated as follows:

SECTION 2 ANALYSIS OF DATA FROM ONE-WAY DESIGNS Components of Total Variation The between groups variation is also called the Sum or Squares between or the Sum of Squares Among and it measures how much of the total variation comes from actual differences in the treatments. The dot-plot shown in Figure 14.3 displays the sample average for each of the four time treatments. These are called treatment means.

SECTION 2 ANALYSIS OF DATA FROM ONE-WAY DESIGNS A treatment mean is the average of the response variable for a particular treatment. Between Groups Variation measures how different the individual treatment means are from the overall grand mean. It is often called the sum of squares between or the sum of squares among (SSA).

SECTION 2 ANALYSIS OF DATA FROM ONE-WAY DESIGNS The formula for sum of squares among (SSA) is: Within groups variation measures the variability in the measurements within the groups. It is often called sum of squares within or sum of squares error (SSE).

SECTION 2 ANALYSIS OF DATA FROM ONE-WAY DESIGNS The Mean Square Terms in the ANOVA Table The mean square among is labeled MSA The mean square error is labeled MSE and the mean square total is labeled MST. The formulas for the mean squares are;

SECTION 2 ANALYSIS OF DATA FROM ONE-WAY DESIGNS Testing the Hypothesis of Equal Means In general, the null and alternative hypotheses for a one-way designed experiment are shown below: HA: At least one of the population means is different from the others.

SECTION 2 ANALYSIS OF DATA FROM ONE-WAY DESIGNS The formula for the F test statistic is calculated by taking the ratio of the two sample variances: In ANOVA, MSA and MSE are our two sample variances. So the F statistic is calculated as:

SECTION 2 ASSUMPTIONS OF ANOVA The three major assumptions of ANOVA are as follows: The errors are random and independent of each other. Each population has a normal distribution. All of the populations have the same variance.

SECTION 2 ANALYSIS OF DATA FROM BLOCKED DESIGNS A block is a group or objects or people that have been matched. Are object or person can be matched with itself, meaning that repeated observations are taken on that object or person and these observations form a block? If the realities of data collection lead you to use blocks, then you must take this into account in your analysis. Your experimental design is called a randomized block design. Instead of using a one-way ANOVA you must use a block ANOVA.

SECTION 2 ANALYSIS OF DATA FROM BLOCKED DESIGNS An experiment has a randomized block design if several different levels of one factor are being studied and the objects or people being observed/ measured have been matched. Each object or person is randomly assigned to one of the c levels of the factor.

SECTION 2 ANALYSIS OF DATA FROM BLOCKED DESIGNS Partitioning the Total Variation Like the approach we took with data from a one-way design, the idea is to take the total variability as measured by SST and break it down into its components. With a block design there is one additional component: the variability between the blocks. It is called the sum of squares blocks and is labeled SSBL.

SECTION 2 ANALYSIS OF DATA FROM BLOCKED DESIGNS The sum of squares blocks measures the variability between the blocks. It is labeled SSBL. For a block design, the variation we see in the data is due to one of three things: the level of the factor, the block, or the error.

SECTION 2 ANALYSIS OF DATA FROM BLOCKED DESIGNS Thus, the total variation is divided into three components: SST = SSA + SSBL + SSE

SECTION 2 ANALYSIS OF DATA FROM BLOCKED DESIGNS Using the ANOVA Table in a Block Design The ANOVA table for such a block design looks just like the ANOVA table for a one-way design with an additional row.

SECTION 2 ANALYSIS OF DATA FROM TWO-WAY DESIGNS Motivation for a Factorial Design Model An experimental design is called a factorial design with two factors if there are several different levels of two factors being studied. The first factor is called factor A and there are r levels of factor A. The second factor is called factor B and there are c levels of factor B.

SECTION 2 ANALYSIS OF DATA FROM TWO-WAY DESIGNS The design is said to have equal replication if the same number of objects or people being observed/measured are randomly selected from each population. The population is described by a specific level for each of the two factors. Each observation is called a replicate. There are n' observations or replicates observed from each population. There are n = n'rc observations in total.

SECTION 2 ANALYSIS OF DATA FROM TWO-WAY DESIGNS Partitioning the Variation The sum of squares due to factor A is labeled SSA. It measures the squared differences between the mean of each level of factor A and the grand mean. The sum of squares due to factor B is labeled SSB. It measures the squared differences between the mean of each level of factor B and the grand mean.

SECTION 2 ANALYSIS OF DATA FROM TWO-WAY DESIGNS The sum of squares due to the interacting effect of A and B is labeled SSAB. It measures the effect of combining factor A and factor B. The sum of squares error is labeled SSE. It measures the variability in the measurements within the groups. Thus, the total variation is divided into four components: SST = SSA + SSB + SSAB + SSE

SECTION 2 ANALYSIS OF DATA FROM TWO-WAY DESIGNS Using the ANOVA Table in a Two-Way Design The ANOVA table for such a design looks just like the ANOVA table for a one-way design with two additional rows.

SECTION 2 ANALYSIS OF DATA FROM TWO-WAY DESIGNS Using the ANOVA Table in a Two-Way Design In a two-way ANOVA, three hypothesis tests should be done. To test the hypothesis of no difference due to factor A we would have the following null and alternative hypotheses: Ho: There is no difference in the population means due to factor A. HA: There is a difference in the population means due to factor A.

SECTION 2 ANALYSIS OF DATA FROM TWO-WAY DESIGNS To test the hypothesis of no difference due to factor B we would have the following null and alternative hypotheses: Ho: There is no difference in the population means due to factor B. HA: There is a difference in the population means due to factor B.

SECTION 2 ANALYSIS OF DATA FROM TWO-WAY DESIGNS To test the hypothesis of no difference due to the interaction of factors A and B we would have the following null and alternative hypotheses: Ho: There is no difference in the population means due to the interaction of factors A and B. HA: There is a difference in the population means due to the interaction of factors A arid B.

SECTION 2 ANALYSIS OF DATA FROM TWO-WAY DESIGNS Understanding the interaction Effect The easiest way to understand this effect is to look at a graph of the sample averages for each of the possible combinations of the two factors. The line graph shown in Figure 14.7 displays the 20 sample means for airspace.

SECTION 2 ANALYSIS OF DATA FROM TWO-WAY DESIGNS From this graph you can see that the mean airspace decreases the longer the box sits on the shelf, regardless of from what position in the hardroll the box was made. The airspace behavior is affected by the interaction of the time on the shelf and the position in the hardroll from which it was made.

SECTION 2 ANALYSIS OF DATA FROM TWO-WAY DESIGNS If there were no interaction effect, the lines connecting the sample means would be parallel as in Figure 14.8.

SECTION 2 MULTIPLE COMPARISON PROCEDURE When we use analysis of variance to test whether the means of k populations are equal, rejection of the null hypothesis allows us to conclude only that the population means are not all equal. In some cases we will want to go a step further and determine where the differences among means occur.

SECTION 2 MULTIPLE COMPARISON PROCEDURE The purpose of this section is to introduce two multiple comparison procedures that can be used to conduct statistical comparisons between pairs of population means.

SECTION 2 MULTIPLE COMPARISON PROCEDURE 2.3.1 FISHER’S LSD Suppose that analysis of variance has provided statistical evidence to reject the null hypothesis of equal population means. In this case, Fisher’s least significant difference (LSD) procedure can be used to determine where the differences occur.

SECTION 2 MULTIPLE COMPARISON PROCEDURE Confidence Interval Estimate of the Difference Between Two Population Means Using Fisher’s LSD Procedure:

SECTION 2 MULTIPLE COMPARISON PROCEDURE TYPE I ERROR RATES We showed how Fisher’s LSD procedure can be used in such cases to determine where the differences occur. Technically, it is referred to as a protected or restricted LSD test because it is employed only if we first find a significant F value by using analysis of variance.

SECTION 2 MULTIPLE COMPARISON PROCEDURE To see why this distinction is important in multiple comparison tests, we need to explain the difference between a comparisonwise Type I error rate and an experimentwise Type I error rate.

SECTION 2 MULTIPLE COMPARISON PROCEDURE For example, in the NCP example Fisher’s LSD procedure was used to make three pairwise comparisons.

SECTION 2 MULTIPLE COMPARISON PROCEDURE In each case, we used a level of significance of = 0.05. Therefore, for each test, if the null hypothesis is true, the probability that we will make a Type I error is = 0.05; hence, the probability that we will not make a Type I error on each test is 1- 0.05= 0.95.

SECTION 2 MULTIPLE COMPARISON PROCEDURE In discussing multiple comparison procedures we refer to this probability of a Type I error (= 0.05) as the comparisonwise Type I error rate; comparisonwise Type I error rates indicate the level of significance associated with a single pairwise comparison. Let us now consider a slightly different question. What is the probability that in making three pairwise comparisons, we will commit a Type I error on at least one of the three tests?

SECTION 2 MULTIPLE COMPARISON PROCEDURE To answer this question, note that the probability that we will not make a Type I error on any of the three tests is; (.95)(.95)(.95)=0 .8574. Therefore, the probability of making at least one Type I error is: 1-0.8574= 0.1426

SECTION 2 MULTIPLE COMPARISON PROCEDURE Thus, when we use Fisher’s LSD procedure to make all three pairwise comparisons, the Type I error rate associated with this approach is not .05, but actually 0.1426; we refer to this error rate as the overall or experimentwise Type I error rate.