Chapter 14: Analysis of Variance One-way ANOVA Lecture 9a Instructor: Naveen Abedin Date: 24 th November 2015.

Slides:



Advertisements
Similar presentations
Chapter 14 ANOVA 1.
Advertisements

Design of Experiments and Analysis of Variance
1 1 Slide © 2009, Econ-2030 Applied Statistics-Dr Tadesse Chapter 10: Comparisons Involving Means n Introduction to Analysis of Variance n Analysis of.
ANALYSIS OF VARIANCE.
Analysis of Variance Chapter Introduction Analysis of variance compares two or more populations of interval data. Specifically, we are interested.
Statistics Are Fun! Analysis of Variance
Chapter 3 Analysis of Variance
Lecture 10 Inference about the difference between population proportions (Chapter 13.6) One-way analysis of variance (Chapter 15.2)
Lecture 9: One Way ANOVA Between Subjects
Lecture 12 One-way Analysis of Variance (Chapter 15.2)
1 Chapter 13: Introduction to Analysis of Variance.
Chi-Square Tests and the F-Distribution
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
The basic idea So far, we have been comparing two samples
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 14 Analysis.
QNT 531 Advanced Problems in Statistics and Research Methods
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 13 Experimental Design and Analysis of Variance nIntroduction to Experimental Design.
1 1 Slide Analysis of Variance Chapter 13 BA 303.
Analysis of Variance Chapter 12 Introduction Analysis of variance compares two or more populations of interval data. Specifically, we are interested.
Analysis of Variance ( ANOVA )
One-Way Analysis of Variance Comparing means of more than 2 independent samples 1.
12-1 Chapter Twelve McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Analysis of Variance ST 511 Introduction n Analysis of variance compares two or more populations of quantitative data. n Specifically, we are interested.
Chapter 11 HYPOTHESIS TESTING USING THE ONE-WAY ANALYSIS OF VARIANCE.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Chapter 12: Introduction to Analysis of Variance
© Copyright McGraw-Hill CHAPTER 12 Analysis of Variance (ANOVA)
ANOVA (Analysis of Variance) by Aziza Munir
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
Testing Hypotheses about Differences among Several Means.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Chapter 12 Analysis of Variance 12.2 One-Way ANOVA.
Chapter 14 – 1 Chapter 14: Analysis of Variance Understanding Analysis of Variance The Structure of Hypothesis Testing with ANOVA Decomposition of SST.
INTRODUCTION TO ANALYSIS OF VARIANCE (ANOVA). COURSE CONTENT WHAT IS ANOVA DIFFERENT TYPES OF ANOVA ANOVA THEORY WORKED EXAMPLE IN EXCEL –GENERATING THE.
1 Analysis of Variance Chapter 14 2 Introduction Analysis of variance helps compare two or more populations of quantitative data. Specifically, we are.
Chapter 9 Three Tests of Significance Winston Jackson and Norine Verberg Methods: Doing Social Research, 4e.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
Chapter 14 – 1 Chapter 14: Analysis of Variance Understanding Analysis of Variance The Structure of Hypothesis Testing with ANOVA Decomposition of SST.
Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.
Marketing Research Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides 1.
Marketing Research Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Chapter 12 Introduction to Analysis of Variance PowerPoint Lecture Slides Essentials of Statistics for the Behavioral Sciences Eighth Edition by Frederick.
12-1 Chapter Twelve McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Business Statistics: A First Course (3rd Edition)
Chapter 13: Inferences about Comparing Two Populations Lecture 8b Date: 15 th November 2015 Instructor: Naveen Abedin.
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
The Analysis of Variance ANOVA
1/54 Statistics Analysis of Variance. 2/54 Statistics in practice Introduction to Analysis of Variance Analysis of Variance: Testing for the Equality.
381 Introduction to ANOVA QSCI 381 – Lecture 43 (Larson and Farber, Sect 10.4)
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
CHAPTER 10: ANALYSIS OF VARIANCE(ANOVA) Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
The 2 nd to last topic this year!!.  ANOVA Testing is similar to a “two sample t- test except” that it compares more than two samples to one another.
Chapter 13 Analysis of Variance (ANOVA). ANOVA can be used to test for differences between three or more means. The hypotheses for an ANOVA are always:
i) Two way ANOVA without replication
Applied Business Statistics, 7th ed. by Ken Black
Statistics Analysis of Variance.
Post Hoc Tests on One-Way ANOVA
Econ 3790: Business and Economic Statistics
Chapter 14: Analysis of Variance One-way ANOVA Lecture 8
Chapter 11: Introduction to Hypothesis Testing Lecture 5b
Ch10 Analysis of Variance.
Chapter 13 Group Differences
Chapter 10 – Part II Analysis of Variance
Presentation transcript:

Chapter 14: Analysis of Variance One-way ANOVA Lecture 9a Instructor: Naveen Abedin Date: 24 th November 2015

Introduction So far, we have seen how the t-test for “difference in two population means” can be conducted to make inferences about comparing two populations. If we are interested in making statistical inferences about two or more populations, we can use a technique called the Analysis of Variance (ANOVA)

Introduction (cont.) The Analysis of Variance technique determines whether differences exist between population means. The procedure however works by analyzing the sample variances. The procedure was first applied in an experiment to determine whether different treatments of fertilizers produced different crop yields. So this procedure is designed primarily to determine whether there are significant differences between treatment means. Therefore, ANOVA analyzes the variance of sample data to determine whether we can infer that population means differ. When ANOVA is applied to Independent Samples, it is called One-Way Analysis of Variance.

Set up Consider k populations from which samples have been independently collected: Let us call the collection of this population j (j = 1, 2, 3…..k). This makes the collection of population parameters and. For each sample we can then calculate and

Example 1 The following example has been taken from pg 520 of the textbook: Do ownership of stocks vary by age? A financial analyst wanting to test this theory gathered a sample of 366 American households, and recorded the age of the head of the household, and asked them what proportion of its financial assets are invested in stock markets. The analyst then split the record according to four age categories. These age categories are referred to as different treatments.

Example 1: STEP 1 Step 1: Set up the test design Here we are dealing with four populations, so we have four parameters: The null hypothesis represents the scenario where there are no differences between population means: The ANOVA determines whether there is enough statistical evidence to show that the null hypothesis is false, so the alternative hypothesis will always be:

Example 1: STEP 2 Step 2: Calculate the test statistic

Example 1: STEP 2 (cont.) For this study, the different components are: i.Response variable: percentage of assets invested in stocks (i.e. what we are observing) ii.Experimental units: heads of households (the subject interviewed in our samples) iii.Factor: age. There are 4 factor levels (the variable used to disaggregate our data)

Example 1: STEP 2 (Part 1) If the null hypothesis is true, the population means would all be equal, and so we would expect the sample means to be close to one another. If the alternative hypothesis is true, we would expect to observe large differences between sample means. The statistic that measures the proximity (closeness) of the sample means to each other is called the between-treatments variation. It is calculated using SST – Sum of Squares for Treatments.

Example 1: STEP 2 (Part 1 cont.) If the sample means are close to each other, then all of the sample means would be close to the grand mean, and as a result SST would be small. This goes by the intuition that if the sample means were all equal to one another, then SST = 0. A small value of SST would imply that the null hypothesis is probably true.

Example 1: STEP 2 (Part 1 cont.) If large differences exist between the sample means, then at least some sample means differ considerably from the grand mean producing a large SST. And if SST is significantly large, then we can reject the null hypothesis in favor of alternative hypothesis. Thus the question is, what deems SST to be a significantly large figure.

Example 1: STEP 2 (Part 2) So the question we are now facing is how large does SST have to be for us to justify rejecting the null hypothesis? To answer this, we need to know how much variation exists within each sample, i.e. how much variation exists in the percentage of assets owned within each sample. This is measured by the within-treatments variation, denoted by SSE – Sum of Squares for Error. SSE provides a measure of the amount of variation in the response variable that is not caused by the treatments, e.g. other factors besides the age of household head that may influence the amount of stocks purchased, such as income, occupation, family size etc. All of these other sources of variation are lumped together under a single category called error. This source of variation is measured by the Sum of Squares Error.

Example 1: STEP 2 (Part 2 cont.)

Example 1: STEP 2 (Part 3) The next step is to calculate the mean squares: 1.The Mean Square for Treatments is computed by dividing SST by the number of treatments (k) minus 1. 2.The Mean Square for Error is determined by dividing SSE by the total sample size (n) minus the number of treatments (k)

Example 1: STEP 2 (Part 4) Finally, the test statistic can now be calculated: The test statistic follows the F-distribution with v1 = k – 1 and v2 = n – k degrees of freedom, where k is the number of treatments and n is the number of observation in total in the entire study.

Example 1: STEP 3 and 4 Step 3: Calculate the Rejection Region The F-statistic is calculated to determine whether the value of SST is large enough to reject the null hypothesis. SST and F statistic are positively related, so a large enough test statistic will ensure if there is sufficient evidence present to reject the null hypothesis The null hypothesis is rejected if the test statistic, F is higher than F α, k – 1, n – k. Step 4: The Decision: We have sufficient evidence to reject the null hypothesis as F = 2.79

ANOVA table The results of the analysis of variance are usually reported in an analysis of variance (ANOVA) table.

ANOVA table (cont.) For Example 1, the ANOVA table is: SS (Total) = SST + SSE SS (Total) / n – 1 = sample variance

Summary The analysis of variance tests to determine whether there is evidence of differences between two or more population means. (The t-test done in Chapter 13 can only be used to compare two population means only). SST: This measures the variation attributed to the differences between the treatment means (variation of each sample mean to the grand mean). If this is statistically significantly high, then we can reject the null hypothesis. SSE: This measures the variation within samples (differences attributed to other causes besides the factor). It represents the variations that is unexplained by the different treatments.