The Analysis of Variance

Slides:



Advertisements
Similar presentations
Multiple-choice question
Advertisements

Analysis of Variance (ANOVA) ANOVA methods are widely used for comparing 2 or more population means from populations that are approximately normal in distribution.
Hypothesis Testing Steps in Hypothesis Testing:
Inference for Regression
Analysis of Variance (ANOVA) ANOVA can be used to test for the equality of three or more population means We want to use the sample results to test the.
Linear regression models
Design of Experiments and Analysis of Variance
Classical Regression III
1 1 Slide © 2009, Econ-2030 Applied Statistics-Dr Tadesse Chapter 10: Comparisons Involving Means n Introduction to Analysis of Variance n Analysis of.
Independent Sample T-test Formula
The Statistical Analysis Partitions the total variation in the data into components associated with sources of variation –For a Completely Randomized Design.
Analysis of Variance: Inferences about 2 or More Means
Lesson #23 Analysis of Variance. In Analysis of Variance (ANOVA), we have: H 0 :  1 =  2 =  3 = … =  k H 1 : at least one  i does not equal the others.
ANalysis Of VAriance (ANOVA) Comparing > 2 means Frequently applied to experimental data Why not do multiple t-tests? If you want to test H 0 : m 1 = m.
Lesson #32 Simple Linear Regression. Regression is used to model and/or predict a variable; called the dependent variable, Y; based on one or more independent.
PSY 307 – Statistics for the Behavioral Sciences
Analysis of Variance Chapter 3Design & Analysis of Experiments 7E 2009 Montgomery 1.
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
QNT 531 Advanced Problems in Statistics and Research Methods
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 Tests with two+ groups We have examined tests of means for a single group, and for a difference if we have a matched sample (as in husbands and wives)
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 13 Experimental Design and Analysis of Variance nIntroduction to Experimental Design.
PSY 307 – Statistics for the Behavioral Sciences Chapter 16 – One-Factor Analysis of Variance (ANOVA)
Chapter 10 Analysis of Variance.
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
Copyright © 2004 Pearson Education, Inc.
Testing Hypotheses about Differences among Several Means.
Simple Linear Regression ANOVA for regression (10.2)
1 G Lect 11a G Lecture 11a Example: Comparing variances ANOVA table ANOVA linear model ANOVA assumptions Data transformations Effect sizes.
Previous Lecture: Phylogenetics. Analysis of Variance This Lecture Judy Zhong Ph.D.
STA 286 week 131 Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression.
VI. Regression Analysis A. Simple Linear Regression 1. Scatter Plots Regression analysis is best taught via an example. Pencil lead is a ceramic material.
1 ANALYSIS OF VARIANCE (ANOVA) Heibatollah Baghi, and Mastee Badii.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
IE241: Introduction to Design of Experiments. Last term we talked about testing the difference between two independent means. For means from a normal.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
Formula for Linear Regression y = bx + a Y variable plotted on vertical axis. X variable plotted on horizontal axis. Slope or the change in y for every.
1/54 Statistics Analysis of Variance. 2/54 Statistics in practice Introduction to Analysis of Variance Analysis of Variance: Testing for the Equality.
Copyright © 2008 by Nelson, a division of Thomson Canada Limited Chapter 18 Part 5 Analysis and Interpretation of Data DIFFERENCES BETWEEN GROUPS AND RELATIONSHIPS.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
1 Chapter 5.8 What if We Have More Than Two Samples?
The “Big Picture” (from Heath 1995). Simple Linear Regression.
Chapter 13 Analysis of Variance (ANOVA). ANOVA can be used to test for differences between three or more means. The hypotheses for an ANOVA are always:
Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc.
Chapter 10 Two-Sample Tests and One-Way ANOVA.
Week 2 – PART III POST-HOC TESTS.
Factorial Experiments
ANOVA Econ201 HSTS212.
CHAPTER 13 Design and Analysis of Single-Factor Experiments:
Multiple comparisons
i) Two way ANOVA without replication
Applied Business Statistics, 7th ed. by Ken Black
Statistical Data Analysis - Lecture10 26/03/03
Statistics Analysis of Variance.
Chapter 10: Analysis of Variance: Comparing More Than Two Means
Chapter 10 Two-Sample Tests and One-Way ANOVA.
Econ 3790: Business and Economic Statistics
CHAPTER 29: Multiple Regression*
Chapter 11 Analysis of Variance
Chapter 11: The ANalysis Of Variance (ANOVA)
One way ANALYSIS OF VARIANCE (ANOVA)
Chapter 15 Analysis of Variance
Chapter 10 – Part II Analysis of Variance
ANOVA: Analysis of Variance
Week ANOVA Four.
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

The Analysis of Variance

The Analysis of Variance (ANOVA) Fisher’s technique for partitioning the sum of squares More generally, ANOVA refers to a class of sampling or experimental designs with a continuous response variable and categorical predictor(s) Ronald Aylmer Fisher (1890-1962)

Goal The comparison of means among 2 or more groups that have been sampled randomly Both regression and ANOVA are special cases of a more generalized linear model

ANOVA & Partitioning the Sum of Squares Remember: total variation is the sum of the difference between each observation and the overall sample mean Using ANOVA, we can partition the sum of squares among the different components in the model (the treatments, the error term, etc.) Finally, we can use the results to test statistical hypotheses about the strength of particular effects

Symbols Y= measured response variable = grand mean (for all observations) = mean that is calculated for a particular subgroup (i) = a particular datum (the jth observation of the ith subgroup)

EXAMPLE: Effects of early snowmelt on alpine plant growth Three treatment groups (a = 3) and four replicate plots per treatment (n = 4): Unmanipulated Control: fitted with heating coils that are never activated Treatment: warmed with permanent solar-powered heating coils that melt spring snow pack earlier in the year than normal

Effects of early snowmelt on alpine plant growth After 3 years of treatment application, you measure the length of the flowering period, in weeks, for larkspur (Delphinium nuttallianum) in each plot

Data Unmanipulated Control Treatment 10 9 12 11 13 15 16

Partitioning of the sum of squares in a one-way ANOVA 9

SStotal= SSag + SSwg 41.66 = 22.16 + 19.50

The Assumptions of ANOVA The samples are randomly selected and independent of each other The variance within each group is approximately equal to the variance within all the other groups The residuals are normally distributed The samples are classified correctly The main effects are additive

Hypothesis tests with ANOVA If the assumptions are met (or not severely violated), we can test hypotheses based on an underlying model that is fit to the data. For the one way ANOVA, that model is:

The null hypothesis is If the null hypothesis is true, any variation that occurs among the treatment groups reflects random error and nothing else.

ANOVA table for one-way layout Source df Sum of squares Mean square Expected mean square F-ratio Among groups a-1 Within groups a(n-1) Total an-1 P-value = tail probability from an F-distribution with (a-1) and a(n-1) degrees of freedom

Partitioning of the sum of squares in a one-way ANOVA 15

ANOVA table for larkspur data Source df Sum of squares Mean square F-ratio P-value Among groups 2 22.16 11.08 5.11 0.033 Within groups 9 19.50 2.17 Total 11 41.67

Constructing F-ratios Use the mean squares associated with the particular ANOVA model that matches your sampling or experimental design. Find the expected mean square that includes the particular effect you are trying to measure and use it as the numerator of the F-ratio.

Constructing F-ratios (cont.’d) Find a second expected mean square that includes all of the statistical terms in the numerator except for the single term you are trying to estimate and use it as the denominator of the F-ratio. Divide the numerator by the denominator to get your F-ratio.

Constructing F-ratios (cont.’d) Using statistical tables or the output from statistical software, determine the P-value associated with the F-ratio. WARNING: The default settings used by many software packages will not generate the correct F-ratios for many common experimental designs. Repeat steps 2 through 5 for other factors that you are testing.

ANOVA as linear regression treatment data X1 X2 unmanipulated 10 12 13 control 9 1 11 Treatment 15 16

EXAMPLE X1 X2 Expected Unmanipulated 11.75 Control 1 10.75 Treatment 14.0 Coefficients Value Unmanipulated Intercept 11.75 Control -1 Treatment 2.25

Regression Source of variation SS df MS Regression p-1 Residual n-p Total n-1

ANOVA table Source df Sum of squares Mean square F-ratio P-value Regression 2 22.16 11.08 5.11 0.033 Residual 9 19.50 2.17 Total 11 41.67 23