Multivariate Analysis of Variance, Part 1 BMTRY 726.

Slides:



Advertisements
Similar presentations
Agenda of Week V Review of Week IV Inference on MV Mean Vector One population Two populations Multi-populations: MANOVA.
Advertisements

A. The Basic Principle We consider the multivariate extension of multiple linear regression – modeling the relationship between m responses Y 1,…,Y m and.
BPS - 5th Ed. Chapter 241 One-Way Analysis of Variance: Comparing Several Means.
Chapter 4 Randomized Blocks, Latin Squares, and Related Designs
Multivariate Analysis of Variance, Part 2 BMTRY 726 2/21/14.
Sampling: Final and Initial Sample Size Determination
Multiple regression analysis
Analysis of variance (ANOVA)-the General Linear Model (GLM)
Chapter Seventeen HYPOTHESIS TESTING
Part I – MULTIVARIATE ANALYSIS
ANOVA Determining Which Means Differ in Single Factor Models Determining Which Means Differ in Single Factor Models.
Lesson #23 Analysis of Variance. In Analysis of Variance (ANOVA), we have: H 0 :  1 =  2 =  3 = … =  k H 1 : at least one  i does not equal the others.
ANalysis Of VAriance (ANOVA) Comparing > 2 means Frequently applied to experimental data Why not do multiple t-tests? If you want to test H 0 : m 1 = m.
Intro to Statistics for the Behavioral Sciences PSYC 1900
Chapter 2 Simple Comparative Experiments
Chapter 26: Comparing Counts. To analyze categorical data, we construct two-way tables and examine the counts of percents of the explanatory and response.
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
Inferences About Process Quality
1 Inference About a Population Variance Sometimes we are interested in making inference about the variability of processes. Examples: –Investors use variance.
Analysis of Variance & Multivariate Analysis of Variance
5-3 Inference on the Means of Two Populations, Variances Unknown
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
One-Factor Experiments Andy Wang CIS 5930 Computer Systems Performance Analysis.
One-Way Manova For an expository presentation of multivariate analysis of variance (MANOVA). See the following paper, which addresses several questions:
Multivariate Analysis of Variance (MANOVA). Outline Purpose and logic : page 3 Purpose and logic : page 3 Hypothesis testing : page 6 Hypothesis testing.
1 Tests with two+ groups We have examined tests of means for a single group, and for a difference if we have a matched sample (as in husbands and wives)
PROBABILITY & STATISTICAL INFERENCE LECTURE 6 MSc in Computing (Data Analytics)
Some matrix stuff.
Chapter 10 Comparing Two Means Target Goal: I can use two-sample t procedures to compare two means. 10.2a h.w: pg. 626: 29 – 32, pg. 652: 35, 37, 57.
MARE 250 Dr. Jason Turner Multiway, Multivariate, Covariate, ANOVA.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Testing Multiple Means and the Analysis of Variance (§8.1, 8.2, 8.6) Situations where comparing more than two means is important. The approach to testing.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Ch9. Inferences Concerning Proportions. Outline Estimation of Proportions Hypothesis concerning one Proportion Hypothesis concerning several proportions.
MANOVA Multivariate Analysis of Variance. One way Analysis of Variance (ANOVA) Comparing k Populations.
11 Comparison of Several Multivariate Means Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute.
MANOVA Mechanics. MANOVA is a multivariate generalization of ANOVA, so there are analogous parts to the simpler ANOVA equations First lets revisit Anova.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y is.
Education 793 Class Notes Presentation 10 Chi-Square Tests and One-Way ANOVA.
The Completely Randomized Design (§8.3)
Statistical Analysis of Data1 of 38 1 of 42 Department of Cognitive Science Adv. Experimental Methods & Statistics PSYC 4310 / COGS 6310 MANOVA Multivariate.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Data Analysis for Two-Way Tables. The Basics Two-way table of counts Organizes data about 2 categorical variables Row variables run across the table Column.
General Linear Model 2 Intro to ANOVA.
Analysis of Variance (One Factor). ANOVA Analysis of Variance Tests whether differences exist among population means categorized by only one factor or.
CHI SQUARE TESTS.
Regression Analysis Part C Confidence Intervals and Hypothesis Testing
Principles of Biostatistics ANOVA. DietWeight Gain (grams) Standard910 8 Junk Food Organic Table shows weight gains for mice on 3 diets.
Bootstrap Event Study Tests Peter Westfall ISQS Dept. Joint work with Scott Hein, Finance.
Multivariate Analysis of Variance
Experimental Statistics - week 3
Midterm. T/F (a) False—step function (b) False, F n (x)~Bin(n,F(x)) so Inverting and estimating the standard error we see that a factor of n -1/2 is missing.
Formula for Linear Regression y = bx + a Y variable plotted on vertical axis. X variable plotted on horizontal axis. Slope or the change in y for every.
Differences Among Groups
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Chi Square Test of Homogeneity. Are the different types of M&M’s distributed the same across the different colors? PlainPeanutPeanut Butter Crispy Brown7447.
Differences Among Group Means: Multifactorial Analysis of Variance
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Chapter 4. Inference about Process Quality
Data Analysis for Two-Way Tables
Inference About 2 or More Normal Populations, Part 1
Multivariate Analysis of Variance II
Multivariate Linear Regression
Chapter 24 Comparing Two Means.
Statistical Inference for the Mean: t-test
Presentation transcript:

Multivariate Analysis of Variance, Part 1 BMTRY 726

>2 Independent Samples What happens if we have more than two independent populations we want to compare?

Review ANOVA Given a random sample We rewrite  l as the overall mean plus the treatment effect of treatment l Thus we end up testing Using the model Grand mean ErrorTreatment effect Sample mean Est. treatment effect Residual

What we want to answer… is the contribution of the treatment large relative to our residuals? In ANOVA we do this using the sums of squares:

What we want to answer… is the contribution of the treatment large relative to our residuals? In ANOVA we do this using the sums of squares:

>2 Samples for MVN We can use one-way MANOVA to compare mean vectors for our g populations. Assumptions: (1) Independent random samples from g groups (2) Homogeneity of covariance matrices (3) Normality Summary Statistics:

One-way MANOVA Pooled estimate of the common covariance matrix: One-way MANOVA model:

One-way MANOVA Compute summary statistics for each sample SAS constraint: Vector of obs for j th unit Selected from l th population Mean vector for the l th population  l =  +  l Vector of random errors that are ~NID (0,  )

One-way MANOVA We can write the multivariate version of the sums of squares now… So if we want to test if all treatment effect are equal: Total SS and cross products Within (resid) SS and cross products Between (trt) SS and cross products

One-way MANOVA We reject out null hypothesis of Wilk’s lambda is too small. Under certain conditions, the exact distribution of lambda is known: -see table 6.3 on page 303 For other cases and when n is large, we can use the Bartlett modification

Distribution of  * Sampling distribution for MVN data No. of No. of Sampling distribution for MVN data variables groups

Example: Infant Right Ventricular Failure An investigator examines the at the effect of giving H 2 S to rabbits with banded right ventricles on heart tissue “floppiness” and amount of oxidized cystine residues in mitochondria collected from heart tissue. There are three groups of animals (1) Banded and untreated (2) Banded but dosed with H 2 S (3) Banded and over-express CES to produce H 2 S There are 3 rabbits in each group.

Example

Confidence Intervals If we reject the null hypothesis, we want to determine which treatments are different. In this case, the Bonferroni approach applies We use this to develop an expression for the CI difference between treatment k and l

Example Estimate the 95% CIs for each treatment difference

Example Estimate the 95% CIs for each treatment difference

MANOVA as Linear Model If we think about is as a linear model…

Model Features (1)Each row of the error matrix (or each row of X ) -is independent of any other row -has covariance matrix  (2)Each column of X -corresponds to a different trait -has a model for the mean of the same form -has a corresponding column of parameters in the design matrix

MANOVA MLE’s Maximum likelihood estimates of parameters in  : For the PROC GLM version of the one-way MANOVA

The j th column of contains parameter estimates for the model fit to the data for the j th trait:

Kronecker product:

Inference We can make certain inferences about the elements of our parameter matrix Generalized LRT procedure (Wilks, 1932) Make comparison across groups (rows of  ) Make comparisons across traits Matrix of zeros

Compute the corrected sums of squares and cross- products matrix for the model: (1)This reduces to: (2)The diagonal elements of H are model sums of squares when M = I :

Compute a matrix of residual sums of squares and cross-products: When M = I : Diagonal elements of E are sums of the squared residuals Off-diagonal elements are sums of cross-products of residuals

Likelihood Ratio Test Reject the null hypothesis if Wilk’s criterion is too small Large sample chi-square approximation, reject H 0 if

A more accurate approximation given by Rao (C.R. Rao (1951) Bull Int Stat Inst. 33(2), ) When is true: where

Exact distribution of Wilk’s criterion: (1) Independence (2) Homogeneity of covariance matrices (3) Multivariate normality When is true: When either These are the cases in Table 6.3 on page 303 In J & W

Other Criteria Wilk’s Lambda: Lawely-Hotelling Trace Pillai Trace Roy’s Maximum Root:

Example Recall our example right ventricular failure example…

Example Our estimate of  … Say we want to test the hypothesis of equal treatment effects:

Example We have to find E and H …

Example We have to find E and H …

Example We can use E and H to find lambda… Does this match what we calculated using our MANOVA table? NOTE: If p and g match conditions in table 6.2, we can compare this to the F distribution. Otherwise we can use the  2 approximation if n is large.