Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.1 Lecture 6: Single-classification multivariate ANOVA (k-group.

Slides:



Advertisements
Similar presentations
Week 2 – PART III POST-HOC TESTS. POST HOC TESTS When we get a significant F test result in an ANOVA test for a main effect of a factor with more than.
Advertisements

A. The Basic Principle We consider the multivariate extension of multiple linear regression – modeling the relationship between m responses Y 1,…,Y m and.
Lecture 3: A brief background to multivariate statistics
Analysis of Variance (ANOVA) ANOVA methods are widely used for comparing 2 or more population means from populations that are approximately normal in distribution.
Hypothesis Testing Steps in Hypothesis Testing:
Generalized Linear Models (GLM)
Lecture 7: Principal component analysis (PCA)
Université d’Ottawa / University of Ottawa 2001 Bio 4118 Applied Biostatistics L10.1 CorrelationCorrelation The underlying principle of correlation analysis.
ANOVA notes NR 245 Austin Troy
Lecture 10 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
ANOVA: ANalysis Of VAriance. In the general linear model x = μ + σ 2 (Age) + σ 2 (Genotype) + σ 2 (Measurement) + σ 2 (Condition) + σ 2 (ε) Each of the.
Part I – MULTIVARIATE ANALYSIS
MARE 250 Dr. Jason Turner Hypothesis Testing II. To ASSUME is to make an… Four assumptions for t-test hypothesis testing:
Analysis of Variance: Inferences about 2 or More Means
ANalysis Of VAriance (ANOVA) Comparing > 2 means Frequently applied to experimental data Why not do multiple t-tests? If you want to test H 0 : m 1 = m.
Chapter Topics The Completely Randomized Model: One-Factor Analysis of Variance F-Test for Difference in c Means The Tukey-Kramer Procedure ANOVA Assumptions.
Chapter 3 Analysis of Variance
Chapter 3 Experiments with a Single Factor: The Analysis of Variance
Final Review Session.
Chapter 14 Conducting & Reading Research Baumgartner et al Chapter 14 Inferential Data Analysis.
Lecture 9: One Way ANOVA Between Subjects
One-way Between Groups Analysis of Variance
Lecture 12 One-way Analysis of Variance (Chapter 15.2)
Analysis of Variance & Multivariate Analysis of Variance
Chapter 9 - Lecture 2 Computing the analysis of variance for simple experiments (single factor, unrelated groups experiments).
Statistical Methods in Computer Science Hypothesis Testing II: Single-Factor Experiments Ido Dagan.
Chapter 14 Inferential Data Analysis
Chi-Square Tests and the F-Distribution
Multivariate Analysis of Variance, Part 1 BMTRY 726.
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
PS 225 Lecture 15 Analysis of Variance ANOVA Tables.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 21/09/2015 7:46 PM 1 Two-sample comparisons Underlying principles.
Which Test Do I Use? Statistics for Two Group Experiments The Chi Square Test The t Test Analyzing Multiple Groups and Factorial Experiments Analysis of.
© Copyright McGraw-Hill CHAPTER 12 Analysis of Variance (ANOVA)
Correlation and Regression Used when we are interested in the relationship between two variables. NOT the differences between means or medians of different.
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
Between-Groups ANOVA Chapter 12. >When to use an F distribution Working with more than two samples >ANOVA Used with two or more nominal independent variables.
Chapter 4 analysis of variance (ANOVA). Section 1 the basic idea and condition of application.
Copyright © 2004 Pearson Education, Inc.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 23/10/2015 9:22 PM 1 Two-sample comparisons Underlying principles.
Statistical Analysis of Data1 of 38 1 of 42 Department of Cognitive Science Adv. Experimental Methods & Statistics PSYC 4310 / COGS 6310 MANOVA Multivariate.
Université d’Ottawa / University of Ottawa 2001 Bio 4118 Applied Biostatistics L9.1 Lecture 9: Multiple-classification ANOVA What is it, and why use it?
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
One-way ANOVA Motivating Example Analysis of Variance Model & Assumptions Data Estimates of the Model Analysis of Variance Multiple Comparisons Checking.
ANOVA: Analysis of Variance.
Adjusted from slides attributed to Andrew Ainsworth
Previous Lecture: Phylogenetics. Analysis of Variance This Lecture Judy Zhong Ph.D.
Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Université d’Ottawa / University of Ottawa 2001 Bio 8100s Applied Multivariate Biostatistics L10.1 Lecture 10: Cluster analysis l Uses of cluster analysis.
DTC Quantitative Methods Bivariate Analysis: t-tests and Analysis of Variance (ANOVA) Thursday 14 th February 2013.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Introduction to ANOVA Research Designs for ANOVAs Type I Error and Multiple Hypothesis Tests The Logic of ANOVA ANOVA vocabulary, notation, and formulas.
University of Ottawa - Bio 4158 – Applied Biostatistics © Antoine Morin and Scott Findlay 20/02/ :23 PM 1 Multiple comparisons What are multiple.
Université d’Ottawa / University of Ottawa 2001 Bio 4118 Applied Biostatistics L14.1 Lecture 14: Contingency tables and log-linear models Appropriate questions.
Université d’Ottawa / University of Ottawa 2001 Bio 8100s Applied Multivariate Biostatistics L11.1 Lecture 11: Canonical correlation analysis (CANCOR)
MANOVA Lecture 12 Nuance stuff Psy 524 Andrew Ainsworth.
Université d’Ottawa / University of Ottawa 1999 Bio 8100s Multivariate biostatistics L5.1 Two sample comparisons l Univariate 2-sample comparisons l The.
Differences Among Groups
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 4 Investigating the Difference in Scores.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
1 Chapter 5.8 What if We Have More Than Two Samples?
Université d’Ottawa / University of Ottawa 2001 Bio 4118 Applied Biostatistics L12.1 Lecture 12: Generalized Linear Models (GLM) What are they? When do.
University of Ottawa - Bio 4158 – Applied Biostatistics © Antoine Morin and Scott Findlay 06/07/2016 6:16 AM 1 Single classification analysis of variance.
Université d’Ottawa / University of Ottawa 2003 Bio 8102A Applied Multivariate Biostatistics L4.1 Lecture 4: Multivariate distance measures l The concept.
Comparing Several Means: ANOVA
Elements of a statistical test Statistical null hypotheses
Lecture 7: Single classification analysis of variance (ANOVA)
Presentation transcript:

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.1 Lecture 6: Single-classification multivariate ANOVA (k-group MANOVA) l Rationale and underlying principles l Univariate ANOVA l Multivariate ANOVA (MANOVA): principles and procedures l Rationale and underlying principles l Univariate ANOVA l Multivariate ANOVA (MANOVA): principles and procedures l MANOVA test statistics l MANOVA assumptions l Planned and unplanned comparisons

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.2 When to use ANOVA l Tests for effect of “discrete” independent variables. l Each independent variable is called a factor, and each factor may have two or more levels or treatments (e.g. crop yields with nitrogen (N) or nitrogen and phosphorous (N + P) added). l ANOVA tests whether all group means are the same. l Use when number of levels (groups) is greater than two. l Tests for effect of “discrete” independent variables. l Each independent variable is called a factor, and each factor may have two or more levels or treatments (e.g. crop yields with nitrogen (N) or nitrogen and phosphorous (N + P) added). l ANOVA tests whether all group means are the same. l Use when number of levels (groups) is greater than two. Control Experimental (N) Experimental (N+P) Frequency Yield CC NN  N+P

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.3 Why not use multiple 2-sample tests? For k comparisons, the probability of accepting a true H 0 for all k is (1 -  ) k. For 4 means, (1 -  ) k = (0.95) 6 =.735. So  (for all comparisons) = l So, when comparing the means of four samples from the same population, we would expect to detect significant differences among at least one pair 27% of the time. For k comparisons, the probability of accepting a true H 0 for all k is (1 -  ) k. For 4 means, (1 -  ) k = (0.95) 6 =.735. So  (for all comparisons) = l So, when comparing the means of four samples from the same population, we would expect to detect significant differences among at least one pair 27% of the time. Yield CC NN  N+P Control Experimental (N) Experimental (N+P) c:Nc:N  N :  N+P  C :  N+P Frequency

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.4 What ANOVA does/doesn’t do Tells us whether all group means are equal (at a specified  level)... l...but if we reject H 0, the ANOVA does not tell us which pairs of means are different from one another. Tells us whether all group means are equal (at a specified  level)... l...but if we reject H 0, the ANOVA does not tell us which pairs of means are different from one another. Control Experimental (N) Experimental (N+ P) Yield Frequency CC NN  N+P Frequency CC NN  N+P

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.5 Model I ANOVA: effects of temperature on trout growth l 3 treatments determined (set) by investigator. Dependent variable is growth rate ( ), factor (X) is temperature. Since X is controlled, we can estimate the effect of a unit increase in X (temperature) on  the effect size ... … and can predict  at other temperatures. l 3 treatments determined (set) by investigator. Dependent variable is growth rate ( ), factor (X) is temperature. Since X is controlled, we can estimate the effect of a unit increase in X (temperature) on  the effect size ... … and can predict  at other temperatures. Water temperature (°C) Growth rate (cm/day)

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.6 Model II ANOVA: geographical variation in body size of black bears l 3 locations (groups) sampled from set of possible locations. l Dependent variable is body size, factor (X) is location. Even if locations differ, we have no idea what factors are controlling this variability... …so we cannot predict  body size  at other locations. l 3 locations (groups) sampled from set of possible locations. l Dependent variable is body size, factor (X) is location. Even if locations differ, we have no idea what factors are controlling this variability... …so we cannot predict  body size  at other locations. Body size (kg) Riding Mountain Kluane Algonquin

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.7 Model differences l In Model I, the putative causal factor(s) can be manipulated by the experimenter, whereas in Model II they cannot. l In Model I, we can estimate the magnitude of treatment effects and make predictions, whereas in Model II we can do neither. l In one-way (single classification) ANOVA, calculations are identical for both models… l …but this is NOT so for multiple classification ANOVA! l In Model I, the putative causal factor(s) can be manipulated by the experimenter, whereas in Model II they cannot. l In Model I, we can estimate the magnitude of treatment effects and make predictions, whereas in Model II we can do neither. l In one-way (single classification) ANOVA, calculations are identical for both models… l …but this is NOT so for multiple classification ANOVA!

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.8 How is it done? And why call it ANOVA? l In ANOVA, the total variance in the dependent variable is partitioned into two components: n among-groups: variance of means of different groups (treatments) n within-groups (error): variance of individual observations within groups around the mean of the group l In ANOVA, the total variance in the dependent variable is partitioned into two components: n among-groups: variance of means of different groups (treatments) n within-groups (error): variance of individual observations within groups around the mean of the group

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.9 The general ANOVA model l The general model is: ANOVA algorithms fit the above model (by least squares) to estimate the  i ’s. H 0 : all  i ’s = 0 l The general model is: ANOVA algorithms fit the above model (by least squares) to estimate the  i ’s. H 0 : all  i ’s = 0 Group 1 Group 2 Group 3 Group Y 22 22   42 Y              

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.10 Partitioning the total sums of squares Group 1 Group 2 Group 3 Y  Total SSModel (Groups) SSError SS   

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.11 The ANOVA table Source of Variation Sum of Squares Mean Square Degrees of freedom (df) F Total Error n - 1 n - k SS/df Groupsk - 1SS/df MS groups MS error i1 k ij j1 n 2 ( Y Y ) i    i i i k n Y Y()    1 2 i1 k i j1 n 2 ( Y YiYi ) i    j

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.12 Use of single-classification MANOVA l Data set consists of k groups (“treatments”), with n i observations per group, and p variables per observation. l Question: do the groups differ with respect to their multivariate means? l Data set consists of k groups (“treatments”), with n i observations per group, and p variables per observation. l Question: do the groups differ with respect to their multivariate means? l In single-classification ANOVA, we assume that a single factor is variable among groups, i.e., that all other factors which may possible affect the variables in question are randomized among groups.

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.13 ExamplesExamples l 4 different concentrations of some suspected contaminant; 10 young fish randomly assigned to each treatment; at age 2 months, a number of measurements taken on each surviving fish. l 10 young fish reared in 4 different “treatments”, each treatment consisting of water samples taken at different stages of treatment in a water treatment plant. Good(ish) Bad(ish)

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.14 Multivariate variance: a geometric interpretation l Univariate variance is a measure of the “volume” occupied by sample points in one dimension. l Multivariate variance involving m variables is the volume occupied by sample points in an m -dimensional space. l Univariate variance is a measure of the “volume” occupied by sample points in one dimension. l Multivariate variance involving m variables is the volume occupied by sample points in an m -dimensional space. X X Larger variance Smaller variance X1X1 X2X2 Occupied volume

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.15 Multivariate variance: effects of correlations among variables l Correlations between pairs of variables reduce the volume occupied by sample points… l …and hence, reduce the multivariate variance. l Correlations between pairs of variables reduce the volume occupied by sample points… l …and hence, reduce the multivariate variance. No correlation X1X1 X2X2 X2X2 X1X1 Positive correlation Negative correlation Occupied volume

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.16 C and the generalized multivariate variance l The determinant of the sample covariance matrix C is a generalized multivariate variance… l … because area 2 of a parallelogram with sides given by the individual standard deviations and angle determined by the correlation between variables equals the determinant of C. l The determinant of the sample covariance matrix C is a generalized multivariate variance… l … because area 2 of a parallelogram with sides given by the individual standard deviations and angle determined by the correlation between variables equals the determinant of C.

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.17 ANOVA vs MANOVA: procedure l In ANOVA, the total sums of squares is partitioned into a within-groups (SS w ) and between-group SS b sums of squares: l In MANOVA, the total sums of squares and cross-products (SSCP) matrix is partitioned into a within groups SSCP (W) and a between-groups SSCP (B)

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.18 l In ANOVA, the null hypothesis is: l This is tested by means of the F statistic: l In ANOVA, the null hypothesis is: l This is tested by means of the F statistic: ANOVA vs MANOVA: hypothesis testing l In MANOVA, the null hypothesis is l This is tested by (among other things) Wilk’s lambda:

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.19 SSCP matrices: within, between, and total l The total (T) SSCP matrix (based on p variables X 1, X 2,…, X p ) in a sample of objects belonging to m groups G 1, G 2,…, G m with sizes n 1, n 2,…, n m can be partitioned into within- groups (W) and between- groups (B) SSCP matrices: Value of variable X k for ith observation in group j Mean of variable X k for group j Overall mean of variable X k Element in row r and column c of total (T, t) and within (W, w) SSCP

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.20 The distribution of  Unlike F,  has a very complicated distribution… …but, given certain assumptions it can be approximated b as Bartlett’s  2 (for moderate to large samples) or Rao’s F (for small samples) Unlike F,  has a very complicated distribution… …but, given certain assumptions it can be approximated b as Bartlett’s  2 (for moderate to large samples) or Rao’s F (for small samples)

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.21 AssumptionsAssumptions l All observations are independent (residuals are uncorrelated) l Within each sample (group), variables (residuals) are multivariate normally distributed l Each sample (group) has the same covariance matrix (compound symmetry) l All observations are independent (residuals are uncorrelated) l Within each sample (group), variables (residuals) are multivariate normally distributed l Each sample (group) has the same covariance matrix (compound symmetry)

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.22 Effect of violation of assumptions Assumption Effect on  Effect on power Independence of observations Very large, actual  much larger than nominal  Large, power much reduced NormalitySmall to negligible Reduced power for platykurtotic distributions, skewness has little effect Equality of covariance matrices Small to negligible if group Ns similar, if Ns very unequal, actual  larger than nominal  Power reduced, reduction greater for unequal Ns.

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.23 Checking assumptions in MANOVA Independence (intraclass correlation, ACF) Use group means as unit of analysis Assess MV normality Check group sizes MVN graph test Check Univariate normality No Yes N i > 20 N i < 20

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.24 Checking assumptions in MANOVA (cont’d) MV normal? Check homogeneity of covariance matrices Most variables normal? Transform offending variables Group sizes more or less equal (R < 1.5)? Groups reasonably large (> 15)? Yes No Yes No END Yes No Transform variables, or adjust 

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.25 Then what? QuestionProcedure What variables are responsible for detected differences among groups? Check univariate F tests as a guide; use another multivariate procedure (e.g. discriminant function analysis) Do certain groups (determined beforehand) differ from one another? Planned multiple comparisons Which pairs of groups differ from one another (groups not specified beforehand)? Unplanned multiple comparisons

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.26 What are multiple comparisons? l Pair-wise comparisons of different treatments l These comparisons may involve group means, medians, variances, etc. l for means, done after ANOVA l In all cases, H 0 is that the groups in question do not differ. l Pair-wise comparisons of different treatments l These comparisons may involve group means, medians, variances, etc. l for means, done after ANOVA l In all cases, H 0 is that the groups in question do not differ. Yield CC NN  N+P Control Experimental (N) Experimental (N+P) c:Nc:N  N :  N+P  C :  N+P Frequency

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.27 Types of comparisons l planned (a priori): independent of ANOVA results; theory predicts which treatments should be different. l unplanned (a posteriori): depend on ANOVA results; unclear which treatments should be different. l Test of significance are very different between the two! l planned (a priori): independent of ANOVA results; theory predicts which treatments should be different. l unplanned (a posteriori): depend on ANOVA results; unclear which treatments should be different. l Test of significance are very different between the two! Y Y X1X1 X2X2 X3X3 X4X4 X5X5 X1X1 X2X2 X3X3 X4X4 X5X5 Planned unplanned

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.28 Planned comparisons (a priori contrasts): catecholamine levels in stressed fish l Comparisons of interest are determined by experimenter beforehand based on theory and do not depend on ANOVA results. l Prediction from theory: catecholamine levels increase above basal levels only after threshold PA O2 = 30 torr is reached. l So, compare only treatments above and below 30 torr (N T = 12). l Comparisons of interest are determined by experimenter beforehand based on theory and do not depend on ANOVA results. l Prediction from theory: catecholamine levels increase above basal levels only after threshold PA O2 = 30 torr is reached. l So, compare only treatments above and below 30 torr (N T = 12). Predicted threshold PA O 2 (torr) [Catecholamine]

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.29 Unplanned comparisons (a posteriori contrasts): catecholamine levels in stressed fish l Comparisons are determined by ANOVA results. l Prediction from theory: catecholamine levels increase with increasing PA O2. l So, comparisons between any pairs of treatments may be warranted (N T = 21). l Comparisons are determined by ANOVA results. l Prediction from theory: catecholamine levels increase with increasing PA O2. l So, comparisons between any pairs of treatments may be warranted (N T = 21). Predicted relationship PA O 2 (torr) [Catecholamine]

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.30 The problem: controlling experiment-wise  error For k comparisons, the probability of accepting H 0 (no difference) is (1 -  ) k. For 4 treatments, (1 -  ) k = (0.95) 6 =.735, so experiment-wise  (  e ) = l Thus we would expect to reject H 0 for at least one paired comparison about 27% of the time, even if all four treatments are identical. For k comparisons, the probability of accepting H 0 (no difference) is (1 -  ) k. For 4 treatments, (1 -  ) k = (0.95) 6 =.735, so experiment-wise  (  e ) = l Thus we would expect to reject H 0 for at least one paired comparison about 27% of the time, even if all four treatments are identical. Nominal  =.05 Number of treatments Experiment-wise  (  e )

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.31 Unplanned comparisons: Hotelling T 2 and univariate F tests l Follow rejection of null in original MANOVA by all pairwise multivariate tests using Hotelling T 2 to determine which groups are different …but test at modified  to maintain overall nominal type I error rate (e.g. Bonferroni correction) l Follow rejection of null in original MANOVA by all pairwise multivariate tests using Hotelling T 2 to determine which groups are different …but test at modified  to maintain overall nominal type I error rate (e.g. Bonferroni correction) l Then use univariate t- tests to determine which variables are contributing to the detected pairwise differences… …opinion is divided as to whether these should be done at a modified . l Then use univariate t- tests to determine which variables are contributing to the detected pairwise differences… …opinion is divided as to whether these should be done at a modified .

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.32 How many different variables for a MANOVA? l In general, try to use a small number of variables because: l In MANOVA, power generally declines with increasing number of variables. l If a number of variables are included that do not differ among groups, this will obscure differences on a few variables l In general, try to use a small number of variables because: l In MANOVA, power generally declines with increasing number of variables. l If a number of variables are included that do not differ among groups, this will obscure differences on a few variables l Measurement error is multiplicative among variables: the larger the number of variables, the larger the measurement noise l Interpretation is easier with a smaller number of variables l Measurement error is multiplicative among variables: the larger the number of variables, the larger the measurement noise l Interpretation is easier with a smaller number of variables

Université d’Ottawa / University of Ottawa 2001 Bio 8100s Allied Multivariate Biostatistics L6.33 How many different variables for a MANOVA : recommendation l Choose variables carefully, attempting to keep them to a minimum l Try to reduce the number of variables by using multivariate procedures (e.g. PCA) to generate composite, uncorrelated variables which can then be used as input. l Use multivariate procedures (such as discriminant function analysis) to “optimize” set of variables. l Choose variables carefully, attempting to keep them to a minimum l Try to reduce the number of variables by using multivariate procedures (e.g. PCA) to generate composite, uncorrelated variables which can then be used as input. l Use multivariate procedures (such as discriminant function analysis) to “optimize” set of variables.