Available at Chapter 13 Multivariate Analysis BCB 702: Biostatistics

Slides:



Advertisements
Similar presentations
Step three: statistical analyses to test biological hypotheses General protocol continued.
Advertisements

Multiple Analysis of Variance – MANOVA
An Introduction to Multivariate Analysis
 Population multiple regression model  Data for multiple regression  Multiple linear regression model  Confidence intervals and significance tests.
BPS - 5th Ed. Chapter 241 One-Way Analysis of Variance: Comparing Several Means.
Chapter 17 Overview of Multivariate Analysis Methods
1 Multivariate Statistics ESM 206, 5/17/05. 2 WHAT IS MULTIVARIATE STATISTICS? A collection of techniques to help us understand patterns in and make predictions.
Multivariate Methods Pattern Recognition and Hypothesis Testing.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 14 Using Multivariate Design and Analysis.
19-1 Chapter Nineteen MULTIVARIATE ANALYSIS: An Overview.
MANOVA LDF & MANOVA Geometric example of MANVOA & multivariate power MANOVA dimensionality Follow-up analyses if k > 2 Factorial MANOVA.
Multiple-group linear discriminant function maximum & contributing ldf dimensions concentrated & diffuse ldf structures follow-up analyses evaluating &
Clustering Ram Akella Lecture 6 February 23, & 280I University of California Berkeley Silicon Valley Center/SC.
Analysis of Variance & Multivariate Analysis of Variance
Slide 1 Testing Multivariate Assumptions The multivariate statistical techniques which we will cover in this class require one or more the following assumptions.
Multivariate Analysis Techniques
Multiple Linear Regression Response Variable: Y Explanatory Variables: X 1,...,X k Model (Extension of Simple Regression): E(Y) =  +  1 X 1 +  +  k.
Business Research Methods William G. Zikmund Chapter 24 Multivariate Analysis.
The Tutorial of Principal Component Analysis, Hierarchical Clustering, and Multidimensional Scaling Wenshan Wang.
One-Way Manova For an expository presentation of multivariate analysis of variance (MANOVA). See the following paper, which addresses several questions:
Multivariate Analysis of Variance (MANOVA). Outline Purpose and logic : page 3 Purpose and logic : page 3 Hypothesis testing : page 6 Hypothesis testing.
CHAPTER 26 Discriminant Analysis From: McCune, B. & J. B. Grace Analysis of Ecological Communities. MjM Software Design, Gleneden Beach, Oregon.
Canonical Correlation Analysis, Redundancy Analysis and Canonical Correspondence Analysis Hal Whitehead BIOL4062/5062.
Some matrix stuff.
بسم الله الرحمن الرحیم.. Multivariate Analysis of Variance.
1 Multivariate Analysis (Source: W.G Zikmund, B.J Babin, J.C Carr and M. Griffin, Business Research Methods, 8th Edition, U.S, South-Western Cengage Learning,
Chapter 24 Multivariate Statistical Analysis © 2010 South-Western/Cengage Learning. All rights reserved. May not be scanned, copied or duplicated, or posted.
Business Research Methods William G. Zikmund Chapter 24 Multivariate Analysis.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
MANOVA Mechanics. MANOVA is a multivariate generalization of ANOVA, so there are analogous parts to the simpler ANOVA equations First lets revisit Anova.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
Next Colin Clarke-Hill and Ismo Kuhanen 1 Analysing Quantitative Data 1 Forming the Hypothesis Inferential Methods - an overview Research Methods Analysing.
Discriminant Analysis Discriminant analysis is a technique for analyzing data when the criterion or dependent variable is categorical and the predictor.
C M Clarke-Hill1 Analysing Quantitative Data Forming the Hypothesis Inferential Methods - an overview Research Methods.
Descriptive Statistics vs. Factor Analysis Descriptive statistics will inform on the prevalence of a phenomenon, among a given population, captured by.
Statistical Analysis of Data1 of 38 1 of 42 Department of Cognitive Science Adv. Experimental Methods & Statistics PSYC 4310 / COGS 6310 MANOVA Multivariate.
Adjusted from slides attributed to Andrew Ainsworth
Factorial Design (Multifactor)
Principal Components Analysis. Principal Components Analysis (PCA) A multivariate technique with the central aim of reducing the dimensionality of a multivariate.
 Relationship between education level, income, and length of time out of school  Our new regression equation: is the predicted value of the dependent.
28. Multiple regression The Practice of Statistics in the Life Sciences Second Edition.
Module III Multivariate Analysis Techniques- Framework, Factor Analysis, Cluster Analysis and Conjoint Analysis Research Report.
MANOVA One-Way. MANOVA This is just a DFA in reverse. You predict a set of continuous variables from one or more grouping variables. Often used in an.
Principle Component Analysis and its use in MA clustering Lecture 12.
Copyright © 2011, 2005, 1998, 1993 by Mosby, Inc., an affiliate of Elsevier Inc. Chapter 19: Statistical Analysis for Experimental-Type Research.
Unit 7 Statistics: Multivariate Analysis of Variance (MANOVA) & Discriminant Functional Analysis (DFA) Chat until class starts.
Chapter 8 Relationships Among Variables. Outline What correlational research investigates Understanding the nature of correlation What the coefficient.
Principal Component Analysis
Canonical Correlation. Canonical correlation analysis (CCA) is a statistical technique that facilitates the study of interrelationships among sets of.
MANOVA Lecture 12 Nuance stuff Psy 524 Andrew Ainsworth.
Multivariate statistical methods. Multivariate methods multivariate dataset – group of n objects, m variables (as a rule n>m, if possible). confirmation.
Out of sample extension of PCA, Kernel PCA, and MDS WILSON A. FLORERO-SALINAS DAN LI MATH 285, FALL
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
FACTOR ANALYSIS CLUSTER ANALYSIS Analyzing complex multidimensional patterns.
Chapter 12 REGRESSION DIAGNOSTICS AND CANONICAL CORRELATION.
Stats Methods at IC Lecture 3: Regression.
Regression Analysis.
Analysis of Variance and Covariance
Differences Among Group Means: Multifactorial Analysis of Variance
12 Inferential Analysis.
Chapter 11: The ANalysis Of Variance (ANOVA)
Factorial Design (Multifactor)
Descriptive Statistics vs. Factor Analysis
Multivariate Statistics
12 Inferential Analysis.
Principal Component Analysis
Factorial Design (Multifactor)
Multivariate Analysis - Introduction
MGS 3100 Business Analysis Regression Feb 18, 2016
Presentation transcript:

Available at Chapter 13 Multivariate Analysis BCB 702: Biostatistics

 Usually involves situations where there are two or more dependent (response) variables  Examines the relationships or interactions of these variables  Takes into account the fact that:  Variables may not be independent of each other  Performing multiple comparisons increases the risk of making a Type I error  Simply performing a series of multiple univariate tests would not be appropriate and would give false results What is Multivariate Analysis?

Include:  Multivariate Analysis of Variance (MANOVA)  Discriminant Function Analysis (DFA)  Principal Components Analysis (PCA)  Factor Analysis  Cluster Analysis  Canonical Correlation Analysis  Multidimensional Scaling Types of Multivariate Tests

 Extension of the ANOVA  Examines two or more response variables  Combines multiple response variables into a single new variable to maximise the differences between the treatment group means  Obtain a multivariate F value – Wilks’ lambda (value between 0 and 1) is most commonly used  If the overall test is significant, we can then go on to examine which of the individual variables contributed to the significant effect MANOVA

 A researcher has collected a certain species of lizard from three different island populations. Each island represents a different eco-zone. He wishes to test whether lizards from different islands differ in their morphology and abilities, so he collects 10 lizards from each island and measures their body length, limb length and running speed.  Independent variable:  Island of origin  Dependent variables:  Body length  Limb length  Running speed MANOVA: Example

 From the analysis, we get:  The model shows a significant difference in lizards from the three islands (p <0.001) MANOVA: Example Wilks’ lambda Fdf (num) df (den) p <0.001

 Limb length and running speed differ significantly between lizards from different islands. There is no difference in body length MANOVA: Example SourceDependent variable Sum of squares dfMean square Fp IslandBody length Limb length Running speed8.261E E <0.001 ErrorBody length Limb length Running speed1.013E E-05

 Discriminant Function Analysis (DFA) is used to determine which variables predict naturally occurring groups in data  Several independent variables and one non- metric (grouping) dependent variable  MANOVA in reverse  DFA organises the original independent variables into a set of canonical correlations, which are linear combinations of the original variables Discriminant Function Analysis

 The first canonical correlation explains the most variation in the data set, the second canonical correlation explains the most variation that is left over, and so on  Three steps: 1.Look for an overall significant effect using a multivariate F test (Wilks’ lambda) 2.Examine the independent variables individually for differences in mean by group 3.Classification Discriminant Function Analysis

 Populations of a sunflower species grow at four sites (two in riparian habitat and two in serpentine habitat) that differ in soil chemistry and water availability. Various measures of soil chemistry were taken in order to determine which of these variables can be used to distinguish among sites. (Sambatti & Rice, 2006)  Independent variables:  Ca  Mg PP  Organic matter (OM)  pH  Dependent variable:  Site DFA: Example

 The overall model was significant (p <0.001), meaning that sites differ in soil nutrients  First canonical axis: The riparian habitats (particularly R1) have more OM and a lower pH  Second canonical axis: The two serpentine habitats (S1 and S2) have lower levels of Ca and P and slightly higher levels of Mg than riparian sites Canonical Centroid plot DFA: Example

 The goal of PCA is to reduce complex data sets containing a large number of variables to a lower dimension in order to see the relationships of variables more clearly  It computes a new set of composite variables called principal components (PCs)  Each PC explains a certain proportion of the variation in the data set, with PC1 explaining the most amount of variation, PC2 the next most amount of variation, and so on Principal Components Analysis

 Similar to Principal Components Analysis  Used to uncover underlying trends and relationships in large and complex data sets  Works on a correlation matrix of variables  Combines original variables into a smaller set of factors  Variables are correlated with each other due to their correlation with a common factor Factor Analysis

 Cluster analysis encompasses a number of different methods  Used to organize or group data according to similarities  There is no real dependent variable – cluster analysis does not attempt to explain why groups (clusters) exist  Often used in species taxonomy B A C D E Cluster Analysis

 Used when variables fall naturally into two groups (a group of dependent variables and a group of independent variables)  Tries to determine if there are linear relationships between the two sets of variables  It creates functions for each group, such that the correlation between the functions of each group is maximised  In this way, a combination of variables from the first group predicts a combination of variables from the second group Canonical Correlation Analysis

 Analyses pairwise similarities between variables  Only applicable to continuous data  Plots variables graphically to provide a visual representation of the pattern of proximity of a set of variables (objects)  Objects plotted close together are relatively similar to each other, while objects plotted far apart are relatively dissimilar Multidimensional Scaling