1-20-051 Multi-way Anova Identifying and quantifying sources of variation Ability to "factor out" certain sources - ("adjusting") For the beginning, we.

Slides:



Advertisements
Similar presentations
Agenda of Week VII Review of Week VI Multiple regression Canonical correlation.
Advertisements

Linear Models for Microarray Data
Structural Equation Modeling
© Department of Statistics 2012 STATS 330 Lecture 32: Slide 1 Stats 330: Lecture 32.
Linear regression models
Quantitative Data Analysis: Hypothesis Testing
Correlation and Regression. Spearman's rank correlation An alternative to correlation that does not make so many assumptions Still measures the strength.
Statistical Concepts (part III) Concepts to cover or review today: –Population parameter –Sample statistics –Mean –Standard deviation –Coefficient of variation.
1 Chapter 3 Multiple Linear Regression Ray-Bing Chen Institute of Statistics National University of Kaohsiung.
Differentially expressed genes
Crime? FBI records violent crime, z x y z [1,] [2,] [3,] [4,] [5,]
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Regression Approach To ANOVA
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Multiple Linear Regression Response Variable: Y Explanatory Variables: X 1,...,X k Model (Extension of Simple Regression): E(Y) =  +  1 X 1 +  +  k.
Regression and Correlation Methods Judy Zhong Ph.D.
Chapter 4 Hypothesis Testing, Power, and Control: A Review of the Basics.
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
 Combines linear regression and ANOVA  Can be used to compare g treatments, after controlling for quantitative factor believed to be related to response.
1 Design of Engineering Experiments Part 2 – Basic Statistical Concepts Simple comparative experiments –The hypothesis testing framework –The two-sample.
Chapter 14 Simple Regression
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Randomization issues Two-sample t-test vs paired t-test I made a mistake in creating the dataset, so previous analyses will not be comparable.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
Design and Analysis of Experiments Dr. Tai-Yue Wang Department of Industrial and Information Management National Cheng Kung University Tainan, TAIWAN,
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
We calculated a t-test for 30,000 genes at once How do we handle results, present data and results Normalization of the data as a mean of removing.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
11 Chapter 12 Quantitative Data Analysis: Hypothesis Testing © 2009 John Wiley & Sons Ltd.
Analisa Regresi Week 7 The Multiple Linear Regression Model
Inference for regression - More details about simple linear regression IPS chapter 10.2 © 2006 W.H. Freeman and Company.
Simple Linear Regression ANOVA for regression (10.2)
Chapter 9 Introduction to the t Statistic. 9.1 Review Hypothesis Testing with z-Scores Sample mean (M) estimates (& approximates) population mean (μ)
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Analysis of Covariance Combines linear regression and ANOVA Can be used to compare g treatments, after controlling for quantitative factor believed to.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
1 Example Analysis of a Two-Color Array Experiment Using LIMMA 3/30/2011 Copyright © 2011 Dan Nettleton.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: One-way ANOVA Marshall University Genomics Core.
Suppose we have T genes which we measured under two experimental conditions (Ctl and Nic) in n replicated experiments t i * and p i are the t-statistic.
Analysis and Interpretation: Analysis of Variance (ANOVA)
Multiple Regression I 1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Multiple Regression Analysis (Part 1) Terry Dielman.
Lecture 6 Design Matrices and ANOVA and how this is done in LIMMA.
The Assessment of Improved Water Sources Across the Globe By Philisile Dube.
IE241: Introduction to Design of Experiments. Last term we talked about testing the difference between two independent means. For means from a normal.
Regression. Height Weight How much would an adult female weigh if she were 5 feet tall? She could weigh varying amounts – in other words, there is a distribution.
ReCap Part II (Chapters 5,6,7) Data equations summarize pattern in data as a series of parameters (means, slopes). Frequency distributions, a key concept.
Inference for regression - More details about simple linear regression IPS chapter 10.2 © 2006 W.H. Freeman and Company.
Colorado Center for Astrodynamics Research The University of Colorado 1 STATISTICAL ORBIT DETERMINATION EKF and Observability ASEN 5070 LECTURE 23 10/21/09.
Tutorial 5 Thursday February 14 MBP 1010 Kevin Brown.
> FDRpvalue FDRTPvalue plot(LimmaFit$coefficients[complete.data],-log(FDRTPvalue[complete.data],base=10),type="p",main="Two-Sample.
Summary of the Statistics used in Multiple Regression.
Chapter 26: Inference for Slope. Height Weight How much would an adult female weigh if she were 5 feet tall? She could weigh varying amounts – in other.
Statistics 350 Review. Today Today: Review Simple Linear Regression Simple linear regression model: Y i =  for i=1,2,…,n Distribution of errors.
AP Statistics Chapter 14 Section 1.
General Linear Model & Classical Inference
Hypothesis testing using contrasts
Psychology 202a Advanced Psychological Statistics
Chapter 13 Created by Bethany Stubbe and Stephan Kogitz.
Statistical Application Analysis On Dhaka Bank _______________________________________
Regression model with multiple predictors
CHAPTER 29: Multiple Regression*
Inferences on Two Samples Summary
Multiple Regression Models
Ass. Prof. Dr. Mogeeb Mosleh
Multiple Linear Regression
CHAPTER 10 Comparing Two Populations or Groups
RES 500 Academic Writing and Research Skills
Presentation transcript:

Multi-way Anova Identifying and quantifying sources of variation Ability to "factor out" certain sources - ("adjusting") For the beginning, we will reproduce paired t-test results by assuming that arrays are one of the factors in a Two-way ANOVA Second, adjusting for the dye effects in a Three-way ANOVA Third, four and more - way ANOVA when having multiple factors of interest

Sources of variation And the Two-way ANOVA

Statistical Inference in Two-way ANOVA Statistical Model Parameter Estimates Null Hypotheses Null Distributions

Alternative Formulations of the Two-way ANOVA No-intercept Model Parameter Estimates Gets complicated Regardless of how the model is parametrized certain parameters remain unchanged (Trt 2 -Trt 1 ) In this sense all formulations are equivalent Null Hypotheses Null Distributions

Re-organizing data for ANOVA > DAnova<-NNBLimmadataC > DAnova$weights<-cbind(DAnova$weights,DAnova$weights) > DAnova$M<-cbind((DAnova$A+DAnova$M/2),DAnova$A-(DAnova$M/2)) > DAnova$A<-cbind(DAnova$A,DAnova$A) > attributes(DAnova) $names [1] "weights" "targets" "genes" "printer" "M" "A" $class [1] "MAList" attr(,"package") [1] "limma" > NNBLimmadataC$M[1,] 51-C1-3-vs-W W2-3-vs-C C3-3-vs-W W4-3-vs-C C5-3-vs-W W6-3-vs-C > NNBLimmadataC$A[1,] 51-C1-3-vs-W W2-3-vs-C C3-3-vs-W W4-3-vs-C C5-3-vs-W W6-3-vs-C > DAnova$M[1,] 51-C1-3-vs-W W2-3-vs-C C3-3-vs-W W4-3-vs-C C5-3-vs-W W6-3-vs-C C1-3-vs-W W2-3-vs-C C3-3-vs-W W4-3-vs-C C5-3-vs-W W6-3-vs-C

Setting-up the design matrix for limma > targets SlideNumber FileName Cy3 Cy5 Date 51-C1-3-vs-W C1-3-vs-W1-5.gpr C W 11/8/ W2-3-vs-C W2-3-vs-C2-5.gpr W C 11/8/ C3-3-vs-W C3-3-vs-W3-5.gpr C W 11/8/ W4-3-vs-C W4-3-vs-C4-5.gpr W C 11/8/ C5-3-vs-W C5-3-vs-W5-5.gpr C W 11/8/ W6-3-vs-C W6-3-vs-C6-5.gpr W C 11/8/2004 > trt<-c(targets$Cy5,targets$Cy3) > trt [1] "W" "C" "W" "C" "W" "C" "C" "W" "C" "W" "C" "W" > array<-c(1:6,1:6) > array [1]

Setting-up the design matrix for limma > designa<-model.matrix(~-1+factor(array)+factor(trt)) > designa factor(array)1 factor(array)2 factor(array)3 factor(array)4 factor(array)5 factor(array)6 factor(trt)W attr(,"assign") [1] attr(,"contrasts") attr(,"contrasts")$"factor(array)" [1] "contr.treatment" attr(,"contrasts")$"factor(trt)" [1] "contr.treatment"

Setting-up the design matrix for limma > colnames(designa)<-c(paste("A",1:6,sep=""),"W") > designa A1 A2 A3 A4 A5 A6 W attr(,"assign") [1] attr(,"contrasts") attr(,"contrasts")$"factor(array)" [1] "contr.treatment" attr(,"contrasts")$"factor(trt)" [1] "contr.treatment"

Comparing to paired t-test > Anova<-lmFit(DAnova,designa) > > Anova$coefficients[2,] A1 A2 A3 A4 A5 A6 W > LimmaPTT$coefficients[2] [1] > Anova$coefficients[1,] A1 A2 A3 A4 A5 A6 W NA NA > LimmaPTT$coefficients[1] [1] >

Comparing to paired t-test > plot(Anova$coefficients[,"W"],LimmaPTT$coefficients) > plot(Anova$sigma,LimmaPTT$sigma) > plot(Anova$stdev.unscaled[,"W"],LimmaPTT$stdev.unscaled) > plot(Anova$sigma*Anova$stdev.unscaled[,"W"],LimmaPTT$sigma*LimmaPTT$stdev.unscaled) > plot(Anova$df.residual,LimmaPTT$df.residual)

Adjusting for Dye - Three-way ANOVA > dye<-c(rep("Cy5",6),rep("Cy3",6)) > > designad<-model.matrix(~-1+factor(array)+factor(dye)+factor(trt)) > #designad > colnames(designad)<-c(paste("A",1:6,sep=""),"Cy5","W") > designad A1 A2 A3 A4 A5 A6 Cy5 W attr(,"assign") [1] attr(,"contrasts") attr(,"contrasts")$"factor(array)" [1] "contr.treatment" attr(,"contrasts")$"factor(dye)" [1] "contr.treatment" attr(,"contrasts")$"factor(trt)" [1] "contr.treatment"

Adjusting for Dye - Three-way ANOVA >Anovad<-lmFit(DAnova,designad) > > Anova$coefficients[2,] A1 A2 A3 A4 A5 A6 W > LimmaPTT$coefficients[2] [1] > Anovad$coefficients[2,] A1 A2 A3 A4 A5 A6 Cy5 W > Anova$coefficients[1,] A1 A2 A3 A4 A5 A6 W NA NA > LimmaPTT$coefficients[1] [1] > Anovad$coefficients[1,] A1 A2 A3 A4 A5 A6 Cy5 W NA NA >

Comparing to paired t-test > plot(Anovad$coefficients[,"W"],LimmaPTT$coefficients) > plot(Anovad$sigma,LimmaPTT$sigma) > plot(Anovad$stdev.unscaled[,"W"],LimmaPTT$stdev.unscaled) > plot(Anovad$sigma*Anova$stdev.unscaled[,"W"],LimmaPTT$sigma*LimmaPTT$stdev.unscaled) > plot(Anovad$df.residual,LimmaPTT$df.residual)

Comparing to paired t-test > AnovaB<-eBayes(Anova) > AnovadB<-eBayes(Anovad) > volcanoplot(AnovaB,coef=7) > volcanoplot(AnovadB,coef=8)