Ark nr.: 1 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk Rotation Tests - Computing exact adjusted.

Slides:



Advertisements
Similar presentations
3.3 Hypothesis Testing in Multiple Linear Regression
Advertisements

1 1 Chapter 5: Multiple Regression 5.1 Fitting a Multiple Regression Model 5.2 Fitting a Multiple Regression Model with Interactions 5.3 Generating and.
Hypothesis Testing Steps in Hypothesis Testing:
Analysis of variance (ANOVA)-the General Linear Model (GLM)
Generalized Linear Models (GLM)
Regression Part II One-factor ANOVA Another dummy variable coding scheme Contrasts Multiple comparisons Interactions.
Classical inference and design efficiency Zurich SPM Course 2014
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
Hypothesis Testing Using a Single Sample
© 2010 Pearson Prentice Hall. All rights reserved Single Factor ANOVA.
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
Analysis of variance (ANOVA)-the General Linear Model (GLM)
Part I – MULTIVARIATE ANALYSIS
Differentially expressed genes
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Lecture 14 – Thurs, Oct 23 Multiple Comparisons (Sections 6.3, 6.4). Next time: Simple linear regression (Sections )
Lecture 9: One Way ANOVA Between Subjects
Log-linear and logistic models
Analysis of Variance Chapter 3Design & Analysis of Experiments 7E 2009 Montgomery 1.
One-way Between Groups Analysis of Variance
Topic 3: Regression.
Lecture 12 One-way Analysis of Variance (Chapter 15.2)
Significance Tests P-values and Q-values. Outline Statistical significance in multiple testing Statistical significance in multiple testing Empirical.
Inferences About Process Quality
Analysis of Variance & Multivariate Analysis of Variance
Independent Sample T-test Classical design used in psychology/medicine N subjects are randomly assigned to two groups (Control * Treatment). After treatment,
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Objectives of Multiple Regression
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Analysis of Categorical Data Test of Independence.
Chapter 13: Inference in Regression
Multiple testing in high- throughput biology Petter Mostad.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.2 Estimating Differences.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Essential Statistics in Biology: Getting the Numbers Right
Statistics & Biology Shelly’s Super Happy Fun Times February 7, 2012 Will Herrick.
Regression Part II One-factor ANOVA Another dummy variable coding scheme Contrasts Multiple comparisons Interactions.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Differential Expression II Adding power by modeling all the genes Oct 06.
ANOVA (Analysis of Variance) by Aziza Munir
1 Chapter 3 Multiple Linear Regression Multiple Regression Models Suppose that the yield in pounds of conversion in a chemical process depends.
Multiple Regression The Basics. Multiple Regression (MR) Predicting one DV from a set of predictors, the DV should be interval/ratio or at least assumed.
Brian Macpherson Ph.D, Professor of Statistics, University of Manitoba Tom Bingham Statistician, The Boeing Company.
Bioinformatics Expression profiling and functional genomics Part II: Differential expression Ad 27/11/2006.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
May 2004 Prof. Himayatullah 1 Basic Econometrics Chapter 5: TWO-VARIABLE REGRESSION: Interval Estimation and Hypothesis Testing.
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005 Dr. John Lipp Copyright © Dr. John Lipp.
Analysis of Variance (ANOVA) Brian Healy, PhD BIO203.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.1 One-Way ANOVA: Comparing.
Chapter 8 1-Way Analysis of Variance - Completely Randomized Design.
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
Applied Quantitative Analysis and Practices LECTURE#25 By Dr. Osman Sadiq Paracha.
ANOVA P OST ANOVA TEST 541 PHL By… Asma Al-Oneazi Supervised by… Dr. Amal Fatani King Saud University Pharmacy College Pharmacology Department.
Comp. Genomics Recitation 10 4/7/09 Differential expression detection.
1 Identifying Robust Activation in fMRI Thomas Nichols, Ph.D. Assistant Professor Department of Biostatistics University of Michigan
BIOSTATISTICS Hypotheses testing and parameter estimation.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 6 –Multiple hypothesis testing Marshall University Genomics.
Chapters Way Analysis of Variance - Completely Randomized Design.
Model adequacy checking in the ANOVA Checking assumptions is important –Normality –Constant variance –Independence –Have we fit the right model? Later.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.
Differences Among Groups
Copyright © 2008 by Nelson, a division of Thomson Canada Limited Chapter 18 Part 5 Analysis and Interpretation of Data DIFFERENCES BETWEEN GROUPS AND RELATIONSHIPS.
Review Statistical inference and test of significance.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
The 2 nd to last topic this year!!.  ANOVA Testing is similar to a “two sample t- test except” that it compares more than two samples to one another.
INTRODUCTION TO MULTIPLE REGRESSION MULTIPLE REGRESSION MODEL 11.2 MULTIPLE COEFFICIENT OF DETERMINATION 11.3 MODEL ASSUMPTIONS 11.4 TEST OF SIGNIFICANCE.
Presentation transcript:

Ark nr.: 1 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk Rotation Tests - Computing exact adjusted p-values in multiresponse experiments Øyvind Langsrud, MATFORSK, Norwegian Food Research Institute.

Ark nr.: 2 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk Campylobacter experiment  Three biological replicates (block variable)  312 FT-IR wavelengths as multiple responses  polysaccharide region [ cm -1 ]

Ark nr.: 3 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk Analysis with MINITAB - first wavelength

Ark nr.: 4 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk ANALYSIS with MANOVA

Ark nr.: 5 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk Adjusted means as curves

Ark nr.: 6 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk Effect of Day single response p-values  Ordinary significance tests are not longer suitable  A lot of significant results cased by random variation (since several tests/responses)  The p-values need to be adjusted  So that they are interpretable

Ark nr.: 7 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk Adjusted p-values  So that experimentwise (or familywise) error rate is controlled  Bonferroni correction (classical method)  pAdj = #responses pRaw  Conservative upper bound (in practice often too conservative )  Dependence among responses not investigated  Modern methods  Makes active use of dependence among responses  Permutation tests  Rotation tests

Ark nr.: 8 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk Assume a regression model (simplified model without constant term)  Separate F-tests for each response  Random variables: F 1, F 2 …, F q  Observed values: f 1, f 2 …, f q  Maximal F-value (= minimum p-value) obtained for response number k  Raw p-value:  Adjusted p-value:

Ark nr.: 9 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk Adjusting the minimum p-value by permutations  For m =1,2 …. M  permute data ( Y  P (m) Y )  compute maximal F-statistic from these data  Compute p-value as

Ark nr.: 10 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk How is dependence handled?  Estimate of covariance matrix under H 0 :  Estimate based on permuted data:  The permutation test is a conditional test  Conditioned on the covariance matrix estimate  Conditioned on sufficient statistics for the unknown parameters  Fisher's exact test for 2  2 contingency tables  is the most famous conditional test

Ark nr.: 11 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk Conditional test under multivariate normality?  Need distribution of Y conditioned on Y T Y  Answer  Y is distributed as RY obs  where Y obs is the observed matrix  and where R is an uniformly distributed orthogonal matrix (random rotation matrix)  Relation to well-known tests  t-test, F-tests, Hotelling T 2, Wilks’  are special cases of rotation testing  But these test statistics do not depend on Y T Y  Conditioning not needed  Simulations not necessary

Ark nr.: 12 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk Adjusting the minimum p-value by rotations  For m =1,2 …. M  simulate rotated data ( Y  R (m) Y )  where R (m) is a simulated random rotation matrix  compute maximal F-statistic from these data  Compute p-value as  In practice: a much more efficient algorithm is applied

Ark nr.: 13 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk Adjusting the other p-values (permutations or rotations)  Remove response with minimum p-value  Adjust minimum p-value in new data set  and so on  Enforce monotonicity  All calculations can be done simultaneously

Ark nr.: 14 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk Permutation test or rotation test  Exact permutation testing  The only assumption: independent observations  Useless for few observations  Useless for complex ANOVA and regression models  Exact rotation testing  Assumes multivariate normality  Does not need as many observations as permutation testing  Can be use for complex ANOVA and regression models  F-test  rotation test

Ark nr.: 15 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk Adjusted p-values (FWE)  non-adjusted p-values (RAW)  False significance at 1% level is expected in 1% of all the investigated responses  If you have 5000 responses …..  “Classically ” adjusted p-values (FWE)  False significance at 1% level is expected in not more that 1% of all experiments where the method is applied.  The experimentwise (or familywise) error rate is controlled

Ark nr.: 16 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk False Discovery Rate (FDR)  Adjusted p-values according to False Discovery Rate  False significance at 1% level is expected in 1% of all cases (responses) reported as significant at 1% level.  If you have 5000 responses and 200 are reported as significant at 1% level, one will expect two of these as false.  “q-values” is proposed instead of “adjusted p-values”

Ark nr.: 17 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk Calculation of FDR adjusted p-values  Several methods exist  Most of them do not handle the dependence among the responses  but OK if the “weak dependence requirement” is met  New variant  based on rotations (or alternatively permutations)  handles any kind of dependence  conservative compared to other methods  since the method does not involve an estimate of the amount of responses with true null hypotheses

Ark nr.: 18 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk Adjusted p-values (first 30 wavelengths)

Ark nr.: 19 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk Adjusted p-values (30 most significant wavelengths)

Ark nr.: 20 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk p-values

Ark nr.: 21 | Forfatter: Øyvind Langsrud - a member of the Food Science Alliance | NLH - Matforsk - Akvaforsk Rotation Tests - Conclusion  Simulation principle for computing exact Monte Carlo p-value for any test statistic.  Based on multivariate normal distribution.  Generalisation of classical tests.  Related to permutation testing.  Useful for computing adjusted p-values (F-tests)  FWE, FDR  General linear models (ANOVA and regression)  Implemented in the MANOVA program ( )