STATISTICAL INFERENCE PART IX HYPOTHESIS TESTING - APPLICATIONS – MORE THAN TWO POPULATION.

Slides:



Advertisements
Similar presentations
Prepared by Lloyd R. Jaisingh
Advertisements

Generalized Linear Models (GLM)
Nonparametric tests and ANOVAs: What you need to know.
Testing means, part III The two-sample t-test. Sample Null hypothesis The population mean is equal to  o One-sample t-test Test statistic Null distribution.
Chapter 13 Multiple Regression
Multiple Regression Predicting a response with multiple explanatory variables.
Analysis of Variance and Multiple Comparisons Comparing more than two means and figuring out which are different.
ANOVA notes NR 245 Austin Troy
Using Statistics in Research Psych 231: Research Methods in Psychology.
Using Statistics in Research Psych 231: Research Methods in Psychology.
Independent Sample T-test Formula
Part I – MULTIVARIATE ANALYSIS
Using Statistics in Research Psych 231: Research Methods in Psychology.
Chapter 12 Multiple Regression
Comparing Means.
BCOR 1020 Business Statistics
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
Lecture 9: One Way ANOVA Between Subjects
Two Groups Too Many? Try Analysis of Variance (ANOVA)
Lecture 12 One-way Analysis of Variance (Chapter 15.2)
Chi-Square and F Distributions Chapter 11 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Ch. 14: The Multiple Regression Model building
Using Statistics in Research Psych 231: Research Methods in Psychology.
Hypothesis Testing :The Difference between two population mean :
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Introduction to Multiple Regression Statistics for Managers.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Chapter 12: Analysis of Variance
PS 225 Lecture 15 Analysis of Variance ANOVA Tables.
T-test Mechanics. Z-score If we know the population mean and standard deviation, for any value of X we can compute a z-score Z-score tells us how far.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Chapter 11 HYPOTHESIS TESTING USING THE ONE-WAY ANALYSIS OF VARIANCE.
Chapter 14 Introduction to Multiple Regression
January 31 and February 3,  Some formulae are presented in this lecture to provide the general mathematical background to the topic or to demonstrate.
ANOVA (Analysis of Variance) by Aziza Munir
Testing Multiple Means and the Analysis of Variance (§8.1, 8.2, 8.6) Situations where comparing more than two means is important. The approach to testing.
Statistics Psych 231: Research Methods in Psychology.
Between-Groups ANOVA Chapter 12. >When to use an F distribution Working with more than two samples >ANOVA Used with two or more nominal independent variables.
Copyright © 2011 Pearson Education, Inc. Analysis of Variance Chapter 26.
I. Statistical Tests: A Repetive Review A.Why do we use them? Namely: we need to make inferences from incomplete information or uncertainty þBut we want.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
1 Analysis of Variance Chapter 14 2 Introduction Analysis of variance helps compare two or more populations of quantitative data. Specifically, we are.
Interval Estimation and Hypothesis Testing Prepared by Vera Tabakova, East Carolina University.
Statistics for Business and Economics 8 th Edition Chapter 11 Simple Regression Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch.
Analysis of Variance (ANOVA) Brian Healy, PhD BIO203.
Chapter 13 - ANOVA. ANOVA Be able to explain in general terms and using an example what a one-way ANOVA is (370). Know the purpose of the one-way ANOVA.
Chapter Outline Goodness of Fit test Test of Independence.
ANOVA P OST ANOVA TEST 541 PHL By… Asma Al-Oneazi Supervised by… Dr. Amal Fatani King Saud University Pharmacy College Pharmacology Department.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 14-1 Chapter 14 Introduction to Multiple Regression Statistics for Managers using Microsoft.
Business Statistics: A First Course (3rd Edition)
- We have samples for each of two conditions. We provide an answer for “Are the two sample means significantly different from each other, or could both.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
One-way ANOVA Example Analysis of Variance Hypotheses Model & Assumptions Analysis of Variance Multiple Comparisons Checking Assumptions.
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 14-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
T tests comparing two means t tests comparing two means.
Comparing Two Means Ch. 13. Two-Sample t Interval for a Difference Between Two Means.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
Formula for Linear Regression y = bx + a Y variable plotted on vertical axis. X variable plotted on horizontal axis. Slope or the change in y for every.
Copyright © 2014, 2011 Pearson Education, Inc. 1 Chapter 26 Analysis of Variance.
Analysis of Variance ANOVA - method used to test the equality of three or more population means Null Hypothesis - H 0 : μ 1 = μ 2 = μ 3 = μ k Alternative.
Inferential Statistics Psych 231: Research Methods in Psychology.
Chapter 11: Test for Comparing Group Means: Part I.
The 2 nd to last topic this year!!.  ANOVA Testing is similar to a “two sample t- test except” that it compares more than two samples to one another.
Inferential Statistics Psych 231: Research Methods in Psychology.
STAT Single-Factor ANOVA
Statistical Significance Test
I. Statistical Tests: Why do we use them? What do they involve?
One way ANALYSIS OF VARIANCE (ANOVA)
Chapter 15 Analysis of Variance
Presentation transcript:

STATISTICAL INFERENCE PART IX HYPOTHESIS TESTING - APPLICATIONS – MORE THAN TWO POPULATION

INFERENCES ABOUT POPULATION MEANS Example: H o :  1 =  2 =  3 where  1 = population mean for group 1  2 = population mean for group 2  3 = population mean for group 3 H 1 : Not all are equal. 2

Assumptions Each of the populations are normally distributed (or large enough sample sizes to use CLT) with equal variances Populations are independent Cases within each sample are independent 3

Difference in means small relative to overall variability Difference in means large relative to overall variability Larger F-values typically yield more significant results. How large is large enough? We will compare with the tabulated value. INFERENCES ABOUT POPULATION MEANS - ANOVA  F tends to be small  F tends to be large 4

INFERENCES ABOUT POPULATION MEANS If F test shows that there are significant differences between means, then, apply paired t-tests to see which one(s) are different. Apply multiple testing correction to control for Type I error 5

Example Kenton Food Company wants to test 4 different package designs for a new product. Designs are introduced in 20 randomly selected markets. These markets are similar to each other in terms of location and sales records. Due to a fire incidence, one of these markets are removed from the study, leading to an unbalanced study design. 6 Example is taken from: Neter, J., Kutner, M.H., Nachtsheim, C.J., & Wasserman, W., (1996) Applied Linear Statistical Models, 4th edition, Irwin.

Example Market (j) Design (i) Is there a difference among designs in terms of their average sales? 7

Example > va1=read.table("VAT1.txt",header=T) > head(va1,3) Case Design Market Sales > aov1 = aov(Sales ~ Design,data=va1) > summary(aov1) Df Sum Sq Mean Sq F value Pr(>F) Design e-05 *** Residuals Signif. codes: 0 ‘***’ ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1 Degrees of freedoms are wrong! Since there are 4 different designs, d.f. should be 3. 8

Example > class(va1[,2]) [1] "integer" > va1[,2]=as.factor(va1[,2]) > aov1 = aov(Sales ~ Design,data=va1) > summary(aov1) Df Sum Sq Mean Sq F value Pr(>F) Design e-05 *** Residuals Signif. codes: 0 ‘***’ ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1 # or, alternatively: > aov1 = aov(Sales ~factor(Design),data=va1) 4 designs have different mean sales. But, which one(s) are different? 9

Example > library(multcomp) > c1=glht(aov1, linfct = mcp(Design = "Tukey")) > summary(c1) Simultaneous Tests for General Linear Hypotheses Multiple Comparisons of Means: Tukey Contrasts Fit: aov(formula = Sales ~ Design, data = va1) Linear Hypotheses: EstimateStd. Errort value Pr(>|t|) == == == <0.001 *** == == <0.001 *** == * Signif. codes: 0 ‘***’ ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1 (Adjusted p values reported -- single-step method) 4 th design has higher average sales than all other designs. 3 rd design is slightly significantly better than 2 nd design. 10

Example # or, alternatively > TukeyHSD(aov1, "Tasarim", conf.level=0.9) There are many functions in R available for multiple testing correction. For instance, you can look into “p.adjust” function in stats library for other types of corrections (e.g. Bonferroni). Supply raw p-values  obtain adjusted p-values. Different ANOVA types (e.g. 2-factor, repeated,…) in R; reference: Ilk, O. (2011) R Yazilimina Giris, ODTU, Chp. 7 11