Be humble in our attribute, be loving and varying in our attitude, that is the way to live in heaven.

Presentation transcript:


Applied Statistics Using SAS and SPSS Topic: One Way ANOVA By Prof Kelly Fan, Cal State Univ, East Bay

Statistical Tools vs. Variable Types
 - Numerical response (output), numerical predictor (input): Simple and Multiple Regression
 - Numerical response (output), categorical/mixed predictor (input): Analysis of Variance (ANOVA), Analysis of Covariance (ANCOVA)
 - Categorical response (output): Categorical data analysis

Example: Battery Lifetime. Eight brands of battery are studied. We would like to find out whether the brand of a battery affects its lifetime and, if so, which brands last longer than the others. Data collection: for each brand, 3 batteries are tested for their lifetime. What is the Y variable? What is the X variable?

Data: Y = LIFETIME (HOURS), recorded by BRAND, with 3 replications per level.

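Statistical Model. The data are laid out with one column per “LEVEL” OF BRAND (1, 2, ..., C) and one row per replication (1, 2, ..., n). Brand is, of course, represented as “categorical”. With $Y_{ij}$ denoting the $j$-th replication at level $i$, the model is
$Y_{ij} = \mu_i + \varepsilon_{ij}, \quad i = 1, \ldots, C, \quad j = 1, \ldots, n$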

Hypotheses Setup
$H_0$: Level of X has no impact on Y
$H_1$: Level of X does have impact on Y
Equivalently, $H_0: \mu_1 = \mu_2 = \cdots = \mu_8$ and $H_1$: not all $\mu_i$ are EQUAL

ONE WAY ANOVA
Analysis of Variance for life
Table columns: Source, DF, SS, MS, F, P. Table rows: brand, Error, Total.
The Error mean square is the estimate of the common variance $\sigma^2$.
S =    R-Sq = 59.67%    R-Sq(adj) = 42.02%
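To make the table entries concrete, here is a minimal Python sketch of how the DF, SS, MS, F, and R-Sq values of a one-way ANOVA table are computed. The function name `one_way_anova` and the tiny data set are hypothetical (they are not the battery data), and only the standard library is used, so no P-value is produced.

```python
from statistics import mean

def one_way_anova(groups):
    """Compute one-way ANOVA table entries; `groups` is a list of lists, one per level."""
    all_obs = [y for g in groups for y in g]
    grand_mean = mean(all_obs)
    c = len(groups)          # number of levels (e.g. brands)
    n_total = len(all_obs)   # total number of observations

    # Between-level sum of squares: level means around the grand mean
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Error sum of squares: observations around their own level mean
    ss_error = sum((y - mean(g)) ** 2 for g in groups for y in g)
    ss_total = ss_between + ss_error

    df_between, df_error = c - 1, n_total - c
    ms_between = ss_between / df_between
    ms_error = ss_error / df_error   # estimate of the common variance
    r_sq = ss_between / ss_total
    r_sq_adj = 1 - (1 - r_sq) * (n_total - 1) / df_error
    return {
        "df": (df_between, df_error, n_total - 1),
        "ss": (ss_between, ss_error, ss_total),
        "ms": (ms_between, ms_error),
        "F": ms_between / ms_error,
        "R-Sq": r_sq,
        "R-Sq(adj)": r_sq_adj,
    }

# Hypothetical data: 3 levels with 3 replications each
table = one_way_anova([[1, 2, 3], [2, 3, 4], [6, 7, 8]])
print(table)
```

For the battery example (8 brands, 3 replications), the same formulas give DF = 7, 16, and 23, and the adjusted R-Sq formula gives 1 - (1 - 0.5967)(23/16), about 0.420, in line with the reported 42.02% up to rounding of R-Sq.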

Review
Fitted value = Predicted value
Residual = Observed value - Fitted value
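In a one-way ANOVA the fitted value of an observation is the sample mean of its level, so residuals are deviations from the level mean. A small sketch, assuming a hypothetical helper `fitted_and_residuals` and made-up data:

```python
from statistics import mean

def fitted_and_residuals(groups):
    """Return fitted values (level means) and residuals, level by level."""
    fitted = [[mean(g)] * len(g) for g in groups]
    residuals = [[y - mean(g) for y in g] for g in groups]
    return fitted, residuals

# Hypothetical data: 3 levels with 3 replications each
fitted, residuals = fitted_and_residuals([[1, 2, 3], [2, 3, 4], [6, 7, 8]])
print(fitted)
print(residuals)
```

These are the two quantities plotted against each other in a residual plot, and the residuals are the values checked in a normality plot.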

Diagnosis: Normality. The points on the normality plot must more or less follow a line to claim “normally distributed”. There are statistical tests to verify it formally. The ANOVA method we learn here is not sensitive to the normality assumption. That is, a mild departure from the normal distribution will not change our conclusions much. Normality plot: normal scores vs. residuals

From the Battery lifetime data:

Diagnosis: Equal Variances. The points on the residual plot must be more or less within a horizontal band to claim “constant variances”. There are statistical tests to verify it formally. The ANOVA method we learn here is not sensitive to the constant variances assumption. That is, slightly different variances within groups will not change our conclusions much. Residual plot: fitted values vs. residuals

From the Battery lifetime data:

Multiple Comparison Procedures. Once we reject $H_0: \mu_1 = \mu_2 = \cdots = \mu_c$ in favor of $H_1$: NOT all $\mu$'s are equal, we don't yet know the way in which they're not all equal, but simply that they're not all the same. If there are 4 columns, are all 4 $\mu$'s different? Are 3 the same and one different? If so, which one? etc.

These “more detailed” inquiries into the process are called MULTIPLE COMPARISON PROCEDURES. Errors (Type I): We set up “$\alpha$” as the significance level for a hypothesis test. Suppose we test 3 independent hypotheses, each at $\alpha = .05$; each test has a type I error probability (rejecting $H_0$ when it's true) of .05. However, given that all three null hypotheses are true, P(at least one type I error in the 3 tests) = 1 - P(accept all 3) = $1 - (.95)^3 \approx .14$

In other words, the probability is .14 that at least one type I error is made. For 5 tests, the probability is .23. Question: Should we choose $\alpha = .05$, and suffer (for 5 tests) a .23 OVERALL error rate (also written “a” or $\alpha_{\text{experimentwise}}$)? OR should we choose/control the overall error rate, “a”, to be .05, and find the individual test $\alpha$ from $1 - (1 - \alpha)^5 = .05$ (which gives us $\alpha \approx .010$)?
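The arithmetic behind these error rates can be checked with a few lines of Python. This is a sketch under the independence assumption stated in the slides; the function names are hypothetical.

```python
def familywise_error_rate(alpha, k):
    """P(at least one type I error) in k independent tests, each at level alpha."""
    return 1 - (1 - alpha) ** k

def per_test_alpha(overall, k):
    """Per-test alpha that yields the given overall rate for k independent tests."""
    return 1 - (1 - overall) ** (1 / k)

rate_3 = familywise_error_rate(0.05, 3)   # 3 tests at alpha = .05
rate_5 = familywise_error_rate(0.05, 5)   # 5 tests at alpha = .05
alpha_5 = per_test_alpha(0.05, 5)         # per-test alpha for overall .05, 5 tests

print(round(rate_3, 3))    # 0.143
print(round(rate_5, 3))    # 0.226
print(round(alpha_5, 4))   # 0.0102
```

Plugging `alpha_5` back into `familywise_error_rate` with `k = 5` recovers the overall rate of .05, which confirms the two functions are inverses of each other.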

The formula $1 - (1 - \alpha)^5 = .05$ would be valid only if the tests are independent; often they're not. [ e.g., consider the three hypotheses $\mu_1 = \mu_2$, $\mu_2 = \mu_3$, and $\mu_1 = \mu_3$. IF the first is accepted and the second is rejected, isn't it more likely that the third is rejected? ]

When the tests are not independent, it's usually very difficult to arrive at the correct $\alpha$ for an individual test so that a specified value results for the overall error rate.

Categories of multiple comparison tests
 - “Planned” / “a priori” comparisons (stated in advance, usually a linear combination of the column means equal to zero)
 - “Post hoc” / “a posteriori” comparisons (decided after a look at the data: which comparisons “look interesting”)
 - “Post hoc” multiple comparisons (every column mean compared with each other column mean)

There are many multiple comparison procedures. We'll cover only a few.
Post hoc multiple comparisons
1) Pairwise comparisons: do a series of pairwise tests; Duncan and SNK tests
2) (Optional) Comparisons to control: Dunnett tests

Example: Broker Study. A financial firm would like to determine whether the brokers it uses to execute trades differ in their ability to purchase stock for the firm at a low buying price per share. To measure cost, an index, Y, is used: Y = 1000(A - P)/A, where P = per-share price paid for the stock and A = average of the high price and the low price per share for the day. “The higher Y is, the better the trade is.”
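As a quick illustration of the index, the sketch below computes Y for two made-up trades. The function name `trade_index` and all prices are hypothetical; only the formula Y = 1000(A - P)/A comes from the example.

```python
def trade_index(high, low, paid):
    """Cost index Y = 1000 * (A - P) / A, where A is the day's high/low average."""
    a = (high + low) / 2          # A: average of the day's high and low price
    return 1000 * (a - paid) / a  # positive when the price paid is below A

# Hypothetical trades on a day with high = 52 and low = 48 (so A = 50)
y_good = trade_index(high=52.0, low=48.0, paid=49.0)  # bought below the average
y_bad = trade_index(high=52.0, low=48.0, paid=51.0)   # bought above the average

print(y_good)  # 20.0
print(y_bad)   # -20.0
```

A trade bought below the day's average price gets a positive index, which matches the remark that a higher Y means a better trade.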

Five brokers were in the study and six trades were randomly assigned to each broker (columns: broker; R = 6 replications per column).

SPSS Output Analyze>>General Linear Model>>Univariate…

Homogeneous Subsets

Conclusion: brokers 3 and 1 form one group; brokers 2, 4, and 5 form another.
Conclusion: 3, ???

Conclusion: Brokers 1 and 3 are not significantly different from each other, but they are significantly different from the other 3 brokers. Brokers 2 and 4 are not significantly different, and brokers 4 and 5 are not significantly different, but broker 2 is significantly different from (smaller than) broker 5.

Comparisons to Control: Dunnett's test. Designed specifically for (and incorporating the interdependencies of) comparing several “treatments” to a “control.” Example: the same layout as before (R = 6 replications per column), with one column designated as the CONTROL.

In our example (column 1 is the CONTROL):
 - Cols 4 and 5 differ from the control [ 1 ].
 - Cols 2 and 3 are not significantly different from the control.

Exercise: Sales Data

Exercise.
1. Find the ANOVA table.
2. Perform SNK tests at a = 5% to group treatments.
3. Perform Duncan tests at a = 5% to group treatments.
4. Which treatment would you use?

Post Hoc and A Priori Comparisons
 - F test for a linear combination of column means (contrast)
 - Scheffe test: tests all linear combinations at once. Very conservative; not to be used when only a few comparisons are of interest.
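The F statistic for a single contrast can be sketched as follows. This assumes the usual form F = (sum of c_j times level mean j)^2 / (MS_error times sum of c_j^2 / n_j), with 1 numerator degree of freedom; the function name `contrast_f` and the data are hypothetical, and no critical value is computed because only the standard library is used.

```python
from statistics import mean

def contrast_f(groups, coeffs):
    """F statistic (1 numerator df) for testing sum(c_j * mu_j) = 0."""
    if abs(sum(coeffs)) > 1e-12:
        raise ValueError("contrast coefficients must sum to zero")
    means = [mean(g) for g in groups]
    n_total = sum(len(g) for g in groups)
    # Error mean square: pooled within-level variance
    ss_error = sum((y - m) ** 2 for g, m in zip(groups, means) for y in g)
    ms_error = ss_error / (n_total - len(groups))
    estimate = sum(c * m for c, m in zip(coeffs, means))
    scale = sum(c ** 2 / len(g) for c, g in zip(coeffs, groups))
    return estimate ** 2 / (ms_error * scale)

# Hypothetical data: 3 levels with 3 replications each
data = [[1, 2, 3], [2, 3, 4], [6, 7, 8]]
f_12 = contrast_f(data, [1, -1, 0])        # level 1 vs. level 2
f_avg = contrast_f(data, [0.5, 0.5, -1])   # average of levels 1 and 2 vs. level 3
print(f_12, f_avg)
```

For a planned contrast, the statistic is compared with the F critical value on 1 and N - C degrees of freedom. The Scheffe procedure instead compares it with (C - 1) times the F critical value on C - 1 and N - C degrees of freedom, which is why it is conservative when only a few comparisons are made.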