Fundamentals of Data Analysis Lecture 8 ANOVA pt.2.

Slides:



Advertisements
Similar presentations
Multiple-choice question
Advertisements

Copyright 2004 David J. Lilja1 Comparing Two Alternatives Use confidence intervals for Before-and-after comparisons Noncorresponding measurements.
CHI-SQUARE(X2) DISTRIBUTION
Multiple Comparisons in Factorial Experiments
i) Two way ANOVA without replication
Hypothesis Testing IV Chi Square.
Analysis and Interpretation Inferential Statistics ANOVA
The Two Factor ANOVA © 2010 Pearson Prentice Hall. All rights reserved.
1 1 Slide © 2009, Econ-2030 Applied Statistics-Dr Tadesse Chapter 10: Comparisons Involving Means n Introduction to Analysis of Variance n Analysis of.
ANALYSIS OF VARIANCE.
PSY 307 – Statistics for the Behavioral Sciences
Chapter 9 - Lecture 2 Some more theory and alternative problem formats. (These are problem formats more likely to appear on exams. Most of your time in.
The Statistical Analysis Partitions the total variation in the data into components associated with sources of variation –For a Completely Randomized Design.
PSY 307 – Statistics for the Behavioral Sciences
Lecture 9: One Way ANOVA Between Subjects
ANOVA Single Factor Models Single Factor Models. ANOVA ANOVA (ANalysis Of VAriance) is a natural extension used to compare the means more than 2 populations.
Chapter 9 - Lecture 2 Computing the analysis of variance for simple experiments (single factor, unrelated groups experiments).
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Inferential Statistics
Chapter 12: Analysis of Variance
T Test for One Sample. Why use a t test? The sampling distribution of t represents the distribution that would be obtained if a value of t were calculated.
Fundamentals of Data Analysis Lecture 7 ANOVA. Program for today F Analysis of variance; F One factor design; F Many factors design; F Latin square scheme.
Statistical Analysis Statistical Analysis
Analysis of Variance Nutan S. Mishra Department of Mathematics and Statistics University of South Alabama.
1 1 Slide © 2006 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2005 Thomson/South-Western Chapter 13, Part A Analysis of Variance and Experimental Design n Introduction to Analysis of Variance n Analysis.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Comparing Three or More Means 13.
1 1 Slide Analysis of Variance Chapter 13 BA 303.
Experimental Design An Experimental Design is a plan for the assignment of the treatments to the plots in the experiment Designs differ primarily in the.
Today’s lesson Confidence intervals for the expected value of a random variable. Determining the sample size needed to have a specified probability of.
Analysis of Variance ( ANOVA )
Chapter 9 Section 2 Testing the Difference Between Two Means: t Test 1.
Fundamentals of Data Analysis Lecture 10 Management of data sets and improving the precision of measurement pt. 2.
1 Chapter 13 Analysis of Variance. 2 Chapter Outline  An introduction to experimental design and analysis of variance  Analysis of Variance and the.
Copyright © 2004 Pearson Education, Inc.
INTRODUCTION TO ANALYSIS OF VARIANCE (ANOVA). COURSE CONTENT WHAT IS ANOVA DIFFERENT TYPES OF ANOVA ANOVA THEORY WORKED EXAMPLE IN EXCEL –GENERATING THE.
Comparing Three or More Means ANOVA (One-Way Analysis of Variance)
Chapter 12 Analysis of Variance. An Overview We know how to test a hypothesis about two population means, but what if we have more than two? Example:
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Statistical Testing of Differences CHAPTER fifteen.
Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 14 Comparing Groups: Analysis of Variance Methods Section 14.3 Two-Way ANOVA.
CPE 619 One Factor Experiments Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama in.
Copyright © Cengage Learning. All rights reserved. 12 Analysis of Variance.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics S eventh Edition By Brase and Brase Prepared by: Lynn Smith.
ONE WAY ANALYSIS OF VARIANCE ANOVA o It is used to investigate the effect of one factor which occurs at h levels (≥3). Example: Suppose that we wish to.
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal
Hypothesis test flow chart frequency data Measurement scale number of variables 1 basic χ 2 test (19.5) Table I χ 2 test for independence (19.9) Table.
CHAPTER 12 ANALYSIS OF VARIANCE Prem Mann, Introductory Statistics, 7/E Copyright © 2010 John Wiley & Sons. All right reserved.
Analysis of Variance Analysis of Variance (AOV) was originally devised within the realm of agricultural statistics for testing the yields of various crops.
Chapter 10 Section 5 Chi-squared Test for a Variance or Standard Deviation.
 List the characteristics of the F distribution.  Conduct a test of hypothesis to determine whether the variances of two populations are equal.  Discuss.
DSCI 346 Yamasaki Lecture 4 ANalysis Of Variance.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Chapter 13 Analysis of Variance (ANOVA). ANOVA can be used to test for differences between three or more means. The hypotheses for an ANOVA are always:
Testing the Difference Between Two Means
Lecture Slides Elementary Statistics Twelfth Edition
i) Two way ANOVA without replication
Applied Business Statistics, 7th ed. by Ken Black
Comparing Three or More Means
CHAPTER 12 ANALYSIS OF VARIANCE
Statistics for Business and Economics (13e)
Econ 3790: Business and Economic Statistics
Chapter 13 Group Differences
Lecture Slides Elementary Statistics Twelfth Edition
Chapter 10 – Part II Analysis of Variance
Statistical Inference for the Mean: t-test
STATISTICS INFORMED DECISIONS USING DATA
Presentation transcript:

Fundamentals of Data Analysis Lecture 8 ANOVA pt.2

Multifactors design With that issue we are dealing eg. in the case of alloy hardness test, which consists of two metals A and B, and their contents in the alloy determines the hardness. We therefore divide our observations into r classes due to the characteristics of the value of A and p Classes due to the characteristics of the value of B. All observations are therefore divided into rp groups. On the example of two factors design

Multifactors design On the example of two factors design For this model, we verify the hypothesis : 1. of equality of mean values ​​ for all rp populations: H 0 : m ij = m dla i = 1,..., r; j = 1,..., p. 2.of equality of all the average values m i of studied features treated of A with r variants, excluding the impact of factor B: H 0 : m 1. =... = m r. dla i = 1,..., r. 3. of equality of all the average values m i of studied features treated of B with p variants, excluding the impact of factor A: H 0 : m.1 =... = m.p dla j = 1,..., p. 4. that the deviation of the mean value m ij from the total value of the average m is equal to the sum of effects of factor A and factor B : H 0 : m ij - m = (m i. - m) + (m.j - m).

Two factors design From three different departments of a university were drawn at l = 4 students from each year of study, and calculated the mean ratings obtained by each student in the last semester. The obtained results are shown in Table : Example

Two factors design Example Assuming that the average grades obtained by the students have a normal distributions with the same variance at the confidence level α= 95% verify the following hypotheses: a)the average values of average grades for students of different departments are the same; b)the average value of average grades for differernt years of study are the same; c)the average value of the average grades for the first two years are the same.

Two factors design In that case we have r = 3 (Departments) and p = 5 ( number of years of study ). After calculations we get the results shown in the table : Example

Two factors design Example Then compute the sum of squared deviations: q A = , for df = 2, then q A /df = q B = df = 4 q B /df = q AB = df = 8 q AB /df = q R = df = 45 q R /df = q = df = 59 F-statistics calculated on this basis, have the following values : F A = / = F B = / = For the third hipothesis we must calculate new mean value: x = 3.4 and q C = 0.24 (df = 1) q R = 7.95 (df = 22) F C = / =

Two factors design Example Critical values: F Acr = 3.20 > F A There is no reason to reject the null hypothesis F Bcr = 2.58 > F B There is no reason to reject the null hypothesis F Ccr = 4.30 > F C There is no reason to reject the null hypothesis

Two factors design Exercise Each of the three varieties of potatoes was cultivated on 12 parcels of the same size and type. Parcels were divided into four groups of three parcels, and for each group a different type of fertilizer was used. Yields for these plots are shown in Table At the confidence level 95% verify the following hypotheses: A)The values ​​ of the average yield for the different varieties of potatoes are dependent on the applied fertilizer B)The values ​​ of the average yields for the different fertilizers do not differ regardless of potato variety Potato variety Type of fertilizer , 6.1, 5,9 5.7, 4.9, , 6.1, , 6.7, , 6.7, , 6.4, , 7.3, , 7.1, , 6.6, , 6.4, , 6.7, , 6.1, 6.0

Latin square design During Latin sQuare design (LQ) experimental items are classified in terms of the classification of three directions: rows, columns, and objects. Experimental factor A presented at p levels (contains p objects), each of which occurs exactly once in the corresponding row and column.

Latin square design 1. We calculate the correction factor: 2. We calculate the sum of squares for rows

Latin square design and the mean square for rows 3. We calculate the sum of squares for columns

Latin square design and the mean square for columns 4. calculate the sum of squares for the factors

Latin square design and the mean square for factors 5. calculate the total sum of squares

Latin square design 6. and the residual sum of squares SSE = SS - SSR - SSC - SST 7. mean square residual

Latin square design 8. and calculate statistics

Latin square design Example In the experiment, on the fertilization of fields used are the following factors (fertilizers): A - (NH 4 ) 2 SO 4, B - NH 4 NO 3, C - CO(NH 2 ) 2, D - Ca(NO 3 ) 2, E - NaNO 3, F - NoN (non-fertilized). Fertilizers are used in equal doses (in g/m 2 ). In the first stage the draw was performed of suitable Latin square 6x6 (since we have 6 factors) and the result is shown in table:

Latin square design Example The results of the experiment as planned (achieved yields of sugar beet) are presented in Table

Latin square design Example Harvests for different fertilizers are presented in Table.

Latin square design Example Degree of freedom: totaldf tot = pr - 1 = 35 for rowsdf row = r - 1 = 5 for columnsdf col = p - 1 = 5 for factorsdf tr = n - 1 = 5 for error df error = (r-1)(p-1) - (n-1) = = 20.

Latin square design Example

Latin square design Example A further step in the analysis of our data is the separation of variables (averages). Based on the result of our experiment, we can answer a number of questions: 1) Whether fertilization can effect on crop growth (excreted factor F)? 2) Is organic fertilizer better than inorganic? 3) Is NH 4 -N better than NO 3 -N ? 4) Is (NH 4 ) 2 SO 4 better than NH 4 NO 3 ? 5) Is Ca(NO 3 ) better than NaNO 3 ? Such questions may of course be more, depending on the factors or groups of factors we want to compare.

Latin square design Example The results after the separation of values

Latin square design Example The results after the separation of values

Latin square design Example Only when comparing the results for the test "without fertilizer - with fertilizer" calculated value is greater than the critical value, so there is no reason to reject this hypothesis. In other cases, the choice of the source of deviation is negligible.

Thanks for attention! Books: W. Wagner, P. Błażczak, Statystyka matematyczna z elementami doświadczalnictwa, cz. 2, AR, Poznań T. M. Little, F.J. Hills, Agricultural experimentation. Design and analysis, Wiley and Sons, New York, 1987.