Probability & Statistical Inference Lecture 8 MSc in Computing (Data Analytics)



Introduction
• In the previous lecture we analysed data by comparing the means of two samples.
• Data frequently contain more than two samples, for example when several treatments are compared.
• In this lecture we introduce a statistical analysis that allows us to compare the means of more than two samples. The method is called 'Analysis of Variance', or ANOVA for short.

Total Sum of Squares
Data set: 14, 12, 10, 6, 4, 2
Group A: 6, 4, 2
Group B: 14, 12, 10
Overall mean: 8
Total sum of squares: $SS_T = (14-8)^2 + (12-8)^2 + (10-8)^2 + (6-8)^2 + (4-8)^2 + (2-8)^2 = 112$

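Between Group Variation
• Sum of squares of the model: $SS_M = n_a(\mu - \mu_a)^2 + n_b(\mu - \mu_b)^2 = 3(8-4)^2 + 3(8-12)^2 = 96$, where $\mu$ is the overall mean, $\mu_a$ and $\mu_b$ are the means of groups A and B, and $n_a$ and $n_b$ are the group sizes.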

Within Group Variation
• Sum of squares of the error: $SS_E = (14-12)^2 + (12-12)^2 + (10-12)^2 + (6-4)^2 + (4-4)^2 + (2-4)^2 = 16$, where each observation is compared with the mean of its own group (12 for group B, 4 for group A).
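The three sums of squares for the six-point example can be checked with a short Python sketch. The variable names (`groups`, `ss_total`, `ss_model`, `ss_error`) are illustrative choices, not taken from the slides.

```python
# Toy data from the slides: two groups of three observations each.
groups = {"A": [6, 4, 2], "B": [14, 12, 10]}


def mean(values):
    """Arithmetic mean of a list of numbers."""
    return sum(values) / len(values)


all_obs = [x for obs in groups.values() for x in obs]
grand_mean = mean(all_obs)

# Total variation: squared deviation of every observation from the overall mean.
ss_total = sum((x - grand_mean) ** 2 for x in all_obs)

# Between-group (model) variation: group size times the squared deviation
# of each group mean from the overall mean.
ss_model = sum(len(obs) * (mean(obs) - grand_mean) ** 2 for obs in groups.values())

# Within-group (error) variation: squared deviation of each observation
# from the mean of its own group.
ss_error = sum((x - mean(obs)) ** 2 for obs in groups.values() for x in obs)

print(grand_mean, ss_total, ss_model, ss_error)  # 8.0 112.0 96.0 16.0
```

The model and error sums of squares add up to the total (96 + 16 = 112), which is the partition that ANOVA relies on.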

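Structure of the Data
Group   Observations                         Total           Mean
1       $x_{11}, x_{12}, \dots, x_{1n}$      $x_{1\cdot}$    $\bar{x}_{1}$
2       $x_{21}, x_{22}, \dots, x_{2n}$      $x_{2\cdot}$    $\bar{x}_{2}$
...
a       $x_{a1}, x_{a2}, \dots, x_{an}$      $x_{a\cdot}$    $\bar{x}_{a}$
Total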

ANOVA Table
Source   Degrees of Freedom   Sum of Squares   Mean Square          F-Stat
Model    $a - 1$              $SS_M$           $SS_M / (a - 1)$     $MS_M / MS_E$
Error    $n - a$              $SS_E$           $SS_E / (n - a)$
Total    $n - 1$              $SS_T$           $SS_T / (n - 1)$
Where: n is the sample size and a is the number of groups.

ANOVA Table – Original Example
Source   Degrees of Freedom   Sum of Squares   Mean Square   F-Stat
Model    2 - 1 = 1            96               96            24
Error    6 - 2 = 4            16               4
Total    6 - 1 = 5            112              22.4
Where: n is the sample size and a is the number of groups.
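As a rough sketch of how the ANOVA table is assembled from raw data, the function below follows the degrees-of-freedom and mean-square formulas from the general table. The function name `one_way_anova` and the dictionary layout are hypothetical, chosen only for illustration.

```python
def one_way_anova(groups):
    """Return the rows of a one-way ANOVA table for a list of samples."""
    n = sum(len(g) for g in groups)  # total sample size
    a = len(groups)                  # number of groups
    grand_mean = sum(sum(g) for g in groups) / n
    group_means = [sum(g) / len(g) for g in groups]

    # Between-group and within-group sums of squares.
    ss_model = sum(
        len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, group_means)
    )
    ss_error = sum(
        (x - m) ** 2 for g, m in zip(groups, group_means) for x in g
    )

    # Mean squares are sums of squares divided by their degrees of freedom.
    ms_model = ss_model / (a - 1)
    ms_error = ss_error / (n - a)

    return {
        "model": {"df": a - 1, "ss": ss_model, "ms": ms_model,
                  "f": ms_model / ms_error},
        "error": {"df": n - a, "ss": ss_error, "ms": ms_error},
        "total": {"df": n - 1, "ss": ss_model + ss_error},
    }


table = one_way_anova([[6, 4, 2], [14, 12, 10]])
for source, row in table.items():
    print(source, row)
```

For the two-group example this gives an F statistic of 96 / 4 = 24 on 1 and 4 degrees of freedom. Converting F to a p-value needs the F distribution, which the sketch leaves to a statistics package.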

Model Assumptions
• Independence of observations within and between samples
• Normality of the sampling distribution
• Equal variance, also called the homoscedasticity assumption

The ANOVA Equation
• We can describe the observations in the above table using the following equation:
Where: n is the sample size and a is the number of groups
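The equation itself did not survive in this transcript. A standard way to write the single-factor (one-way) effects model, assumed here to be the form the slide intended, is:

```latex
x_{ij} = \mu + \tau_i + \varepsilon_{ij},
\qquad i = 1, \dots, a, \quad j = 1, \dots, n_i
```

In this sketch $\mu$ is the overall mean, $\tau_i$ is the effect of group $i$, $\varepsilon_{ij}$ is the random error of observation $j$ in group $i$, and $n_i$ is the number of observations in group $i$. The symbols $\tau_i$, $\varepsilon_{ij}$ and $n_i$ are assumptions, since they do not appear in the surviving text.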

ANOVA Hypotheses
We wish to test the hypotheses:
The analysis of variance partitions the total variability into two parts: the variation between groups (the model) and the variation within groups (the error).
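The hypotheses are not reproduced in the transcript. Written in terms of the group means $\mu_1, \dots, \mu_a$ (notation assumed here), the usual one-way ANOVA hypotheses and the partition of variability are:

```latex
H_0 : \mu_1 = \mu_2 = \dots = \mu_a
\qquad \text{versus} \qquad
H_1 : \mu_i \neq \mu_j \ \text{for at least one pair } (i, j)

SS_T = SS_M + SS_E
```

For the six-point example from earlier in the lecture, the partition reads 112 = 96 + 16.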

Example

Graphical Display of Data Figure 13-1 (a) Box plots of hardwood concentration data. (b) Display of the model in Equation 13-1 for the completely randomized single-factor experiment

Example
• We can use ANOVA to test the hypothesis that different hardwood concentrations do not affect the mean tensile strength of the paper. The hypotheses are:
• The ANOVA table is below:

Example
• The p-value is less than 0.05, so $H_0$ can be rejected at the 5% significance level. We conclude that at least one of the hardwood concentrations affects the mean tensile strength of the paper.

Demo

Confidence Interval on the Mean
For 20% hardwood, the resulting confidence interval on the mean is

Confidence Interval on the Difference of Two Treatments
For the hardwood concentration example,
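The interval formulas and the hardwood data are not reproduced in the transcript, so the sketch below uses the standard textbook forms, $\bar{x}_i \pm t\sqrt{MS_E / n_i}$ for one treatment mean and $(\bar{x}_i - \bar{x}_j) \pm t\sqrt{MS_E(1/n_i + 1/n_j)}$ for a difference, on the six-point example from earlier in the lecture. The function names are hypothetical, and the critical value 2.776 (two-sided 95%, 4 error degrees of freedom) is an assumed input read from a t table.

```python
import math


def ci_for_group_mean(group_mean, n_i, ms_error, t_crit):
    """Interval for one treatment mean: mean +/- t * sqrt(MS_E / n_i)."""
    half_width = t_crit * math.sqrt(ms_error / n_i)
    return group_mean - half_width, group_mean + half_width


def ci_for_difference(mean_i, mean_j, n_i, n_j, ms_error, t_crit):
    """Interval for a difference of two treatment means."""
    half_width = t_crit * math.sqrt(ms_error * (1 / n_i + 1 / n_j))
    diff = mean_i - mean_j
    return diff - half_width, diff + half_width


# Six-point example: MS_E = 16 / 4 = 4, group means 4 (A) and 12 (B),
# three observations per group.
T_CRIT = 2.776  # assumed: two-sided 95% t value for 4 degrees of freedom

mean_ci = ci_for_group_mean(12, 3, 4.0, T_CRIT)
diff_ci = ci_for_difference(12, 4, 3, 3, 4.0, T_CRIT)
print(mean_ci)
print(diff_ci)
```

Both intervals use the pooled error mean square $MS_E$ rather than each group's own variance, which is why the equal-variance assumption matters.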

An Unbalanced Experiment

Multiple Comparisons Following the ANOVA
• The least significant difference (LSD) is
If the sample sizes are different in each treatment:
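The LSD formulas are missing from the transcript. A minimal sketch, assuming the usual Fisher LSD form $t\sqrt{MS_E(1/n_i + 1/n_j)}$ (which reduces to $t\sqrt{2\,MS_E/n}$ when every treatment has $n$ observations), could look like this. The function names and the t critical value are illustrative assumptions, and the data are the six-point example from earlier in the lecture.

```python
import math
from itertools import combinations


def lsd(ms_error, n_i, n_j, t_crit):
    """Least significant difference for two treatments of sizes n_i and n_j."""
    return t_crit * math.sqrt(ms_error * (1 / n_i + 1 / n_j))


def lsd_comparisons(means, sizes, ms_error, t_crit):
    """Flag each pair whose absolute mean difference exceeds its LSD."""
    results = {}
    for i, j in combinations(sorted(means), 2):
        threshold = lsd(ms_error, sizes[i], sizes[j], t_crit)
        results[(i, j)] = abs(means[i] - means[j]) > threshold
    return results


# Six-point example: group means 4 and 12, three observations each, MS_E = 4.
T_CRIT = 2.776  # assumed: two-sided 95% t value for 4 degrees of freedom
means = {"A": 4.0, "B": 12.0}
sizes = {"A": 3, "B": 3}

threshold_ab = lsd(4.0, sizes["A"], sizes["B"], T_CRIT)
flags = lsd_comparisons(means, sizes, 4.0, T_CRIT)
print(threshold_ab, flags)
```

Because the threshold is computed per pair from the two group sizes, the same function covers both the balanced and the unbalanced case.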

Example: Multi-comparison Test