Essentials of Modern Business Statistics (7e)

Slides:



Advertisements
Similar presentations
Request Dispatching for Cheap Energy Prices in Cloud Data Centers
Advertisements

SpringerLink Training Kit
Luminosity measurements at Hadron Colliders
From Word Embeddings To Document Distances
Choosing a Dental Plan Student Name
Virtual Environments and Computer Graphics
Chương 1: CÁC PHƯƠNG THỨC GIAO DỊCH TRÊN THỊ TRƯỜNG THẾ GIỚI
THỰC TIỄN KINH DOANH TRONG CỘNG ĐỒNG KINH TẾ ASEAN –
D. Phát triển thương hiệu
NHỮNG VẤN ĐỀ NỔI BẬT CỦA NỀN KINH TẾ VIỆT NAM GIAI ĐOẠN
Điều trị chống huyết khối trong tai biến mạch máu não
BÖnh Parkinson PGS.TS.BS NGUYỄN TRỌNG HƯNG BỆNH VIỆN LÃO KHOA TRUNG ƯƠNG TRƯỜNG ĐẠI HỌC Y HÀ NỘI Bác Ninh 2013.
Nasal Cannula X particulate mask
Evolving Architecture for Beyond the Standard Model
HF NOISE FILTERS PERFORMANCE
Electronics for Pedestrians – Passive Components –
Parameterization of Tabulated BRDFs Ian Mallett (me), Cem Yuksel
L-Systems and Affine Transformations
CMSC423: Bioinformatic Algorithms, Databases and Tools
Some aspect concerning the LMDZ dynamical core and its use
Bayesian Confidence Limits and Intervals
实习总结 (Internship Summary)
Current State of Japanese Economy under Negative Interest Rate and Proposed Remedies Naoyuki Yoshino Dean Asian Development Bank Institute Professor Emeritus,
Front End Electronics for SOI Monolithic Pixel Sensor
Face Recognition Monday, February 1, 2016.
Solving Rubik's Cube By: Etai Nativ.
CS284 Paper Presentation Arpad Kovacs
انتقال حرارت 2 خانم خسرویار.
Summer Student Program First results
Theoretical Results on Neutrinos
HERMESでのHard Exclusive生成過程による 核子内クォーク全角運動量についての研究
Wavelet Coherence & Cross-Wavelet Transform
yaSpMV: Yet Another SpMV Framework on GPUs
Creating Synthetic Microdata for Higher Educational Use in Japan: Reproduction of Distribution Type based on the Descriptive Statistics Kiyomi Shirakawa.
MOCLA02 Design of a Compact L-­band Transverse Deflecting Cavity with Arbitrary Polarizations for the SACLA Injector Sep. 14th, 2015 H. Maesaka, T. Asaka,
Hui Wang†*, Canturk Isci‡, Lavanya Subramanian*,
Fuel cell development program for electric vehicle
Overview of TST-2 Experiment
Optomechanics with atoms
داده کاوی سئوالات نمونه
Inter-system biases estimation in multi-GNSS relative positioning with GPS and Galileo Cecile Deprez and Rene Warnant University of Liege, Belgium  
ლექცია 4 - ფული და ინფლაცია
10. predavanje Novac i financijski sustav
Wissenschaftliche Aussprache zur Dissertation
FLUORECENCE MICROSCOPY SUPERRESOLUTION BLINK MICROSCOPY ON THE BASIS OF ENGINEERED DARK STATES* *Christian Steinhauer, Carsten Forthmann, Jan Vogelsang,
Particle acceleration during the gamma-ray flares of the Crab Nebular
Interpretations of the Derivative Gottfried Wilhelm Leibniz
Advisor: Chiuyuan Chen Student: Shao-Chun Lin
Widow Rockfish Assessment
SiW-ECAL Beam Test 2015 Kick-Off meeting
On Robust Neighbor Discovery in Mobile Wireless Networks
Chapter 6 并发:死锁和饥饿 Operating Systems: Internals and Design Principles
You NEED your book!!! Frequency Distribution
Y V =0 a V =V0 x b b V =0 z
Fairness-oriented Scheduling Support for Multicore Systems
Climate-Energy-Policy Interaction
Hui Wang†*, Canturk Isci‡, Lavanya Subramanian*,
Ch48 Statistics by Chtan FYHSKulai
The ABCD matrix for parabolic reflectors and its application to astigmatism free four-mirror cavities.
Measure Twice and Cut Once: Robust Dynamic Voltage Scaling for FPGAs
Online Learning: An Introduction
Factor Based Index of Systemic Stress (FISS)
What is Chemistry? Chemistry is: the study of matter & the changes it undergoes Composition Structure Properties Energy changes.
THE BERRY PHASE OF A BOGOLIUBOV QUASIPARTICLE IN AN ABRIKOSOV VORTEX*
Quantum-classical transition in optical twin beams and experimental applications to quantum metrology Ivano Ruo-Berchera Frascati.
The Toroidal Sporadic Source: Understanding Temporal Variations
FW 3.4: More Circle Practice
ارائه یک روش حل مبتنی بر استراتژی های تکاملی گروه بندی برای حل مسئله بسته بندی اقلام در ظروف
Decision Procedures Christoph M. Wintersteiger 9/11/2017 3:14 PM
Limits on Anomalous WWγ and WWZ Couplings from DØ
Presentation transcript:

Essentials of Modern Business Statistics (7e) Anderson, Sweeney, Williams, Camm, Cochran © 2018 Cengage Learning

Chapter 13, Part A Experimental Design and Analysis of Variance An Introduction to Experimental Design and Analysis of Variance Analysis of Variance and the Completely Randomized Design Multiple Comparison Procedures

An Introduction to Experimental Design and Analysis of Variance Statistical studies can be classified as being either experimental or observational. In an experimental study, one or more factors are controlled so that data can be obtained about how the factors influence the variables of interest. In an observational study, no attempt is made to control the factors. Cause-and-effect relationships are easier to establish in experimental studies than in observational studies. Analysis of variance (ANOVA) can be used to analyze the data obtained from experimental or observational studies.

An Introduction to Experimental Design and Analysis of Variance In this chapter three types of experimental designs are introduced. a completely randomized design a randomized block design a factorial experiment

An Introduction to Experimental Design and Analysis of Variance A factor is a variable that the experimenter has selected for investigation. A treatment is a level of a factor. Experimental units are the objects of interest in the experiment. A completely randomized design is an experimental design in which the treatments are randomly assigned to the experimental units.

Analysis of Variance: A Conceptual Overview Analysis of Variance (ANOVA) can be used to test for the equality of three or more population means. Data obtained from observational or experimental studies can be used for the analysis. We want to use the sample results to test the following hypotheses: H0: 1 = 2 = 3 = . . . = k Ha: Not all population means are equal

Analysis of Variance: A Conceptual Overview If H0 is rejected, we cannot conclude that all population means are different. Rejecting H0 means that at least two population means have different values.

Analysis of Variance: A Conceptual Overview Assumptions for Analysis of Variance For each population, the response (dependent) variable is normally distributed. The variance of the response variable, denoted  2, is the same for all of the populations. The observations must be independent.

Analysis of Variance: A Conceptual Overview Sampling Distribution of 𝑥 , Given H0 is True

Analysis of Variance: A Conceptual Overview Sampling Distribution of 𝑥 , Given H0 is False

Analysis of Variance and the Completely Randomized Design Between-Treatments Estimate of Population Variance Within-Treatments Estimate of Population Variance Comparing the Variance Estimates: The F Test ANOVA Table

Between-Treatments Estimate of Population Variance s 2 The estimate of  2 based on the variation of the sample means is called the mean square due to treatments and is denoted by MSTR. MSTR= 𝑗=1 𝑘 𝑛 𝑗 𝑥 𝑗 − 𝑥 2 𝑘−1 Numerator is called the sum of squares due to treatments (SSTR). Denominator is the degrees of freedom associated with SSTR.

Within-Treatments Estimate of Population Variance s 2 The estimate of  2 based on the variation of the sample observations within each sample is called the mean square error and is denoted by MSE. MSE= 𝑗=1 𝑘 (𝑛 𝑗 −1) 𝑠 𝑗 2 𝑛 𝑇 −𝑘 Numerator is called the sum of squares due to error (SSE). Denominator is the degrees of freedom associated with SSE.

Comparing the Variance Estimates: The F Test If the null hypothesis is true and the ANOVA assumptions are valid, the sampling distribution of MSTR/MSE is an F distribution with MSTR d.f. equal to k - 1 and MSE d.f. equal to nT - k. If the means of the k populations are not equal, the value of MSTR/MSE will be inflated because MSTR overestimates  2. Hence, we will reject H0 if the resulting value of MSTR/MSE appears to be too large to have been selected at random from the appropriate F distribution.

Comparing the Variance Estimates: The F Test Sampling Distribution of MSTR/MSE Reject H0 Do Not Reject H0 a MSTR/MSE F Critical Value

ANOVA Table for a Completely Randomized Design Source of Variation Sum of Squares Degrees of Freedom Mean Square F p- Value Treatments SSTR k - 1 MSTR= SSTR 𝑘−1 MSTR MSE Error SSE nT - k MSE= SSE 𝑛 𝑇 −𝑘 Total SST nT - 1 SST is partitioned into SSTR and SSE. SST’s degrees of freedom (d.f.) are partitioned into SSTR’s d.f. and SSE’s d.f.

ANOVA Table for a Completely Randomized Design SST divided by its degrees of freedom nT – 1 is the overall sample variance that would be obtained if we treated the entire set of observations as one data set. With the entire data set as one sample, the formula for computing the total sum of squares, SST, is: SST= 𝑗=1 𝑘 𝑖=1 𝑛 𝑗 𝑥 𝑖𝑗 − 𝑥 2 = SSTR+SSE

ANOVA Table for a Completely Randomized Design ANOVA can be viewed as the process of partitioning the total sum of squares and the degrees of freedom into their corresponding sources: treatments and error. Dividing the sum of squares by the appropriate degrees of freedom provides the variance estimates, the F value and the p- value used to test the hypothesis of equal population means.

Test for the Equality of k Population Means Hypotheses H0: 1 = 2 = 3 = . . . = k Ha: Not all population means are equal Test Statistic F = MSTR/MSE

Test for the Equality of k Population Means Rejection Rule p-value Approach: Reject H0 if p-value < a Critical Value Approach: Reject H0 if F > Fa where the value of F is based on an F distribution with k - 1 numerator d.f. and nT - k denominator d.f.

Testing for the Equality of k Population Means: A Completely Randomized Design Example: Chemitech experiment Chemitech developed a new filtration system for municipal water supplies. There are different methods that can be used to assemble the system. Chemitech has narrowed down to three methods – A, B and C and wants to determine which assembly method can produce the greatest number of filtration systems per week.

Testing for the Equality of k Population Means: A Completely Randomized Design Example: Chemitech experiment For the purpose, it selected 15 workers and then randomly assigned each of the three treatments to 5 of the workers. Factor – Assembly method Treatments – A, B and C Experimental units – Employees Response units – Number of units produced

Sample standard deviation Testing for the Equality of k Population Means: A Completely Randomized Design Observation A B C 1 58 48 2 64 69 57 3 55 71 59 4 66 47 5 67 68 49 Sample mean 62 52 Sample variance 27.5 26.5 31.0 Sample standard deviation 5.244 5.148 5.568

Testing for the Equality of k Population Means: A Completely Randomized Design Example: Chemitech experiment Hypotheses H0: 1 = 2 = 3 Ha: Not all the means are equal where: 1 = mean number of units produced per week using method A 2 = mean number of units produced per week using method B 3 = mean number of units produced per week using method C

Testing for the Equality of k Population Means: A Completely Randomized Design Example: Chemitech experiment Mean Square Between Treatments Because the sample sizes are all equal: 𝑥 = 𝑥 1 + 𝑥 2 + 𝑥 3 /3 = (62+ 66+ 52 )/3 = 60 SSTR = 5(62 - 60)2 + 5(66 - 60)2 + 5(52 - 60)2 = 520 MSTR = 520/(3 - 1) = 260 Mean Square Error SSE = 4(27.5) + 4(26.5) + 4(31) = 340 MSE = 340/(15 - 3) = 28.33

Testing for the Equality of k Population Means: A Completely Randomized Design Example: Chemitech experiment Rejection Rule p-Value Approach: Reject H0 if p-value < .05 Critical Value Approach: Reject H0 if F > 3.89 where F.05 = 3.89 is based on an F distribution with 2 numerator degrees of freedom and 12 denominator degrees of freedom

Testing for the Equality of k Population Means: A Completely Randomized Design Example: Chemitech experiment Test Statistic F = MSTR/MSE = 260/28.33 = 9.18 Conclusion The p-value is less than .01, where F = 9.18. (Excel provides a p-value of .004.) Therefore, we reject H0. There is sufficient evidence to conclude that the means of three populations are not equal.

ANOVA Table for a Completely Randomized Design Example: Chemitech experiment Source of Variation Sum of Squares Degrees of Freedom Mean Square F p- Value Treatments 520 2 260.00 9.18 .004 Error 340 12 28.33 Total 860 14

Excel’s ANOVA : Single Factor Tool Step 1: Click the Data tab on the Ribbon Step 2: In the Analysis group, click Data Analysis Step 3: Choose Anova: Single Factor from the list of Analysis Tools Step 4: When the Anova: Single Factor dialog box appears: (see details on next slide)

Excel’s ANOVA : Single Factor Tool

Excel’s ANOVA : Single Factor Tool Summary and output data

Testing for the Equality of k Population Means: An Observational Study Example: National Computer Products, Inc. (NCP) NCP manufactures printers and fax machines at plants located in Atlanta, Dallas and Seattle. To measure how much employees at each plants know about quality management, a random sample of 6 employees was selected from each plant and the employees selected were given quality awareness examination.

Testing for the Equality of k Population Means: An Observational Study Example: National Computer Products, Inc. (NCP) Managers want to know if there is significant difference in the mean examination score for the employees at three plants. An F test will be conducted using a = .05.

Testing for the Equality of k Population Means: An Observational Study Example: National Computer Products, Inc. (NCP) A simple random sample of 6 employees from each of the three plants was taken and their examination score was tabulated. Factor – Quality awareness Treatments – Atlanta, Dallas, Seattle Experimental units – Employees Response units – Examination score

Testing for the Equality of k Population Means: An Observational Study Atlanta Dallas Seattle 1 85 71 59 2 75 64 3 82 73 62 4 76 74 69 5 6 67 Sample mean 79 66 Sample variance 34 20 32 Sample standard deviation 5.83 4.47 5.66

Testing for the Equality of k Population Means: An Observational Study Example: National Computer Products, Inc. (NCP) p -Value and Critical Value Approaches Develop the hypotheses. H0: 1 = 2 = 3 Ha: Not all the means are equal where:  1 = mean examination score for population 1  2 = mean examination score for population 2  3 = mean examination score for population 3

Testing for the Equality of k Population Means: An Observational Study Example: National Computer Products, Inc. (NCP) p -Value and Critical Value Approaches Specify the level of significance. a = .05 Compute the value of the test statistic. Mean Square Due to Treatments (Sample sizes are all equal.) 𝑥 = (79 + 74 +66)/3 = 73 SSTR = 6(79 - 73)2 + 6(74 - 73)2 + 6(66-73)2 = 516 MSTR = 516/(3 - 1) = 258

Testing for the Equality of k Population Means: An Observational Study Example: National Computer Products, Inc. (NCP) p -Value and Critical Value Approaches 4. Compute the value of the test statistic. Mean Square Due to Error SSE = 5(34.0) + 5(20.0) + 5(32.0) = 430 MSE = 430/(18-3) = 28.667 F = MSTR/MSE = 430/28.667 = 15

Testing for the Equality of k Population Means: An Observational Study Example: National Computer Products, Inc. (NCP) Source of Variation Sum of squares Degrees of freedom Mean square F p - value Treatments 516 2 258 15 Error 430 28.667 Total 846 17

Testing for the Equality of k Population Means: An Observational Study Example: National Computer Products, Inc. (NCP) Rejection Rule p-Value Approach: Reject H0 if p-value < .05 Critical Value Approach: Reject H0 if F > 3.68 where F.05 = 3.68 is based on an F distribution with 2 numerator degrees of freedom and 15 denominator degrees of freedom

Testing for the Equality of k Population Means: An Observational Study Example: National Computer Products, Inc. (NCP) p –Value Approach Compute the p –value. With 2 numerator d.f. and 15 denominator d.f., the p-value is .01 for F = 6.36. Therefore, the p-value is less than .01 for F = 15. Determine whether to reject H0. The p-value < .05, so we reject H0.

Testing for the Equality of k Population Means: An Observational Study Example: National Computer Products, Inc. (NCP) Critical Value Approach 4. Determine the critical value and rejection rule. Based on an F distribution with 2 numerator d.f. and 15 denominator d.f., F.05 = 3.68, Reject H0 if F > 3.68 Determine whether to reject H0. Because F = 15 > 3.68, we reject H0. We can conclude that the mean examination scores of employees working at different plants is not the same.

Multiple Comparison Procedures Suppose that analysis of variance has provided statistical evidence to reject the null hypothesis of equal population means. Fisher’s least significant difference (LSD) procedure can be used to determine where the differences occur.

Fisher’s LSD Procedure Hypotheses H0: mi  mj Ha: mi ≠ mj Test Statistic 𝑡= 𝑥 𝑖 − 𝑥 𝑗 MSE 1 𝑛 𝑖 + 1 𝑛 𝑗

Fisher’s LSD Procedure Rejection Rule p-value Approach: Reject H0 if p-value < a Critical Value Approach: Reject H0 if t ≤ -ta/2 or t ≥ ta/2 where the value of ta/2 is based on a t distribution with nT - k degrees of freedom.

Fisher’s LSD Procedure Based on the Test Statistic 𝑥 𝑖 − 𝑥 𝑗 Hypotheses H0: mi  mj Ha: mi ≠ mj Test Statistic 𝑥 𝑖 − 𝑥 𝑗 Rejection Rule Reject H0 if 𝑥 𝑖 − 𝑥 𝑗 ≥ LSD Where LSD= 𝑡 𝛼/2 MSE 1 𝑛 𝑖 + 1 𝑛 𝑗

Fisher’s LSD Procedure Based on the Test Statistic 𝑥 𝑖 − 𝑥 𝑗 Example: Chemitech Experiment Recall the Chemitech experiment where the managers wanted to know if there is difference in mean number of units produced per week when different assembly methods are used. Analysis of variance has provided statistical evidence to reject the null hypothesis of equal population means. Fisher’s least significant difference (LSD) procedure can be used to determine where the differences occur.

Fisher’s LSD Procedure Based on the Test Statistic 𝑥 𝑖 − 𝑥 𝑗 Example: Chemitech Experiment For  = .05 and nT - k = 15 – 3 = 12 degrees of freedom, t.025 = 2.179 LSD= 𝑡 𝛼/2 MSE 1 𝑛 𝑖 + 1 𝑛 𝑗 LSD=2.179 28.33 1 5 + 1 5 =7.34

Fisher’s LSD Procedure Based on the Test Statistic 𝑥 𝑖 − 𝑥 𝑗 LSD for Method A and Method B Hypotheses (A) H0: m1 = m2 Ha: m1 ≠ m2 Rejection Rule Reject H0 if 𝑥 1 − 𝑥 2 ≥ 7.34 Test Statistic 𝑥 1 − 𝑥 2 = |62 - 66| = 4 Conclusion: The mean number of units produced per week for method A is equal to the population mean for method B.

Fisher’s LSD Procedure Based on the Test Statistic 𝑥 𝑖 − 𝑥 𝑗 LSD for Method B and Method C Hypotheses (B) H0: m1 = m2 Ha: m1 ≠ m2 Rejection Rule Reject H0 if 𝑥 1 − 𝑥 2 > 7.34 Test Statistic 𝑥 1 − 𝑥 2 = |66 - 52| = 14 Conclusion: The mean number of units produced per week for method B is not equal to the population mean for method C.

Fisher’s LSD Procedure Based on the Test Statistic 𝑥 𝑖 − 𝑥 𝑗 LSD for Method A and Method C Hypotheses (C) H0: m1 = m2 Ha: m1 ≠ m2 Rejection Rule Reject H0 if 𝑥 1 − 𝑥 2 > 7.34 Test Statistic 𝑥 1 − 𝑥 2 = |62 - 52| = 10 Conclusion: The mean number of units produced per week for method A is not equal to the population mean for method C. So A and B both differ from method C.

Type I Error Rates The comparison-wise Type I error rate a indicates the level of significance associated with a single pairwise comparison. The experiment-wise Type I error rate aEW is the probability of making a Type I error on at least one of the (k – 1)! pairwise comparisons. aEW = 1 – (1 – a)(k – 1)! The experiment-wise Type I error rate gets larger for problems with more populations (larger k).

End of Chapter 13, Part A