5-5 Inference on the Ratio of Variances of Two Normal Populations

Slides:

Advertisements

Similar presentations

Request Dispatching for Cheap Energy Prices in Cloud Data Centers

Advertisements

SpringerLink Training Kit

Luminosity measurements at Hadron Colliders

From Word Embeddings To Document Distances

Choosing a Dental Plan Student Name

Virtual Environments and Computer Graphics

Chương 1: CÁC PHƯƠNG THỨC GIAO DỊCH TRÊN THỊ TRƯỜNG THẾ GIỚI

THỰC TIỄN KINH DOANH TRONG CỘNG ĐỒNG KINH TẾ ASEAN –

D. Phát triển thương hiệu

NHỮNG VẤN ĐỀ NỔI BẬT CỦA NỀN KINH TẾ VIỆT NAM GIAI ĐOẠN

Điều trị chống huyết khối trong tai biến mạch máu não

BÖnh Parkinson PGS.TS.BS NGUYỄN TRỌNG HƯNG BỆNH VIỆN LÃO KHOA TRUNG ƯƠNG TRƯỜNG ĐẠI HỌC Y HÀ NỘI Bác Ninh 2013.

Nasal Cannula X particulate mask

Evolving Architecture for Beyond the Standard Model

HF NOISE FILTERS PERFORMANCE

Electronics for Pedestrians – Passive Components –

Parameterization of Tabulated BRDFs Ian Mallett (me), Cem Yuksel

L-Systems and Affine Transformations

CMSC423: Bioinformatic Algorithms, Databases and Tools

Some aspect concerning the LMDZ dynamical core and its use

Bayesian Confidence Limits and Intervals

实习总结（Internship Summary)

Current State of Japanese Economy under Negative Interest Rate and Proposed Remedies Naoyuki Yoshino Dean Asian Development Bank Institute Professor Emeritus,

Front End Electronics for SOI Monolithic Pixel Sensor

Face Recognition Monday, February 1, 2016.

Solving Rubik's Cube By: Etai Nativ.

CS284 Paper Presentation Arpad Kovacs

انتقال حرارت 2 خانم خسرویار.

Summer Student Program First results

Theoretical Results on Neutrinos

HERMESでのHard Exclusive生成過程による核子内クォーク全角運動量についての研究

Wavelet Coherence & Cross-Wavelet Transform

yaSpMV: Yet Another SpMV Framework on GPUs

Creating Synthetic Microdata for Higher Educational Use in Japan: Reproduction of Distribution Type based on the Descriptive Statistics Kiyomi Shirakawa.

MOCLA02 Design of a Compact L-band Transverse Deflecting Cavity with Arbitrary Polarizations for the SACLA Injector Sep. 14th, 2015 H. Maesaka, T. Asaka,

Hui Wang†*, Canturk Isci‡, Lavanya Subramanian*,

Fuel cell development program for electric vehicle

Overview of TST-2 Experiment

Optomechanics with atoms

داده کاوی سئوالات نمونه

Inter-system biases estimation in multi-GNSS relative positioning with GPS and Galileo Cecile Deprez and Rene Warnant University of Liege, Belgium

ლექცია 4 - ფული და ინფლაცია

10. predavanje Novac i financijski sustav

Wissenschaftliche Aussprache zur Dissertation

FLUORECENCE MICROSCOPY SUPERRESOLUTION BLINK MICROSCOPY ON THE BASIS OF ENGINEERED DARK STATES* *Christian Steinhauer, Carsten Forthmann, Jan Vogelsang,

Particle acceleration during the gamma-ray flares of the Crab Nebular

Interpretations of the Derivative Gottfried Wilhelm Leibniz

Advisor: Chiuyuan Chen Student: Shao-Chun Lin

Widow Rockfish Assessment

SiW-ECAL Beam Test 2015 Kick-Off meeting

On Robust Neighbor Discovery in Mobile Wireless Networks

Chapter 6 并发：死锁和饥饿 Operating Systems: Internals and Design Principles

You NEED your book!!! Frequency Distribution

Y V =0 a V =V0 x b b V =0 z

Fairness-oriented Scheduling Support for Multicore Systems

Climate-Energy-Policy Interaction

Hui Wang†*, Canturk Isci‡, Lavanya Subramanian*,

Ch48 Statistics by Chtan FYHSKulai

The ABCD matrix for parabolic reflectors and its application to astigmatism free four-mirror cavities.

Measure Twice and Cut Once: Robust Dynamic Voltage Scaling for FPGAs

Online Learning: An Introduction

Factor Based Index of Systemic Stress (FISS)

What is Chemistry? Chemistry is: the study of matter & the changes it undergoes Composition Structure Properties Energy changes.

THE BERRY PHASE OF A BOGOLIUBOV QUASIPARTICLE IN AN ABRIKOSOV VORTEX*

Quantum-classical transition in optical twin beams and experimental applications to quantum metrology Ivano Ruo-Berchera Frascati.

The Toroidal Sporadic Source: Understanding Temporal Variations

FW 3.4: More Circle Practice

ارائه یک روش حل مبتنی بر استراتژی های تکاملی گروه بندی برای حل مسئله بسته بندی اقلام در ظروف

Decision Procedures Christoph M. Wintersteiger 9/11/2017 3:14 PM

Limits on Anomalous WWγ and WWZ Couplings from DØ

Presentation transcript:

5-5 Inference on the Ratio of Variances of Two Normal Populations 5-5.1 Hypothesis Testing on the ratio of Two Variances We wish to test the hypotheses: The development of a test procedure for these hypotheses requires a new probability distribution, the F distribution.

5-5 Inference on the Ratio of Variances of Two Normal Populations The F Distribution

5-5 Inference on the Ratio of Variances of Two Normal Populations 5-5.1 The F Distribution

5-5 Inference on the Ratio of Variances of Two Normal Populations The Test Procedure

5-5 Inference on the Ratio of Variances of Two Normal Populations The Test Procedure

5-5 Inference on the Ratio of Variances of Two Normal Populations The Test Procedure

5-5 Inference on the Ratio of Variances of Two Normal Populations

5-5 Inference on the Ratio of Variances of Two Normal Populations 5-5.2 Confidence Interval on the Ratio of Two Variances

5-5 Inference on the Ratio of Variances of Two Normal Populations

5-5 Inference on the Ratio of Variances of Two Normal Populations

5-8 What If We Have More Than Two Samples? 5-8.1 Completely Randomized Experiment and Analysis of Variance (Replicates) Treatments

5-8 What If We Have More Than Two Samples? 5-8.1 Completely Randomized Experiment and Analysis of Variance The levels of the factor are sometimes called treatments. Each treatment has six observations or replicates. The runs are run in random order.

5-8 What If We Have More Than Two Samples? 5-8.1 Completely Randomized Experiment and Analysis of Variance

5-8 What If We Have More Than Two Samples? 5-8.1 Completely Randomized Experiment and Analysis of Variance Table 5-6. Typical Data for a Single-Factor Experiment Treatment Observations Totals Averages 1 y11 y12 … y1n y1. 𝑦 1. 2 y21 y22 y2n y2. 𝑦 2. . a ya1 ya2 yan ya. 𝑦 a. y.. 𝑦 .. 𝑦 𝑖. = 𝑗=1 𝑛 𝑦 𝑖𝑗 𝑦 𝑖. = 𝑦 𝑖. 𝑛 𝑖=1, 2, …, 𝑎 𝑦 .. = 𝑖=1 𝑎 𝑗=1 𝑛 𝑦 𝑖𝑗 𝑦 .. = 𝑦 .. 𝑁

5-8 What If We Have More Than Two Samples? 5-8.1 Completely Randomized Experiment and Analysis of Variance Linear Statistical Model 𝑌 𝑖𝑗 = μ 𝑖 + 𝜖 𝑖𝑗 𝑖=1 𝑎 𝜏 𝑖 =0

5-8 What If We Have More Than Two Samples? 5-8.1 Completely Randomized Experiment and Analysis of Variance

5-8 What If We Have More Than Two Samples? 5-8.1 Completely Randomized Experiment and Analysis of Variance

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples? 5-8.1 Completely Randomized Experiment and Analysis of Variance

5-8 What If We Have More Than Two Samples? 5-8.1 Completely Randomized Experiment and Analysis of Variance

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples? 5-8.1 Completely Randomized Experiment and Analysis of Variance

5-8 What If We Have More Than Two Samples? Which means differ?

5-8 What If We Have More Than Two Samples? Multiple Comparisons If we wish to make all pairwise comparisons 𝜇 𝑖 − 𝜇 𝑗 we need to worry about the probability of a type I error on each test and also the probability of making at least one type I error on all the tests. This latter error is referred to as the experiment-wise error rate or the overall error rate. A number of procedures are available to protect the overall error rate. They all compare the differences 𝑋 𝑖 − 𝑋 𝑗 to a cut-off value. The cut-off values all have the form (𝑡𝑎𝑏𝑙𝑒𝑑 𝑣𝑎𝑙𝑢𝑒)× 𝑀𝑆𝐸×( 1 𝑛 𝑖 + 1 𝑛 𝑗 )

Steps in Multiple Comparisons 5-8 What If We Have More Than Two Samples? Steps in Multiple Comparisons Calculate 𝑋 𝑖 − 𝑋 𝑗 . Determine cut-off. Any difference in 1) larger than the cut-off have corresponding means that are significantly different from one another. Fisher’s Least Significant Difference (LSD). This does not protect the overall error rate. The overall error rate will be approximately 1− (1−𝛼) 𝑐 , where c is the number of comparisons being made. 𝐿𝑆𝐷= 𝑡 𝛼 2 , 𝑛−𝑟 𝑀𝑆𝐸×( 1 𝑛 𝑖 + 1 𝑛 𝑗 ) If all 𝑛 𝑖 =𝑚 then 𝐿𝑆𝐷= 𝑡 𝛼 2 𝑀𝑆𝐸× 2 𝑚

Steps in Multiple Comparisons 5-8 What If We Have More Than Two Samples? Steps in Multiple Comparisons Tukey’s Highest Significance Difference (HSD). This controls overall error rate. The overall error rate will be a. 𝐻𝑆𝐷= 𝑞 𝑛−𝑟, 𝑟 𝑀𝑆𝐸 ( 1 𝑛 𝑖 + 1 𝑛 𝑗 )/2 If all 𝑛 𝑖 =𝑚 then HSD= 𝑞 𝑛−𝑟, 𝑟 𝑀𝑆𝐸( 1 𝑚 ) An alternative in unbalanced designs is HSD= 𝑞 𝑛−𝑟, 𝑟 𝑀𝑆𝐸( 1 min⁡( 𝑛 𝑖 ) ) With Tukey’s you need both the degrees of freedom (n – r) and also the number of means being compared (r).

Steps in Multiple Comparisons 5-8 What If We Have More Than Two Samples? Steps in Multiple Comparisons Student’s, Neuman’s, Keuls (SNK). This overall error rate is between that of Fisher’s and Tukey’s. SNK= 𝑞 𝑛−𝑟, 𝑑 𝑀𝑆𝐸 ( 1 𝑛 𝑖 + 1 𝑛 𝑗 )/2 d is the number of means apart the two means being compared are in the ordered list of means (d between 2 and r). Notice, more than one cut-off is needed for the SNK. The SNK is done sequentially, starting with the means the farthest apart. Once a pair of means is found to be not significantly different, all pairs closer together are deemed not significantly different.

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples? 𝐿𝑆𝐷= 𝑡 𝛼 2 , 𝑛−𝑟 𝑀𝑆𝐸×( 1 𝑛 𝑖 + 1 𝑛 𝑗 ) = 𝑡 0.025, (24−4) 6.51× 1 6 + 1 6 =(2.086)(1.473) = 3.073 𝐻𝑆𝐷= 𝑞 𝑛−𝑟, 𝑟 𝑀𝑆𝐸 ( 1 𝑛 𝑖 + 1 𝑛 𝑗 )/2 = 𝑞 20, 4 6.51( 1 6 + 1 6 )/2 =(3.96)(1.042) = 4.125 S𝑁𝐾= 𝑞 𝑛−𝑟, 𝑑 𝑀𝑆𝐸 ( 1 𝑛 𝑖 + 1 𝑛 𝑗 )/2 = 𝑞 20, 2 6.51( 1 6 + 1 6 )/2 =(2.95)(1.042) = 3.074 𝑞 20, 2 =2.95 𝑞 20, 3 =3.58 𝑞 20, 4 =3.96

5-8 What If We Have More Than Two Samples? Cut-off value for Fisher’s LSD = 3.073 Cut-off value for Tukey’s HSD = 4.125 Cut-off value for SNK = 3.074 (d=2) 3.731 (d=3) 4.125 (d=4) 20 15 10 5 22.2 17 15.7 12.2* 7* 5.7* - 6.5* 1.3 5.2* * Significantly different.

5-8 What If We Have More Than Two Samples? Residual Analysis and Model Checking

5-8 What If We Have More Than Two Samples? Residual Analysis and Model Checking

5-8 What If We Have More Than Two Samples? Residual Analysis and Model Checking

5-8 What If We Have More Than Two Samples? Residual Analysis and Model Checking

5-8 What If We Have More Than Two Samples? proc format; value hc 1=' 5%' 2='10%' 3='15%' 4='20%'; DATA ex514; INPUT hc strength @@; format hc hc.; CARDS; 1 7 2 12 3 14 4 19 1 8 2 17 3 18 4 25 1 15 2 13 3 19 4 22 1 11 2 18 3 17 4 23 1 9 2 19 3 16 4 18 1 10 2 15 3 18 4 20 ods graphics on; proc anova data=ex514; class hc; model strength= hc; means hc/lsd snk tukey; TITLE 'proc anova balanced 1-way anova'; proc glm data=ex514 plots = diagnostics; /* General Linear Model uses the least squares to fit the model */ means hc/lsd snk tukey hovtest; /* Homogeneity variance test */ TITLE 'proc glm 1-way anova'; RUN; ods graphics off; QUIT;

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples? Contrasts Pairwise comparisons are one type of common comparison made in designed experiment. A contrast is a more general type of comparison. A contrast is a linear combination of the cell means such that the coefficients add up to zero, i.e. Ψ= 𝑐 1 𝜇 1 + 𝑐 2 𝜇 2 + …+ 𝑐 𝑟 𝜇 𝑟 such that 𝑖=1 𝑟 𝑐 𝑖 =0. To estimate Ψ we use Ψ = 𝑐 1 𝑋 1 + 𝑐 2 𝑋 2 + …+ 𝑐 𝑟 𝑋 𝑟 The estimated standard error of Ψ is 𝑆𝐸 Ψ = 𝑀𝑆𝐸( 𝑖=1 𝑟 𝑐 𝑖 2 𝑛 𝑖 ) A 100(1-)% confidence interval for Ψ is Ψ ± 𝑡 𝛼 2 𝑆𝐸( Ψ )

Kruskal-Wallis Nonparametric Test 5-8 What If We Have More Than Two Samples? Kruskal-Wallis Nonparametric Test H0: All k populations have the same distribution H1: Not all populations have the same distribution. Test Statistic: 𝐻= 12 𝑛(𝑛+1) 𝑖=1 𝑘 𝑅 𝑖 2 𝑛 𝑖 −3(𝑛+1) Here the Ri are the sum of the ranks for each population. The ranks are found by ranking the data in all populations combined. The n is the total sample size and ni is the sample size of the sample from the ith population. Rejection Region: 𝐻> χ 2 𝛼;𝑘−1 Paired comparison of population i to j: 𝐷=| 𝑅 𝑖 − 𝑅 𝑗 |> χ 2 𝛼;𝑘−1 𝑛(𝑛+1) 12 1 𝑛 𝑖 + 1 𝑛 𝑗

5-8 What If We Have More Than Two Samples? Example An article in Fortune compared rent in five American cities: New York, Chicago, Detroit, Tampa, and Orlando. The following data are small random samples of rents (in dollars) in the five cities. The New York data are Manhattan only. Conduct the Kruskal-Wallis test to determine whether evidence exists that there are significant differences in the rents in these cities. If differences exit, where are they?

5-8 What If We Have More Than Two Samples? OPTIONS NOOVP NODATE NONUMBER LS=80; proc format; value city 1=' New York' 2='Chicago' 3='Detroit' 4='Tampa' 5='Orlando'; DATA rent; INPUT city rent @@; format city city.; CARDS; 1 900 1 1200 1 850 1 1320 1 1400 1 1150 1 975 2 625 2 640 2 775 2 1000 2 690 2 550 2 840 2 750 3 415 3 400 3 420 3 560 3 780 3 620 3 800 3 390 4 410 4 310 4 320 4 280 4 500 4 385 4 440 5 340 5 425 5 275 5 210 5 575 5 360 ods graphics on; proc npar1way data=rent wilcoxon; class city; var rent; TITLE 'Kruskal-Wallis Test'; PROC RANK DATA=RENT OUT=RRENT; VAR RENT; /* A one-way ANOVA applied to ranks is equivalent to the Kruskal-Wallis test. */ PROC GLM DATA=RRENT; CLASS CITY; MODEL RENT = CITY; LSMEANS CITY/ PDIFF=ALL; RUN; ods graphics off; QUIT;

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples? Cutoff for pairwise comparisons Tampa vs. Orlando χ 𝛼;𝑘−1 2 𝑛(𝑛+1) 12 1 𝑛 𝑖 + 1 𝑛 𝑗 = χ 0.05 2 ;4 36(36+1) 12 1 6 + 1 7 = 18.06 Chicago vs. NY χ 𝛼;𝑘−1 2 𝑛(𝑛+1) 12 1 𝑛 𝑖 + 1 𝑛 𝑗 = χ 0.05 2 ;4 36(36+1) 12 1 8 + 1 7 = 16.80 NYw 8 Chicago Detroit Tampa Orlando 32.6 24 16.9 8.8 8.2 24.4* 15.8 8.7 0.6 - 23.8* 15.2 8.1 15.7 7.1 8.6 * Significantly different.

5-8 What If We Have More Than Two Samples? 5-8.2 Randomized Complete Block Experiment The randomized block design is an extension of the paired t-test to situations where the factor of interest has more than two levels.

5-8 What If We Have More Than Two Samples? 5-8.2 Randomized Complete Block Experiment For example, consider the situation where three different methods were used to predict the shear strength of steel plate girders. Say we use four girders as the experimental units.

5-8 What If We Have More Than Two Samples? 5-8.2 Randomized Complete Block Experiment

5-8 What If We Have More Than Two Samples? 5-8.2 Randomized Complete Block Experiment The appropriate linear statistical model: treatments and blocks are initially fixed factors treatment and block effects as deviations from m treatments and blocks do not interact

5-8 What If We Have More Than Two Samples? 5-8.2 Randomized Complete Block Experiment The hypotheses of interest are:

5-8 What If We Have More Than Two Samples? 5-8.2 Randomized Complete Block Experiment

5-8 What If We Have More Than Two Samples? 5-8.2 Randomized Complete Block Experiment The mean squares are:

5-8 What If We Have More Than Two Samples? 5-8.2 Randomized Complete Block Experiment The expected values of these mean squares are:

5-8 What If We Have More Than Two Samples? 5-8.2 Randomized Complete Block Experiment

5-8 What If We Have More Than Two Samples? 5-8.2 Randomized Complete Block Experiment

5-8 What If We Have More Than Two Samples? 5-8.2 Randomized Complete Block Experiment The model for the randomized block design is the same as the two-way factorial where the interaction terms are assumed to be zero. The main question of interest is whether the means are the same for the different treatment levels. The blocks are used to explain some of the variability and many times to simplify the mechanics of collecting the data. For instance if the blocks are cities, it is much easier to collect data in one city at a time. The SSTR is the same as SSA in the two-way factorial, SSBL is the same as the SSB in the two-way factorial. Notice SSE in the corresponding two-way factorial model has zero degrees of freedom when you have only one observation per cell. The two-way factorial with only one observation is analyzed the same as the randomized block, and also assumes no interaction between the two factors.

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples? When is Blocking Necessary? RBD에서 error의 자유도는 (a-1)(b-1) CRD에서 error의 자유도는 a(n-1) = a(b-1) 그러므로, blocking을 사용함으로서 자유도 (b-1)의 손실에 의한 효과가 그다지 크지 않기 때문에, blocking effect가 중요하다면 RBD를 사용하는 것이 합리적이다.

5-8 What If We Have More Than Two Samples? Which means differ?

5-8 What If We Have More Than Two Samples? Residual Analysis and Model Checking

5-8 What If We Have More Than Two Samples? Residual Analysis and Model Checking

5-8 What If We Have More Than Two Samples? Residual Analysis and Model Checking

5-8 What If We Have More Than Two Samples? Residual Analysis and Model Checking

5-8 What If We Have More Than Two Samples? Residual Analysis and Model Checking

5-8 What If We Have More Than Two Samples? Example 5-15 OPTIONS NOOVP NODATE NONUMBER LS=80; DATA ex515; DO type=1 to 4; DO fabsam=1 TO 5; INPUT strength @@; OUTPUT; END; END; CARDS; 1.3 1.6 0.5 1.2 1.1 2.2 2.4 0.4 2 1.8 1.8 1.7 0.6 1.5 1.3 3.9 4.4 2 4.1 3.4 PROC GLM DATA=ex515 plots=diagnostics; CLASS type fabsam; MODEL strength= type fabsam; MEANS TYPE/SNK; output out=new p=phat r=resid; TITLE 'Randomized Block Design FabSample: Block, ChemType: Treatment'; proc plot data=new; plot resid*phat; /* Residual Plot */ plot resid*type; /* Residual by Chemical Type */ plot resid*fabsam; /* Residuals by block */ proc anova data=ex515; class type; model strength=type; means type/snk; TITLE 'one-way anova'; run; quit;

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?

5-8 What If We Have More Than Two Samples?