Lab 5 Hypothesis testing and Confidence Interval.

Slides:



Advertisements
Similar presentations
BINF 702 Spring 2014 Practice Problems Practice Problems BINF 702 Practice Problems.
Advertisements

One sample T Interval Example: speeding 90% confidence interval n=23 Check conditions Model: t n-1 Confidence interval: 31.0±1.52 = (29.48, 32.52) STAT.
Confidence Interval and Hypothesis Testing for:
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
1 Matched Samples The paired t test. 2 Sometimes in a statistical setting we will have information about the same person at different points in time.
PSY 307 – Statistics for the Behavioral Sciences
12.5 Differences between Means (s’s known)
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
SADC Course in Statistics Comparing Means from Independent Samples (Session 12)
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
BCOR 1020 Business Statistics Lecture 22 – April 10, 2008.
Lecture 6 Outline: Tue, Sept 23 Review chapter 2.2 –Confidence Intervals Chapter 2.3 –Case Study –Two sample t-test –Confidence Intervals Testing.
Hypothesis : Statement about a parameter Hypothesis testing : decision making procedure about the hypothesis Null hypothesis : the main hypothesis H 0.
BCOR 1020 Business Statistics
COMPARING MEANS: INDEPENDENT SAMPLES 1 ST sample: x1, x2, …, xm from population with mean μx; 2 nd sample: y1, y2, …, yn from population with mean μy;
The Scientific Study of Politics (POL 51) Professor B. Jones University of California, Davis.
P-value  Is defined as: the probability of getting a difference at least as big as that observed if the null hypothesis is true.
Chapter 11: Inference for Distributions
Chapter 9 Hypothesis Testing.
Student’s t statistic Use Test for equality of two means
Two-sample problems for population means BPS chapter 19 © 2006 W.H. Freeman and Company.
5-3 Inference on the Means of Two Populations, Variances Unknown
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Chapter 9 Comparing Means
 We cannot use a two-sample t-test for paired data because paired data come from samples that are not independently chosen. If we know the data are paired,
Variance-Test-1 Inferences about Variances (Chapter 7) Develop point estimates for the population variance Construct confidence intervals for the population.
Lecture 8 1 Hypothesis tests Hypothesis H 0 : Null-hypothesis is an conjecture which we assume is true until we have too much evidence against it. H 1.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Lecture 9 1 Reminder:Hypothesis tests Hypotheses H 0 : Null-hypothesis is an conjecture which we assume is true until we have too much evidence against.
More About Significance Tests
Dependent Samples: Hypothesis Test For Hypothesis tests for dependent samples, we 1.list the pairs of data in 2 columns (or rows), 2.take the difference.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
Comparing Two Population Means
Chapter 10 Comparing Two Means Target Goal: I can use two-sample t procedures to compare two means. 10.2a h.w: pg. 626: 29 – 32, pg. 652: 35, 37, 57.
One Sample Inf-1 If sample came from a normal distribution, t has a t-distribution with n-1 degrees of freedom. 1)Symmetric about 0. 2)Looks like a standard.
Week 111 Power of the t-test - Example In a metropolitan area, the concentration of cadmium (Cd) in leaf lettuce was measured in 7 representative gardens.
Hypothesis tests III. Statistical errors, one-and two sided tests. One-way analysis of variance. 1.
1 Objective Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means.
Inference for distributions: - Comparing two means IPS chapter 7.2 © 2006 W.H. Freeman and Company.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
1 Section 9-4 Two Means: Matched Pairs In this section we deal with dependent samples. In other words, there is some relationship between the two samples.
Two sample problems:  compare the responses in two groups  each group is a sample from a distinct population  responses in each group are independent.
1 ConceptsDescriptionHypothesis TheoryLawsModel organizesurprise validate formalize The Scientific Method.
Objectives (BPS chapter 19) Comparing two population means  Two-sample t procedures  Examples of two-sample t procedures  Using technology  Robustness.
S-012 Testing statistical hypotheses The CI approach The NHST approach.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
1 9 Tests of Hypotheses for a Single Sample. © John Wiley & Sons, Inc. Applied Statistics and Probability for Engineers, by Montgomery and Runger. 9-1.
Ch11: Comparing 2 Samples 11.1: INTRO: This chapter deals with analyzing continuous measurements. Later, some experimental design ideas will be introduced.
Week111 The t distribution Suppose that a SRS of size n is drawn from a N(μ, σ) population. Then the one sample t statistic has a t distribution with n.
Tests of significance: The basics BPS chapter 14 © 2006 W.H. Freeman and Company.
CHAPTER 27: One-Way Analysis of Variance: Comparing Several Means
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Comparing the Means of Two Dependent Populations.
+ Unit 6: Comparing Two Populations or Groups Section 10.2 Comparing Two Means.
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Difference Between Two Means.
Inference for Distributions 7.2 Comparing Two Means © 2012 W.H. Freeman and Company.
If we fail to reject the null when the null is false what type of error was made? Type II.
Comparing 2 populations. Placebo go to see a doctor.
Hypothesis Tests u Structure of hypothesis tests 1. choose the appropriate test »based on: data characteristics, study objectives »parametric or nonparametric.
Chapter 7 Inference Concerning Populations (Numeric Responses)
Objectives (PSLS Chapter 18) Comparing two means (σ unknown)  Two-sample situations  t-distribution for two independent samples  Two-sample t test 
Stat 251 (2009, Summer) Final Lab TA: Yu, Chi Wai.
Significance Test for the Difference of Two Proportions
STAT 312 Introduction Z-Tests and Confidence Intervals for a
Hypothesis testing using R
Presentation transcript:

Lab 5 Hypothesis testing and Confidence Interval

Outline One sample t-test Two sample t-test Paired t-test

Lab 5 One-sample t-test

One sample t-test The hypotheses : One sided Two sided

One sample t-test Test statistics

One sample t-test Conclusion Compare the test statistics with the critical value … Compare the p-value with the level of significance α (e.g. 0.05, 0.1) Reject H 0 if p-value < α (enough evidence) Cannot reject H 0 if p-value > α (not enough evidence)

Example Download the biotest.txt data file Read into R using function read.table() Extract the 1 st column and store as ‘X1’ Store the 2 nd column as ‘X2’

Example > X1 = read.table(“biotest.txt”) [,1] > X2 = read.table(“biotest.txt”) [,2]

Example Take ‘X1’ as the sample in this case, Test H 0 : μ = 115 against H 1 : μ ≠ 115 at significant level α = 0.05

[R] command t.test() Syntax: t.test(x=“data”, alternative = “less / greater / two.sided”, mu=“μ 0 ” )

Example 1 > t.test(X1, alternative = “two.sided”, mu=115) One Sample t-test data: X1 t = , df = 9, p-value = alternative hypothesis: true mean is not equal to percent confidence interval: sample estimates: mean of x 115.6

Example 1 > t.test(X1, alternative = “two.sided”, mu=115) One Sample t-test data: X1 t = , df = 9, p-value = alternative hypothesis: true mean is not equal to percent confidence interval: sample estimates: mean of x 115.6

Example 1 > t.test(X1, alternative = “two.sided”, mu=115) One Sample t-test data: X1 t = , df = 9, p-value = alternative hypothesis: true mean is not equal to percent confidence interval: sample estimates: mean of x larger than 0.05 Cannot reject H 0 at 0.05 level of significance

Example 1 > t.test(X1, alternative = “two.sided”, mu=115) One Sample t-test data: X1 t = , df = 9, p-value = alternative hypothesis: true mean is not equal to percent confidence interval: sample estimates: mean of x μ 0 inside the 95% CI

Example 2 Test H 0 : μ ≤ 108 against H 1 : μ > 108 at significant level α = 0.05

Example 2 > t.test(X1, alternative = “greater”, mu=108) One Sample t-test data: X1 t = , df = 9, p-value = alternative hypothesis: true mean is greater than percent confidence interval: Inf sample estimates: mean of x 115.6

Example 2 > t.test(X1, alternative = “greater”, mu=108) One Sample t-test data: X1 t = , df = 9, p-value = alternative hypothesis: true mean is greater than percent confidence interval: Inf sample estimates: mean of x smaller than 0.05 Reject H 0 at 0.05 level of significance

Example 2 Conclude that the population mean is significantly greater than 108

Example 2 > t.test(X1, alternative = “greater”, mu=108) One Sample t-test data: X1 t = , df = 9, p-value = alternative hypothesis: true mean is greater than percent confidence interval: Inf sample estimates: mean of x Statistical significance vs. Practical significance

Confidence Interval By default, the function t.test() includes a 95% confidence interval Question: Can we change the confidence level?

Confidence Interval e.g. want a 99% confidence interval > t.test(x1, alternative=“greater”, mu=108, conf.level = 0.99)

Lab 5 Two-sample t-test

Testing the population mean of two independent samples

Two-sample t-test Two-sided One-sided

Example 3 Consider the two sample X1 and X2 Want to test if there is there is a significant difference between the mean of X1 and mean of X2.

Example 3 Two sided test H 0 : μ 1 = μ 2 against H 1 : μ 1 ≠ μ 2 at 0.05 level of significance Assuming equal variance

Example 3 > t.test(X1, X2, alternative = “two.sided”, var.equal = TRUE) Two Sample t-test data: X1 and X2 t = , df = 18, p-value = alternative hypothesis: true difference in means is not equal to 0 95 percent confidence interval: sample estimates: mean of x mean of y

Example 3 > t.test(X1, X2, alternative = “two.sided”, var.equal = TRUE) Two Sample t-test data: X1 and X2 t = , df = 18, p-value = alternative hypothesis: true difference in means is not equal to 0 95 percent confidence interval: sample estimates: mean of x mean of y

Example 3 Not assuming equal variance? > t.test(X1, X2, alternative = “two.sided”, var.equal = FALSE)

Lab 5 Paired t-test

Two samples problem But they are no longer independent Example: Measurement taken twice at different time point from the same group of subjects Blood pressure before and after some treatment Want to test the difference of the means

Paired t-test If we take the difference of the measurements of each subject. Reduce to a one sample problem The rest is the same as a one sample t-test X1 X2 X3 X4 y1 y2 y3 y4 -= d1 d2 d3 d4

Example 4 Consider again the dataset X1 and X2, and assume they are pairwise observations Test the equality of the means i.e. test if difference in mean = 0 H 0 : μ 1 = μ 2 against H 1 : μ 1 ≠ μ 2 at 0.05 level of significance

Example 4 > t.test(X1, X2, alternative = “two.sided”, paired = TRUE) Paired t-test data: X1 and X2 t = , df = 9, p-value = alternative hypothesis: true difference in means is not equal to 0 95 percent confidence interval: sample estimates: mean of the differences -4.8

Example 4 > t.test(X1, X2, alternative = “two.sided”, paired = TRUE) Paired t-test data: X1 and X2 t = , df = 9, p-value = alternative hypothesis: true difference in means is not equal to 0 95 percent confidence interval: sample estimates: mean of the differences -4.8

Alternatively… > t.test(X1-X2, alternative = “two.sided”) One Sample t-test data: X1 - X2 t = , df = 9, p-value = alternative hypothesis: true mean is not equal to 0 95 percent confidence interval: sample estimates: mean of x -4.8

Alternatively… > t.test(X1-X2, alternative = “two.sided”) One Sample t-test data: X1 - X2 t = , df = 9, p-value = alternative hypothesis: true mean is not equal to 0 95 percent confidence interval: sample estimates: mean of x -4.8 EXACTLY THE SAME RESULT!!

Final Remarks Notice that the conclusion from the two sample t-test and the paired t-test are different even if we are looking at the same data set. Should check if the two sample are independent or not

Final Remarks Using the wrong test either lead to loss of sensitivity or invalid analysis.