Dependent Samples: Hypothesis Test For Hypothesis tests for dependent samples, we 1.list the pairs of data in 2 columns (or rows), 2.take the difference.

Slides:



Advertisements
Similar presentations
Hypothesis Testing An introduction. Big picture Use a random sample to learn something about a larger population.
Advertisements

Is it statistically significant?
A statistical hypothesis is an assumption about a population parameter. This assumption may or may not be true
The Independent- Samples t Test Chapter 11. Independent Samples t-Test >Used to compare two means in a between-groups design (i.e., each participant is.
One sample T Interval Example: speeding 90% confidence interval n=23 Check conditions Model: t n-1 Confidence interval: 31.0±1.52 = (29.48, 32.52) STAT.
STATISTICAL INFERENCE PART V
Confidence Interval and Hypothesis Testing for:
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Significance Testing Chapter 13 Victor Katch Kinesiology.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. More About Significance Tests Chapter 13.
Comparing Two Population Means The Two-Sample T-Test and T-Interval.
Chapter 9: Inferences for Two –Samples
Chapter 10 Two-Sample Tests
© 2010 Pearson Prentice Hall. All rights reserved Least Squares Regression Models.
Chap 9-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Basic Business Statistics 12 th Edition Chapter 9 Fundamentals of Hypothesis.
PSY 307 – Statistics for the Behavioral Sciences
BCOR 1020 Business Statistics Lecture 22 – April 10, 2008.
6.4 One and Two-Sample Inference for Variances. Example - Problem 26 – Page 435  D. Kim did some crude tensile strength testing on pieces of some nominally.
BCOR 1020 Business Statistics
Chapter Goals After completing this chapter, you should be able to:
Hypothesis Testing. Introduction Always about a population parameter Attempt to prove (or disprove) some assumption Setup: alternate hypothesis: What.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 10-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Two Population Means Hypothesis Testing and Confidence Intervals With Unknown Standard Deviations.
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
Stat 301 – Day 26 Prediction Intervals Paired t-test.
5-3 Inference on the Means of Two Populations, Variances Unknown
Hypothesis Testing Using The One-Sample t-Test
Chapter 24: Comparing Means.
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
Lesson Comparing Two Means.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Inference on the Least-Squares Regression Model and Multiple Regression 14.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 2 – Slide 1 of 25 Chapter 11 Section 2 Inference about Two Means: Independent.
1 Level of Significance α is a predetermined value by convention usually 0.05 α = 0.05 corresponds to the 95% confidence level We are accepting the risk.
Laws of Logic and Rules of Evidence Larry Knop Hamilton College.
Comparing the Means of Two Dependent Populations The Paired T-test ….
More About Significance Tests
Chapter 11 Comparing Two Populations or Treatments.
Comparing Two Population Means
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
Week 111 Power of the t-test - Example In a metropolitan area, the concentration of cadmium (Cd) in leaf lettuce was measured in 7 representative gardens.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 24 Comparing Means.
1 © 2008 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 11 Comparing Two Populations or Treatments.
A Broad Overview of Key Statistical Concepts. An Overview of Our Review Populations and samples Parameters and statistics Confidence intervals Hypothesis.
A Course In Business Statistics 4th © 2006 Prentice-Hall, Inc. Chap 9-1 A Course In Business Statistics 4 th Edition Chapter 9 Estimation and Hypothesis.
Chap 9-1 Two-Sample Tests. Chap 9-2 Two Sample Tests Population Means, Independent Samples Means, Related Samples Population Variances Group 1 vs. independent.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
Lecture 9 Chap 9-1 Chapter 2b Fundamentals of Hypothesis Testing: One-Sample Tests.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Two-Sample Tests Statistics for Managers Using Microsoft.
Lesson Comparing Two Means. Knowledge Objectives Describe the three conditions necessary for doing inference involving two population means. Clarify.
Week111 The t distribution Suppose that a SRS of size n is drawn from a N(μ, σ) population. Then the one sample t statistic has a t distribution with n.
Chapter 8 Parameter Estimates and Hypothesis Testing.
KNR 445 Statistics t-tests Slide 1 Introduction to Hypothesis Testing The z-test.
to accompany Introduction to Business Statistics
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
ISMT253a Tutorial 1 By Kris PAN Skewness:  a measure of the asymmetry of the probability distribution of a real-valued random variable 
MATB344 Applied Statistics I. Experimental Designs for Small Samples II. Statistical Tests of Significance III. Small Sample Test Statistics Chapter 10.
Homework and project proposal due next Thursday. Please read chapter 10 too…
© The McGraw-Hill Companies, Inc., Chapter 12 Analysis of Variance (ANOVA)
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Difference Between Two Means.
Chapter 9 Lecture 3 Section: 9.3. We will now consider methods for using sample data from two independent samples to test hypotheses made about two population.
Chapter 7 Inference Concerning Populations (Numeric Responses)
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 10 Comparing Two Groups Section 10.3 Other Ways of Comparing Means and Comparing Proportions.
Statistical hypothesis Statistical hypothesis is a method for testing a claim or hypothesis about a parameter in a papulation The statement H 0 is called.
Independent Samples: Comparing Means Lecture 39 Section 11.4 Fri, Apr 1, 2005.
Lecture Notes and Electronic Presentations, © 2013 Dr. Kelly Significance and Sample Size Refresher Harrison W. Kelly III, Ph.D. Lecture # 3.
Chapter 9 Lecture 4 Section: 9.4.
Presentation transcript:

Dependent Samples: Hypothesis Test For Hypothesis tests for dependent samples, we 1.list the pairs of data in 2 columns (or rows), 2.take the difference between each pair of data, 3.analyze the differences where d i is the difference for the i th pair of data and n is the # of pairs of data. s d is the standard deviation of the differences. Then, has a t distribution with df = n-1 under the usual assumptions.

Dependent Samples: P-value Calculation For Hypothesis tests, we want to find p-values. This requires Minitab. Let the observed test statistic be t* Now, p-value = P(t n-1 > t*), when H a :  d >  0 P(t n-1 < t*), when H a :  d <  0 or, 2P(t n-1 > |t*|), when H a :  d  0 where n = number of pairs in the analysis.

Diabetes Knowledge Scores Scores before and after a lecture. Let  d =  after-before = 3.2 s d = (Minitab) Before After Difference (After-Before)

Dependent Samples Hypothesis Test Step 1: H 0 :  d = 0 H 1 :  d > 0 Step 2: t* = has a t-distribution with df = 9. Step 3: t* = = 3.7 (round to tenths place) Step 4: p-value = P(t df=9 > 3.7) = (Minitab)

Dependent Samples Hypothesis Test Step 5: Since the p-value is less than any reasonable level of significance, we reject the null hypothesis and accept the research hypothesis. Step 6: With a.05 level of significance, we conclude there is enough evidence that the program yields a mean increase in knowledge of diabetes.

Dependent Samples Confidence Interval We can also find a formula to estimate the mean difference. The result is Diabetes Example: Find a 95% CI. Recall, = 3.2 and s d = Also, there were n = 10 pairs of data so df = 9. t.025,9 = from software. Thus, = = (1.2, 5.2 ) We are 95% confident the mean increase in knowledge about diabetes from the course is between 1.2 and 5.2 points.

Dependent Samples Confidence Interval Travel Times for two routes (Minitab output) Paired T-Test and CI: Route I, Route II Paired T for Route I - Route II N Mean StDev SE Mean Route I Route II Difference % CI for mean difference: (-0.142, 1.742) T-Test of mean difference = 0 (vs not = 0): T-Value = 1.92 P-Value = It is unclear at the 95% level of confidence which route is faster on average (takes less time). If route I is faster, it is by no more than 0.14 minute, on average. If route II is faster on average, it is by no more than 1.74 minutes.

Prescription Drug Costs Are Rx drugs from Canada cheaper than drugs in the USA? Because costs of drugs vary considerably, we should consider using a dependent samples design. Otherwise, the large variation is costs among the Rx drugs may hide a difference in typical costs. A sample of 10 drugs were chosen, then using an online pharmacy price checking website, the drugs were priced at a Canadian and a USA pharmacy.

Rx Costs - Independent Samples Analysis The output from an independent samples analysis is given on the next slide. What is the research hypothesis? H a : The mean cost of drugs in Canada is lower than the mean cost of drugs in the USA. What would we conclude from THIS analysis? We will ask whether this is appropriate or not later.

Rx Costs - Independent Samples Analysis Two-Sample T-Test and CI: Canada, USA N Mean StDev SE Mean Canada USA Difference = mu (Canada) - mu (USA) Estimate for difference: T-Test of difference = 0 (vs <): T-Value = P-Value = DF = 18 Both use Pooled StDev =

Rx Costs - Independent Samples Analysis From this output, we would conclude that there is not enough evidence to convince us that the mean cost of drugs from Canada is lower than the mean cost of drugs in the USA. Do you believe this? Why or why not?

Rx Costs - Dependent Samples Analysis Because we priced the same 10 drugs, we should be using a dependent samples (paired) analysis. This also controls for variability among drug costs. The output from a dependent samples analysis is given on the next slide. What is the research hypothesis? H a : The mean cost of drugs in Canada is lower than in the USA.

Rx Costs - Dependent Samples Analysis Paired T-Test and CI: Canada, USA Paired T for Canada - USA N Mean StDev SE Mean Canada USA Difference % upper bound for mean difference: T-Test of mean difference = 0 (vs < 0): T-Value = P-Value = 0.000

Rx Costs - Dependent Samples Analysis Notice that there is now overwhelming evidence to support the research hypothesis. The p-value is listed as (< is how it would be reported). This implies there is almost no chance of seeing this pattern in a sample of 10 pairs of drug costs if the mean costs are not different between Canada and the USA. (ie: there is no chance of seeing the sample data if the null hypothesis is true). So we would claim the research hypothesis has been proven beyond a reasonable doubt.

Rx Costs - Dependent Samples Analysis What about assumptions? In dependent samples analysis, we need to have a random sample of paired observations differences must be normally distributed, or we need a large sample (30 or more differences as a rule of thumb) The sample was selected when I looked at names of drugs listed on the website, so it is not truly a random sample. This is one criticism. How would we check the normality assumption?

Is Data Normal or not? Stat  Basic Statistics  Normality Test gives a way to check. (Enter the variable) Your hypotheses are: H o : The data comes from a normal population of data. H a : The data comes from a population that is not a normal distribution. Caveat: Statisticians NEVER accept the null hypothesis, yet that is exactly what this test does!

Is Data Normal or not? If the p-value is small, we must use another technique (called nonparametrics). If the p-value is large, then researchers assume the t- test is ok to use. The t-test is called robust to this assumption, so only when the original population is very different from a normal distribution does it make a big difference (the p-value is not affect much when the population is slightly non-normal).

Is Data Normal or not? Minitab