Data Mining 2016/2017 Fall MIS 331 Chapter 2 Sampliing Distribution

Slides:

Advertisements

Similar presentations

Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 10-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.

Advertisements

Chapter 6 Sampling and Sampling Distributions

Business and Economics 9th Edition

Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 10-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.

Chapter 10 Two-Sample Tests

Chapter 8 Estimation: Additional Topics

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 10 Hypothesis Testing:

Chapter 10 Two-Sample Tests

Chapter 7 Sampling and Sampling Distributions

© 2002 Prentice-Hall, Inc.Chap 8-1 Statistics for Managers using Microsoft Excel 3 rd Edition Chapter 8 Two Sample Tests with Numerical Data.

Chap 11-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 11 Hypothesis Testing II Statistics for Business and Economics.

Chapter Goals After completing this chapter, you should be able to:

Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 9-1 Introduction to Statistics Chapter 10 Estimation and Hypothesis.

Chapter Topics Comparing Two Independent Samples:

1/45 Chapter 11 Hypothesis Testing II EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008.

Chap 9-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 9 Estimation: Additional Topics Statistics for Business and Economics.

A Decision-Making Approach

Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 10-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.

Part III: Inference Topic 6 Sampling and Sampling Distributions

© 2002 Prentice-Hall, Inc.Chap 8-1 Statistics for Managers using Microsoft Excel 3 rd Edition Kafla 8 Próf fyrir tvö úrtök (Ekki þýtt)

Chapter 11 Hypothesis Tests and Estimation for Population Variances

Chapter 7 Estimation: Single Population

© 2004 Prentice-Hall, Inc.Chap 10-1 Basic Business Statistics (9 th Edition) Chapter 10 Two-Sample Tests with Numerical Data.

Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Two Sample Tests Statistics for Managers Using Microsoft.

Basic Business Statistics (9th Edition)

Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 6 Sampling and Sampling.

1/49 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 9 Estimation: Additional Topics.

Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Two-Sample Tests Basic Business Statistics 10 th Edition.

Hypothesis Testing – Two Samples

Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 11-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.

Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap th & 7 th Lesson Hypothesis Testing for Two Population Parameters.

Chapter 10 Two-Sample Tests and One-Way ANOVA

Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 10-1 Chapter 2c Two-Sample Tests.

10-1 Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Chapter 10 Two-Sample Tests Statistics for Managers using Microsoft Excel 6 th.

Pengujian Hipotesis Dua Populasi By. Nurvita Arumsari, Ssi, MSi.

Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Two-Sample Tests and One-Way ANOVA Business Statistics, A First.

A Course In Business Statistics 4th © 2006 Prentice-Hall, Inc. Chap 9-1 A Course In Business Statistics 4 th Edition Chapter 9 Estimation and Hypothesis.

Industrial Statistics 2

Chap 9-1 Two-Sample Tests. Chap 9-2 Two Sample Tests Population Means, Independent Samples Means, Related Samples Population Variances Group 1 vs. independent.

Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap th Lesson Hypothesis Tests for One and Two Population Variances.

Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 8 th Edition Chapter 10 Hypothesis Testing:

Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Two-Sample Tests Statistics for Managers Using Microsoft.

Chap 10-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 10 Hypothesis Tests for.

Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.

Comparing Sample Means

Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 10, Slide 1 Two-Sample Tests and One-Way ANOVA Chapter 10.

AP Statistics. Chap 13-1 Chapter 13 Estimation and Hypothesis Testing for Two Population Parameters.

Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 10-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.

Lecture 8 Estimation and Hypothesis Testing for Two Population Parameters.

10-1 Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall Chapter 10 Two-Sample Tests Statistics for Managers using Microsoft Excel 6 th.

Chapter 6 Sampling and Sampling Distributions

Chapter 9 Estimation: Additional Topics

Chapter 10 Two-Sample Tests and One-Way ANOVA.

Statistics for Managers using Microsoft Excel 3rd Edition

Chapter 11 Hypothesis Testing II

Chapter 10 Two Sample Tests

Estimation & Hypothesis Testing for Two Population Parameters

Chapter 11 Hypothesis Testing II

Chapter 10 Two-Sample Tests.

Data Mining 2016/2017 Fall MIS 331 Chapter 2 Sampliing Distribution

Chapter 10 Two-Sample Tests and One-Way ANOVA.

Chapter 9 Hypothesis Testing.

Chapter 11 Hypothesis Tests and Estimation for Population Variances

Chapter 8 Estimation: Additional Topics

Chapter 10 Hypothesis Tests for One and Two Population Variances

Data Mining 2018/2019 Fall MIS 331 Chapter 7-A Sampliing Distribution,

Chapter 10 Two-Sample Tests

Chapter 9 Estimation: Additional Topics

Chapter 8 Estimation: Additional Topics

Presentation transcript:

Data Mining 2016/2017 Fall MIS 331 Chapter 2 Sampliing Distribution Confidence Interval Estimation Hypothesis Testing for Variance of a Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall

Outline Sampling Distributio of Sample Variances Confidence Interval Estimation for the Variance Tests of the Variance of a Normal Distribution Tests of Equality of Two Variances

Sampling Distributions of Sample Variances 6.4 Sampling Distributions Sampling Distributions of Sample Means Sampling Distributions of Sample Proportions Sampling Distributions of Sample Variances Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall

Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Sample Variance Let x1, x2, . . . , xn be a random sample from a population. The sample variance is the square root of the sample variance is called the sample standard deviation the sample variance is different for different random samples from the same population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall

Sampling Distribution of Sample Variances The sampling distribution of s2 has mean σ2 If the population distribution is normal, then Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall

Chi-Square Distribution of Sample and Population Variances If the population distribution is normal then has a chi-square (2 ) distribution with n – 1 degrees of freedom Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall

The Chi-square Distribution The chi-square distribution is a family of distributions, depending on degrees of freedom: d.f. = n – 1 Text Appendix Table 7 contains chi-square probabilities 2 2 2 0 4 8 12 16 20 24 28 0 4 8 12 16 20 24 28 0 4 8 12 16 20 24 28 d.f. = 1 d.f. = 5 d.f. = 15 Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall

Expected value of a chi-square distribution with degree of freedom v is v E[2v] = v Variance of achi-square distribution with degree of freedom v is 2v Var[2v] = 2v

Since (n-1)s2/2 has a chi-square distribution with df: n-1 E[(n-1)s2/2] = n-1 ((n-1)/2)E[s2] = n-1 E[s2] = 2, Similarly Var[(n-1)s2/2] = 2(n-1) ((n-1)2/4)Var[s2] = 2(n-1) Var[s2] = 24/(n-1)

Degrees of Freedom (df) Idea: Number of observations that are free to vary after sample mean has been calculated Example: Suppose the mean of 3 numbers is 8.0 Let X1 = 7 Let X2 = 8 What is X3? If the mean of these three values is 8.0, then X3 must be 9 (i.e., X3 is not free to vary) Here, n = 3, so degrees of freedom = n – 1 = 3 – 1 = 2 (2 values can be any numbers, but the third is not free to vary for a given mean) Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall

Table 7 in Appandix d.f. versus probabilities for critical values P(210 < KL) = 0.05 KL = 3.940 hence P(210 < 3.940) = 0.05 P(210 > KU) = 0.05 KU = 18.31 hence P(210 > 18.31) = 0.05

Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Chi-square Example A commercial freezer must hold a selected temperature with little variation. Specifications call for a standard deviation of no more than 4 degrees (a variance of 16 degrees2). A sample of 14 freezers is to be tested What is the upper limit (K) for the sample variance such that the probability of exceeding this limit, given that the population standard deviation is 4, is less than 0.05? Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall

Finding the Chi-square Value Is chi-square distributed with (n – 1) = 13 degrees of freedom Use the the chi-square distribution with area 0.05 in the upper tail: 213 = 22.36 (α = .05 and 14 – 1 = 13 d.f.) probability α = .05 2 213 = 22.36 Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall

Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Chi-square Example (continued) 213 = 22.36 (α = .05 and 14 – 1 = 13 d.f.) So: or (where n = 14) so If s2 from the sample of size n = 14 is greater than 27.52, there is strong evidence to suggest the population variance exceeds 16. Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall

Confidence Interval Estimation for the Variance 7.5 Confidence Intervals Population Mean Population Proportion Population Variance (From a normally distributed population) σ2 Known σ2 Unknown Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 7-15

Confidence Intervals for the Population Variance Goal: Form a confidence interval for the population variance, σ2 The confidence interval is based on the sample variance, s2 Assumed: the population is normally distributed Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 7-16

Confidence Intervals for the Population Variance (continued) The random variable follows a chi-square distribution with (n – 1) degrees of freedom Where the chi-square value denotes the number for which Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 7-17

P(2n-1 > 2n-1,/2 ) = /2 P(2n-1 > 2n-1,1-/2 ) = 1 - /2 or P(2n-1 < 2n-1,1-/2 ) = /2 Finally, P(2n-1,1-/2 < 2n-1 < 2n-1,/2) = 1 - /2 - /2 =1- 

two numbers such that probability that chi-square with d. f two numbers such that probability that chi-square with d.f. 6 is laying between tham is 0.90 P(26,0.950 < 26 < 26,0.05) =0.90 The two numbers 26,0.950 = 1.635 26,0.05 = 12.932 hence P(1.635 < 26 < 12.935) =0.90

Confidence Intervals for the Population Variance (continued) The 100(1 - )% confidence interval for the population variance is given by Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 7-20

Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Example You are testing the speed of a batch of computer processors. You collect the following data (in Mhz): Sample size 17 Sample mean 3004 Sample std dev 74 Assume the population is normal. Determine the 95% confidence interval for σx2 Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 7-21

Finding the Chi-square Values n = 17 so the chi-square distribution has (n – 1) = 16 degrees of freedom  = 0.05, so use the the chi-square values with area 0.025 in each tail: probability α/2 = .025 probability α/2 = .025 216 216 = 6.91 216 = 28.85 Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 7-22

Calculating the Confidence Limits The 95% confidence interval is Converting to standard deviation, we are 95% confident that the population standard deviation of CPU speed is between 55.1 and 112.6 Mhz Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 7-23

Tests of the Variance of a Normal Distribution 9.6 Goal: Test hypotheses about the population variance, σ2 (e.g., H0: σ2 = σ02) If the population is normally distributed, has a chi-square distribution with (n – 1) degrees of freedom Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Chap 11-24

Tests of the Variance of a Normal Distribution (continued) The test statistic for hypothesis tests about one population variance is Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Chap 11-25

Decision Rules: Variance Population variance Lower-tail test: H0: σ2  σ02 H1: σ2 < σ02 Upper-tail test: H0: σ2 ≤ σ02 H1: σ2 > σ02 Two-tail test: H0: σ2 = σ02 H1: σ2 ≠ σ02 a a a/2 a/2 Reject H0 if Reject H0 if Reject H0 if or Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Chap 11-26

Newbold 9.47 Test the hypothesis H0:2 <=100 againts H1 2 >100 a) s2 = 165, n=25 b) s2 = 165, n=29 c) s2 = 159, n=25 d) s2 = 67, n=38

Solution

Solution

Newbold 7.48 new safety device random sample for 8 days 618 660 638 625 571 598 639 582 management concenrs about variability test the null hypothesis variance less than 500 at a significance level of 10%

Solution

Chapter 10 Hypothesis Testing: Additional Topics Statistics for Business and Economics 8th Edition Chapter 10 Hypothesis Testing: Additional Topics Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-32

Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Chapter Goals After completing this chapter, you should be able to: Test hypotheses for the difference between two population means Two means, matched pairs Independent populations, population variances known Independent populations, population variances unknown but equal Complete a hypothesis test for the difference between two proportions (large samples) Use the F table to find critical F values Complete an F test for the equality of two variances Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-33

Two Sample Tests Two Sample Tests Population Means, Dependent Samples Population Means, Independent Samples Population Proportions Population Variances Examples: Same group before vs. after treatment Group 1 vs. independent Group 2 Proportion 1 vs. Proportion 2 Variance 1 vs. Variance 2 Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-34

Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Dependent Samples 10.1 Tests of the Difference Between Two Normal Population Means: Dependent Samples Dependent Samples Tests Means of 2 Related Populations Paired or matched samples Repeated measures (before/after) Use difference between paired values: Assumptions: Both Populations Are Normally Distributed di = xi - yi Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-35

Test Statistic: Dependent Samples The test statistic for the mean difference is a t value, with n – 1 degrees of freedom: where Population Means, Dependent Samples For tests of the following form: H0: μx – μy  0 H0: μx – μy ≤ 0 H0: μx – μy = 0 sd = sample standard dev. of differences n = the sample size (number of pairs) Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-36

Decision Rules: Matched Pairs Matched or Paired Samples Lower-tail test: H0: μx – μy  0 H1: μx – μy < 0 Upper-tail test: H0: μx – μy ≤ 0 H1: μx – μy > 0 Two-tail test: H0: μx – μy = 0 H1: μx – μy ≠ 0 a a a/2 a/2 -ta ta -ta/2 ta/2 Reject H0 if t < -tn-1, a Reject H0 if t > tn-1, a Reject H0 if t < -tn-1 , a/2 or t > tn-1 , a/2 has n - 1 d.f. Where Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-37

Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Matched Pairs Example Assume you send your salespeople to a “customer service” training workshop. Has the training made a difference in the number of complaints? You collect the following data:  di d = Number of Complaints: (2) - (1) Salesperson Before (1) After (2) Difference, di C.B. 6 4 - 2 T.F. 20 6 -14 M.H. 3 2 - 1 R.K. 0 0 0 M.O. 4 0 - 4 -21 n = - 4.2 Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-38

Critical Value = ± 2.776 d.f. = n − 1 = 4 Matched Pairs: Solution Has the training made a difference in the number of complaints (at the  = 0.05 level)? Reject Reject H0: μx – μy = 0 H1: μx – μy  0 /2 /2  = .05 d = - 4.2 - 2.776 2.776 - 1.66 Critical Value = ± 2.776 d.f. = n − 1 = 4 Decision: Do not reject H0 (t stat is not in the reject region) Test Statistic: Conclusion: There is not a significant change in the number of complaints. Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-39

Independent Samples 10.2 Tests of the Difference Between Two Normal Population Means: Dependent Samples Population means, independent samples Goal: Form a confidence interval for the difference between two population means, μx – μy Different populations Unrelated Independent Sample selected from one population has no effect on the sample selected from the other population Normally distributed Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-40

Difference Between Two Means (continued) Population means, independent samples σx2 and σy2 known Test statistic is a z value σx2 and σy2 unknown σx2 and σy2 assumed equal Test statistic is a a value from the Student’s t distribution σx2 and σy2 assumed unequal Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-41

* σx2 and σy2 Known Assumptions: Population means, independent samples Samples are randomly and independently drawn both population distributions are normal Population variances are known * σx2 and σy2 known σx2 and σy2 unknown Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-42

σx2 and σy2 Known (continued) When σx2 and σy2 are known and both populations are normal, the variance of X – Y is Population means, independent samples * σx2 and σy2 known …and the random variable has a standard normal distribution σx2 and σy2 unknown Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-43

Test Statistic, σx2 and σy2 Known Population means, independent samples The test statistic for μx – μy is: * σx2 and σy2 known σx2 and σy2 unknown Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-44

Hypothesis Tests for Two Population Means Two Population Means, Independent Samples Lower-tail test: H0: μx  μy H1: μx < μy i.e., H0: μx – μy  0 H1: μx – μy < 0 Upper-tail test: H0: μx ≤ μy H1: μx > μy i.e., H0: μx – μy ≤ 0 H1: μx – μy > 0 Two-tail test: H0: μx = μy H1: μx ≠ μy i.e., H0: μx – μy = 0 H1: μx – μy ≠ 0 Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-45

Decision Rules a a a/2 a/2 -za za -za/2 za/2 Two Population Means, Independent Samples, Variances Known Lower-tail test: H0: μx – μy  0 H1: μx – μy < 0 Upper-tail test: H0: μx – μy ≤ 0 H1: μx – μy > 0 Two-tail test: H0: μx – μy = 0 H1: μx – μy ≠ 0 a a a/2 a/2 -za za -za/2 za/2 Reject H0 if z < -za Reject H0 if z > za Reject H0 if z < -za/2 or z > za/2 Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-46

Newbold 10.8 A screening procedure - measure attitudes toward minorities high scores indicate negative attitudes low scores indicate positve attitudes Independent random samples 151 male 108 female financial analysits for males sample mean. 85.8, std dev: 19.13 for females sample mean. 71.5, std dev: 12.2

Newbold 10.8 Test the null hypothesis that the two population meand are equal against the alternative that the true mean score is higher for male then for female financial analysts

Solution

σx2 and σy2 Unknown, Assumed Equal Assumptions: Samples are randomly and independently drawn Populations are normally distributed Population variances are unknown but assumed equal Population means, independent samples σx2 and σy2 known σx2 and σy2 unknown * σx2 and σy2 assumed equal σx2 and σy2 assumed unequal Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-50

σx2 and σy2 Unknown, Assumed Equal (continued) The population variances are assumed equal, so use the two sample standard deviations and pool them to estimate σ use a t value with (nx + ny – 2) degrees of freedom Population means, independent samples σx2 and σy2 known σx2 and σy2 unknown * σx2 and σy2 assumed equal σx2 and σy2 assumed unequal Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-51

Test Statistic, σx2 and σy2 Unknown, Equal The test statistic for H0 :μx – μy = 0 is: σx2 and σy2 unknown * σx2 and σy2 assumed equal σx2 and σy2 assumed unequal Where t has (n1 + n2 – 2) d.f., and Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-52

Decision Rules a a a/2 a/2 -ta ta -ta/2 ta/2 Two Population Means, Independent Samples, Variances Unknown Lower-tail test: H0: μx – μy  0 H1: μx – μy < 0 Upper-tail test: H0: μx – μy ≤ 0 H1: μx – μy > 0 Two-tail test: H0: μx – μy = 0 H1: μx – μy ≠ 0 a a a/2 a/2 -ta ta -ta/2 ta/2 Reject H0 if t < -t (n1+n2 – 2), a Reject H0 if t > t (n1+n2 – 2), a Reject H0 if t < -t (n1+n2 – 2), a/2 or t > t (n1+n2 – 2), a/2 Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-53

Pooled Variance t Test: Example You are a financial analyst for a brokerage firm. Is there a difference in dividend yield between stocks listed on the NYSE & NASDAQ? You collect the following data: NYSE NASDAQ Number 21 25 Sample mean 3.27 2.53 Sample std dev 1.30 1.16 Assuming both populations are approximately normal with equal variances, is there a difference in average yield ( = 0.05)? Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-54

Calculating the Test Statistic H0: μ1 - μ2 = 0 i.e. (μ1 = μ2) H1: μ1 - μ2 ≠ 0 i.e. (μ1 ≠ μ2) The test statistic is: Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-55

Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Solution Reject H0 Reject H0 H0: μ1 - μ2 = 0 i.e. (μ1 = μ2) H1: μ1 - μ2 ≠ 0 i.e. (μ1 ≠ μ2)  = 0.05 df = 21 + 25 − 2 = 44 Critical Values: t = ± 2.0154 Test Statistic: .025 .025 -2.0154 2.0154 t 2.040 Decision: Conclusion: Reject H0 at a = 0.05 There is evidence of a difference in means. Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-56

σx2 and σy2 Unknown, Assumed Unequal Assumptions: Samples are randomly and independently drawn Populations are normally distributed Population variances are unknown and assumed unequal Population means, independent samples σx2 and σy2 known σx2 and σy2 unknown σx2 and σy2 assumed equal * σx2 and σy2 assumed unequal Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-57

σx2 and σy2 Unknown, Assumed Unequal (continued) Forming interval estimates: The population variances are assumed unequal, so a pooled variance is not appropriate use a t value with  degrees of freedom, where Population means, independent samples σx2 and σy2 known σx2 and σy2 unknown σx2 and σy2 assumed equal * σx2 and σy2 assumed unequal Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-58

Test Statistic, σx2 and σy2 Unknown, Unequal The test statistic for H0: μx – μy = 0 is: σx2 and σy2 unknown σx2 and σy2 assumed equal * σx2 and σy2 assumed unequal Where t has  degrees of freedom: Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-59

Two Population Proportions 10.3 Tests of the Difference Between Two Population Proportions (Large Samples) Population proportions Goal: Test hypotheses for the difference between two population proportions, Px – Py Assumptions: Both sample sizes are large, nP(1 – P) > 5 Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-60

Two Population Proportions (continued) The random variable has a standard normal distribution Population proportions Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-61

Test Statistic for Two Population Proportions The test statistic for H0: Px – Py = 0 is a z value: Population proportions Where Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-62

Decision Rules: Proportions Population proportions Lower-tail test: H0: Px – Py  0 H1: Px – Py < 0 Upper-tail test: H0: Px – Py ≤ 0 H1: Px – Py > 0 Two-tail test: H0: Px – Py = 0 H1: Px – Py ≠ 0 a a a/2 a/2 -za za -za/2 za/2 Reject H0 if z < -za Reject H0 if z > za Reject H0 if z < -za/2 or z > za/2 Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-63

Example: Two Population Proportions Is there a significant difference between the proportion of men and the proportion of women who will vote Yes on Proposition A? In a random sample, 36 of 72 men and 31 of 50 women indicated they would vote Yes Test at the .05 level of significance Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-64

Example: Two Population Proportions (continued) The hypothesis test is: H0: PM – PW = 0 (the two proportions are equal) H1: PM – PW ≠ 0 (there is a significant difference between proportions) The sample proportions are: Men: = 36/72 = .50 Women: = 31/50 = .62 The estimate for the common overall proportion is: Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-65

Example: Two Population Proportions (continued) Reject H0 Reject H0 The test statistic for PM – PW = 0 is: .025 .025 -1.96 1.96 -1.31 Decision: Do not reject H0 Conclusion: There is not significant evidence of a difference between men and women in proportions who will vote yes. Critical Values = ±1.96 For  = .05 Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-66

Tests of Equality of Two Variances 10.4 Tests of Equality of Two Variances Tests for Two Population Variances Goal: Test hypotheses about two population variances H0: σx2  σy2 H1: σx2 < σy2 Lower-tail test F test statistic H0: σx2 ≤ σy2 H1: σx2 > σy2 Upper-tail test H0: σx2 = σy2 H1: σx2 ≠ σy2 Two-tail test The two populations are assumed to be independent and normally distributed Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-67

Hypothesis Tests for Two Variances (continued) The random variable Tests for Two Population Variances F test statistic Has an F distribution with (nx – 1) numerator degrees of freedom and (ny – 1) denominator degrees of freedom Denote an F value with 1 numerator and 2 denominator degrees of freedom by Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-68

Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Test Statistic Tests for Two Population Variances The critical value for a hypothesis test about two population variances is F test statistic where F has (nx – 1) numerator degrees of freedom and (ny – 1) denominator degrees of freedom Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-69

Decision Rules: Two Variances Use sx2 to denote the larger variance. H0: σx2 = σy2 H1: σx2 ≠ σy2 H0: σx2 ≤ σy2 H1: σx2 > σy2 /2  F F Do not reject H0 Reject H0 Do not reject H0 Reject H0 rejection region for a two-tail test is: where sx2 is the larger of the two sample variances Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-70

Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Example: F Test You are a financial analyst for a brokerage firm. You want to compare dividend yields between stocks listed on the NYSE & NASDAQ. You collect the following data: NYSE NASDAQ Number 21 25 Mean 3.27 2.53 Std dev 1.30 1.16 Is there a difference in the variances between the NYSE & NASDAQ at the  = 0.10 level? Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-71

F Test: Example Solution Form the hypothesis test: H0: σx2 = σy2 (there is no difference between variances) H1: σx2 ≠ σy2 (there is a difference between variances) Find the F critical values for  = .10/2: Degrees of Freedom: Numerator (NYSE has the larger standard deviation): nx – 1 = 21 – 1 = 20 d.f. Denominator: ny – 1 = 25 – 1 = 24 d.f. Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-72

F Test: Example Solution (continued) The test statistic is: H0: σx2 = σy2 H1: σx2 ≠ σy2 /2 = .05 F Do not reject H0 Reject H0 F = 1.256 is not in the rejection region, so we do not reject H0 Conclusion: There is not sufficient evidence of a difference in variances at  = .10 Copyright © 2013 Pearson Education, Inc. Publishing as Prentice Hall Ch. 10-73