Significance Tests in practice

Slides:



Advertisements
Similar presentations
Lesson Tests about a Population Mean. Knowledge Objectives Define the one-sample t statistic.
Advertisements

Chapter 9 Tests of Significance Target Goal: I can perform a significance test to support the alternative hypothesis. I can interpret P values in context.
Comparing 2 population parameters Chapter 13. Introduction: Two Sample problems  Ex: How do small businesses that fail differ from those that succeed?
BPS - 3rd Ed. Chapter 141 Tests of Significance: The Basics.
Significance Tests in practice Chapter Tests about a population mean  When we don’t know the population standard deviation σ, we perform a one.
Tests about a Population Mean σ unkown. p ,2,3 p ,6,7.
Essential Statistics Chapter 131 Introduction to Inference.
CHAPTER 14 Introduction to Inference BPS - 5TH ED.CHAPTER 14 1.
Section 10.1 Estimating with Confidence AP Statistics February 11 th, 2011.
Daniel S. Yates The Practice of Statistics Third Edition Chapter 12: Significance Tests in Practice Copyright © 2008 by W. H. Freeman & Company.
The Practice of Statistics Third Edition Chapter 12: Significance Tests in Practice Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates.
Warm Up Exercise Warm Up Write down the four steps to confidence interval. 2.
Chapters 22, 24, 25 Inference for Two-Samples. Confidence Intervals for 2 Proportions.
Review of Power of a Test
Section 9.3 One-sample z and t-test for a Population Mean
CHAPTER 9 Testing a Claim
Lecture #23 Tuesday, November 8, 2016 Textbook: 13.1 through 13.4
Chapter 8: Estimating with Confidence
Inference for Two-Samples
Paired Samples and Blocks
Inference about Two Means - Independent Samples
Two-Sample Hypothesis Testing
Chapter 9: Testing a Claim
Chapters 20, 21 Hypothesis Testing-- Determining if a Result is Different from Expected.
Inference for Distributions
Chapter 8: Estimating with Confidence
Chapter 9: testing a claim
Hypothesis Testing for Proportions
CHAPTER 8 Confidence Estimating with Estimating a Population 8.3 Mean
Chapter 9 Testing A Claim
Hypothesis Tests for a Population Mean in Practice
Section 10.2: Tests of Significance
Section 10.2 Tests of Significance
CHAPTER 9 Testing a Claim
Lesson Comparing Two Means.
Essential Statistics Introduction to Inference
YOU HAVE REACHED THE FINAL OBJECTIVE OF THE COURSE
Tests of Significance.
Chapter 8: Estimating with Confidence
Click the mouse button or press the Space Bar to display the answers.
Chapter 8: Estimating with Confidence
Click the mouse button or press the Space Bar to display the answers.
CHAPTER 9 Testing a Claim
Lecture 10/24/ Tests of Significance
Tests About a Population Proportion
Chapter 9: Testing a Claim
Chapter 9 Testing a Claim
Basic Practice of Statistics - 3rd Edition Introduction to Inference
Review of Chapter 10 Comparing Two Population Parameters
Day 63 Agenda:.
Chapter 9: Testing a Claim
CHAPTER 9 Testing a Claim
Chapter 8: Estimating with Confidence
CHAPTER 9 Testing a Claim
Click the mouse button or press the Space Bar to display the answers.
CHAPTER 9 Testing a Claim
AP Statistics Chapter 12 Notes.
CHAPTER 9 Testing a Claim
Chapter 9: Testing a Claim
Carrying Out Significance Tests
CHAPTER 18: Inference about a Population Mean
Chapter 8: Estimating with Confidence
CHAPTER 9 Testing a Claim
Chapter 9: Testing a Claim
CHAPTER 9 Testing a Claim
CHAPTER 9 Testing a Claim
Section 10.2 Comparing Two Means.
Chapter 8: Estimating with Confidence
Chapter 8: Estimating with Confidence
Basic Practice of Statistics - 5th Edition Introduction to Inference
Presentation transcript:

Significance Tests in practice Chapter 12

12.1 Tests about a population mean When we don’t know the population standard deviation σ, we perform a one sample T test instead of a Z test We get a “T calc” (instead of a Z calc) and the formula is the same except we put in the standard deviation Sx of our sample as a replacement for σ

Interpretation and P values same as Z interval, except if you are using the tables, look up “T star” to find the critical T value you need to beat. I recommend the calculator though… The P value is from your calculated T to the tail. If it is a 2 tailed test, remember this gets doubled!

T test on calc Go to Stat/Tests choose 2: T-test. Enter either data or Stats (just like Z test)

Example: Diversity (P. 750) An investor with a stock portfolio sued his broker b/c lack of diversification in his portfolio led to poor performance. The table gives the rates of return for the 39 months that the account was managed by the broker. Consider the 39 monthly returns as a random sample from the monthly returns the broker would generate if he managed the account forever. Are these returns compatible with a population of μ=.95%, The S&P 500 average?

SRS- We are told this in problem -8.36 1.63 -2.27 -2.93 -2.70 -9.14 -2.64 6.82 -2.35 -3.58 6.13 7 -15.25 -8.66 -1.03 -9.16 -1.25 -1.22 -10.27 -5.11 -.80 -1.44 1.28 -.65 4.34 12.22 -7.21 -.09 7.34 5.04 -7.24 -2.14 -1.01 -1.41 12.03 -2.56 4.33 2.35 Step 1: Hypothesis- (Where μ is the mean return for all possible months that the broker could manage this account) H0: μ = .95 HA: μ ≠ .95 Step 2: Conditions- Since we do not know σwe must use a one-sample t test. Now we check conditions: SRS- We are told this in problem Normality: Sample is large enough (39) so CLT applies Independence: We must treat the 39 monthly returns as independent observations from the population of months in which the broker could have managed the account.

Step 3: Calculations- Test Statistic: (-1.10 - .95) / (5.99/√39) = -2.14 P value: df are 39-1 = 38. Since we are performing a 2-tailed test, our P-value is .039 (can be done on calc, or on table- if doing table, make sure to double the area under the curve b/c of 2 tailed!) Step 4: Interpretation: The mean monthly return on investment for this client’s account was -1.1% for this period (this is our x bar). This is significantly difference at the alpha = .05 level of the S&P 500 for the same period (t = -2.14, P<.05)

Paired T Test – Testing the differences Example: Sweet Cola (P. 746) Diet colas use artificial sweeteners which gradually lose their sweetness overtime. Trained tasters sip cola and give it a “sweetness scale rating” of 1 to 10. Then it is stored for 4 months and tested again. The differences in sweetness score are listed (The bigger the differences, the bigger the loss of sweetness, negative value means it gained sweetness) 2 .4 .7 -.4 2.2 -1.3 1.2 1.1 2.3

Step 1: Hypothesis- since we care about the difference, μbefore-μafter = μdiff H0: μdiff = 0 “The mean sweetness loss for the population of tasters is zero” HA: μdiff > 0 “The mean sweetness loss for the population of tasters is positive. The cola seems to be losing sweetness in storage Step 2: Conditions- Since we do not know the standard deviation of sweetness loss in the population of tasters, we must use a one-sample t test. Now we check conditions: SRS- assume the 10 tasters are randomly selected Normality: 10 is too small a sample so we check our normal probability plot

It is slightly left skewed, but no gaps/outliers so we proceed with caution Independence: we assume there are more then 10x10 = 100 tasters in the population

Step 3: Calculations Test Statistic- The basic statistics are X bardiff = 1.01 and Sdiff= 1.196 Tcalc= (1.02 – 0)/ (1.196/√10) = 2.70 P value: degrees of freedom are 10-1 = 9 for alpha = .05, p = .0123

Step 4: Interpretation- A P-value this low gives strong evidence against the null hypothesis. We reject H0 and conclude that the cola has lost sweetness during storage *Remember the 3C’s! (Conclusion, connection, context)

Robustness and Power Recall that t procedures are robust against non-normality of the population except when outliers or strong skewness are present (skewness is more serious). As the sample size increases, the CLT ensures that our distribution of sample means approximates normality. Again, calculating power is not in this course, but remember we can still commit the same type I / type II errors when rejecting (or failing to reject) the null.

12.2 Tests about a population proportion Remember, when the 3 conditions are met (SRS, Normality, and Indep.), the sampling distribution of p hat is approximately Normal with mean equal to rho and standard deviation = √(pq/n)

Note: In the standard error (denominator of test statistic) we use the values of our assumed null hypothesis- this is different from the standard error of the CI where we used sample proportion values!

One proportion z on calc Stat- Test – 1-propZtest put in: Rho x = the count (number of successes in your sample) n = number of people in your sample choose one-tailed or 2 tailed hypothesis calculate

Example (P. 767) work stress Job stress poses a major threat to the health of workers. A national survey or restaurant employees found that 75% said that work stress had a negative impact on their personal lives. A random sample of 100 employees from a large restaurant chain finds that 68 answer “yes” when asked “Does work stress have a negative impact on your personal life?”. Is this good reason to think that the proportion of all employees in this chain who would say “yes” differs from the national proportion of ρ= .75?

Hypothesis: we want to test a claim about ρ, the true proportion of this chain’s employees who would say that work stress has a negative impact on their personal lives. H0: ρ=.75 HA: ρ ≠ .75 Conditions: We should use a one-proportion Z test if the conditions are met SRS: we are told in problem Normality: The expected number of “yes” and “no” responses are (100)(.75) = 75 and (100)(.25) = 25 respectively. Both are at least 10 s Independence: We assume the chain has at least 10(100) = 1000 employees

Calculations Test Statistic: P value = .1052 (if you do normalcdf, remember to double your value b/c this is a 2 tailed test! so 2x.0526!) BUT! MUCH easier to do on Calc though and not use NormalCDF! Try it Interpretation There is over a 10% chance of obtaining a sample result as unusual as or even more unusual than we did (p hat = .68) when the null is true so we have insufficient evidence to suggest that the proportion of this chain restaurant’s employees who suffer from work stress is different from the national survey result, .75

Confidence Intervals using the info from that same example, if we were to calculate a 95% CI (.59, .77) by hand, using formula: Note: the CI standard error uses the sample proportions, but in the test statistic formula we use the value assumed in the null hypothesis Using calc is easier though: choose 1 prop z int Interpretation: We are 95% confident that between 59% and 77% of the restaurant chain’s employees feel that work stress is damaging their personal lives.