Hypothesis Tests OR Tests of Significance One Sample Means.

Slides:

Advertisements

Similar presentations

Statistics Hypothesis Testing.

Advertisements

Two-Sample Inference Procedures with Means

Hypothesis Tests Hypothesis Tests One Sample Means.

Hypothesis Tests Hypothesis Tests One Sample Means.

Statistics for Managers Using Microsoft® Excel 5th Edition

Tests of significance: The basics BPS chapter 15 © 2006 W.H. Freeman and Company.

CHAPTER 23 Inference for Means.

Hypothesis Tests Hypothesis Tests One Sample Proportion.

Experimental Statistics - week 2

Overview Definition Hypothesis

Confidence Intervals and Hypothesis Testing - II

Fundamentals of Hypothesis Testing: One-Sample Tests

Section 9.1 Introduction to Statistical Tests 9.1 / 1 Hypothesis testing is used to make decisions concerning the value of a parameter.

We looked at screen tension and learned that when we measured the screen tension of 20 screens that the mean of the sample was We know the standard.

Chapter 11.1 Inference for the Mean of a Population.

Inference for One-Sample Means

Confidence Intervals with Means. What is the purpose of a confidence interval? To estimate an unknown population parameter.

Confidence Intervals and Hypothesis tests with Proportions.

Hypothesis Tests with Proportions Chapter 10 Notes: Page 169.

Hypothesis Tests with Proportions Chapter 10. Write down the first number that you think of for the following... Pick a two-digit number between 10 and.

Significance Tests: THE BASICS Could it happen by chance alone?

Inference for Proportions One Sample. Confidence Intervals One Sample Proportions.

10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.

Two-Sample Inference Procedures with Means. Of the following situations, decide which should be analyzed using one-sample matched pair procedure and which.

Hypothesis Tests for Notes: Page 194 Hypothesis Tests for One Sample Means Notes: Page 194.

Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.

Chapter 20 Testing hypotheses about proportions

1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.

Hypothesis Tests Hypothesis Tests One Sample Means.

The z test statistic & two-sided tests Section

Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.

Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry

Section 10.1 Estimating with Confidence AP Statistics February 11 th, 2011.

Hypothesis Tests One Sample Means

AP Statistics Chapter 24 Comparing Means.

CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.

Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.

Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide

AP Statistics Section 11.1 B More on Significance Tests.

Introduction to inference Tests of significance IPS chapter 6.2 © 2006 W.H. Freeman and Company.

CH 25 Paired Samples and Blocks. Paired Data 1. Observations that are collected in pairs (data on age differences between husbands and wives, for instance).

Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.

Significance Tests Section Cookie Monster’s Starter Me like Cookies! Do you? You choose a card from my deck. If card is red, I give you coupon.

Matched Pairs Test A special type of t-inference Notes: Page 196.

Hypothesis Tests Hypothesis Tests One Sample Means.

AP Statistics Tuesday, 09 February 2016 OBJECTIVE TSW explore Hypothesis Testing. Student to Ms. Havens: “Is either yesterday’s test or the previous test.

Of the following situations, decide which should be analyzed using one-sample matched pair procedure and which should be analyzed using two-sample procedures?

Copyright © 2009 Pearson Education, Inc. 9.2 Hypothesis Tests for Population Means LEARNING GOAL Understand and interpret one- and two-tailed hypothesis.

Hypothesis Tests Hypothesis Tests (for Means). 1. A government agency has received numerous complaints that a particular restaurant has been selling underweight.

Hypothesis Tests Hypothesis Tests Large Sample 1- Proportion z-test.

Hypothesis Tests Hypothesis Tests One Sample Means.

The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 9: Testing a Claim Section 9.1 Significance Tests: The Basics.

Inference with Proportions II Hypothesis Testing Using a Single Sample.

Slide Slide 1 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing Chapter 8.

4-1 Statistical Inference Statistical inference is to make decisions or draw conclusions about a population using the information contained in a sample.

Hypothesis Tests. 1. A lottery advertises that 10% of people who buy a lottery ticket win a prize. Recently, the organization that oversees this lottery.

Chapter Nine Hypothesis Testing.

Hypothesis Tests One Sample Means

Student t-Distribution

We looked at screen tension and learned that when we measured the screen tension of 20 screens that the mean of the sample was We know the pop.

Basketball Applet

Two-Sample Inference Procedures with Means

Hypothesis Tests One Sample Means

Hypothesis Tests for 1-Sample Proportion

Hypothesis Tests One Sample Means

Hypothesis Tests One Sample Means

Hypothesis Tests One Sample Means

Hypothesis Tests with Proportions

A special type of t-inference

Presentation transcript:

Hypothesis Tests OR Tests of Significance One Sample Means

Basketball Applet ► _666398__ _666398__ _666398__

Level of Significance Activity ► Fish Oil vs Regular Oil

Let’s say a government agency has received numerous complaints that a particular restaurant has been selling underweight hamburgers. The restaurant advertises that it’s patties are “a quarter pound” (4 ounces). How can I tell if they really are underweight? Take a sample & find x. expect unlikely But how do I know if this x is one that I expect to happen or is it one that is unlikely to happen? Hypothesis test will help me decide!

What are hypothesis tests? Calculations that tell us if a value occurs by random chance or not – if it is statistically significant Is it...  a random occurrence due to variation?  a biased occurrence due to some other reason?

Nature of hypothesis tests - ► First begin by supposing the claim or “effect” is NOT present ► Next, see if data provides evidence against the supposition Example:murder trial How does a murder trial work? First - assume that the person is innocent must Then – must have sufficient evidence to prove guilty Hmmmmm … Hypothesis tests use the same process!

Debra Interpretation ► Someone/some business makes a pronouncement ► You don’t believe it (if you did why test?) ► The null hypothesis is their pronouncement ► The alternative hypothesis is what you really think is going on ► So confidence tests or hypothesis tests are probability tests that measure how well the data and the pronouncement agree ► Confidence tests DO NOT make decisions regarding the trueness of the alternative hypothesis

Steps: 1) Assumptions 2) Hypothesis statements & define parameters 3) Calculations 4) Conclusion, in context Notice the steps are the same except we add hypothesis statements – which you will learn today

Assumptions for z-test (t-test): ► Have an SRS of context ► Distribution is (approximately) normal  Given  Large sample size  Graph data ►  is known (unknown) YEA YEA – These are the same assumptions as confidence intervals!!

Example 1: Bottles of a popular cola are supposed to contain 300 mL of cola. There is some variation from bottle to bottle. An inspector, who suspects that the bottler is under- filling, measures the contents of six randomly selected bottles. Are the assumptions met? Have an SRS of bottles Sampling distribution is approximately normal because the boxplot is symmetrical  is unknown

Writing Hypothesis statements: ► Null hypothesis – is the statement being tested; this is a statement of “no effect” or “no difference” - is an equality statement  In other words, the test that everything is as it should be H0:H0:

Writing Hypothesis statements: ► Alternative hypothesis – is the statement that we suspect is true  In other words, we think that the original claim is not true, and we think that the actual results are different in some way Ha:Ha:

The form: Null hypothesis H 0 : parameter = hypothesized value H 0 : parameter = hypothesized value Alternative hypothesis H a : parameter > hypothesized value H a : parameter > hypothesized value H a : parameter < hypothesized value H a : parameter < hypothesized value H a : parameter = hypothesized value H a : parameter = hypothesized value H 0 MUST be “=“ !

Example 2: A government agency has received numerous complaints that a particular restaurant has been selling underweight hamburgers. The restaurant advertises that it’s patties are “a quarter pound” (4 ounces). State the hypotheses : Where  is the true mean weight of hamburger patties H 0 :  = 4 H a :  < 4 Must define what µ is

Example 3: A car dealer advertises that is new subcompact models get 47 mpg. You suspect the mileage might be overrated. State the hypotheses : Where  is the true mean mpg H 0 :  = 47 H a :  < 47 Must define what µ is

Example 4: Many older homes have electrical systems that use fuses rather than circuit breakers. A manufacturer of 40-A fuses wants to make sure that the mean amperage at which its fuses burn out is in fact 40. If the mean amperage is lower than 40, customers will complain because the fuses require replacement too often. If the amperage is higher than 40, the manufacturer might be liable for damage to an electrical system due to fuse malfunction. State the hypotheses : Where  is the true mean amperage of the fuses H 0 :  = 40 H a :  = 40

Facts to remember about hypotheses: ► ALWAYS refer to populations (parameters) ► The null hypothesis for the “difference” between populations is usually equal to zero ► The null hypothesis for the correlation (rho) of two events is usually equal to zero. H 0 :  x-y = 0 H 0 :  = 0

Activity: For each pair of hypotheses, indicate which are not legitimate & explain why Must use parameter (population) x is a statistics (sample)  is the population proportion! Must use same number as H 0 !  is parameter for population correlation coefficient – but H 0 MUST be “=“ ! Must be NOT equal!

P-values - ► The probability that the test statistic would have a value as extreme or more than what is actually observed In other words... is it far out in the tails of the distribution?

P-values - ► The smaller the p-value, the stronger the evidence against H 0 provided by the data ► Large p-values fail to give evidence against H 0

Level of significance - ► This is the amount of evidence necessary before we begin to doubt that the null hypothesis is true ► Is the probability that we will reject the null hypothesis, assuming that it is true ► Denoted by   Can be any value  Usual values: 0.1, 0.05, 0.01  Most common is 0.05

Statistically significant – ► The p-value is as small or smaller than the level of significance (  ) ► If p > , “fail to reject” the null hypothesis at the  level. ► If p < , “reject” the null hypothesis at the  level.

Facts about p-values: ► ALWAYS make decision about the null hypothesis! ► Large p-values show support for the null hypothesis, but never that it is true! ► Small p-values show support that the null is not true. ► Double the p-value for two-tail (=) tests ► Never accept the null hypothesis!

Never “accept” the null hypothesis!

At an  level of.05, would you reject or fail to reject H 0 for the given p-values? a).03 b).15 c).45 d).023 Reject Fail to reject

Calculating p-values ► With z-test statistic (same as z-score but for statistic (ie sample))  normalcdf(lb, ub,[mean,standard deviation])  You may have to find the z-test number first ► With t-test statistic (same as t-score but for statistic (ie sample))  Use tcdf(lb, ub, df)  You may have to find the t-test number first

Draw & shade a curve & calculate the p-value: 1) right-tail test t = 1.6; n = 20 2) left-tail testz = -2.4; n = 15 3) two-tail testt = 2.3; n = 25

Writing Conclusions: 1) A statement of the decision being made (reject or fail to reject H 0 ) & why (linkage) 2) A statement of the results in context. (state in terms of H a ) AND

“Since the p-value ) , I reject (fail to reject) the H 0. There is (is not) sufficient evidence to suggest that H a.” Be sure to write H a in context (words)!

Example 5: Drinking water is considered unsafe if the mean concentration of lead is 15 ppb (parts per billion) or greater. Suppose a community randomly selects of 25 water samples and computes a t-test statistic of 2.1. Assume that lead concentrations are normally distributed. Write the hypotheses, calculate the p-value & write the appropriate conclusion for  = H 0 :  = 15 H a :  > 15 Where  is the true mean concentration of lead in drinking water P-value = tcdf(2.1,10^99,24) =.0232 t=2.1 Since the p-value < , I reject H 0. There is sufficient evidence to suggest that the mean concentration of lead in drinking water is greater than 15 ppb.

Example 6: A certain type of frozen dinners states that the dinner contains 240 calories. A random sample of 12 of these frozen dinners was selected from production to see if the caloric content was greater than stated on the box. The t-test statistic was calculated to be 1.9. Assume calories vary normally. Write the hypotheses, calculate the p-value & write the appropriate conclusion for  = H 0 :  = 240 H a :  > 240 Where  is the true mean caloric content of the frozen dinners P-value = tcdf(1.9,10^99,11) =.0420 t=1.9 Since the p-value < , I reject H 0. There is sufficient evidence to suggest that the true mean caloric content of these frozen dinners is greater than 240 calories.

Formulas:  known: z = 

Formulas:  unknown: t = 

Example 7: The Fritzi Cheese Company buys milk from several suppliers as the essential raw material for its cheese. Fritzi suspects that some producers are adding water to their milk to increase their profits. Excess water can be detected by determining the freezing point of milk. The freezing temperature of natural milk varies normally, with a mean of degrees and a standard deviation of Added water raises the freezing temperature toward 0 degrees, the freezing point of water (in Celsius). The laboratory manager measures the freezing temperature of five randomly selected lots of milk from one producer with a mean of degrees. Is there sufficient evidence to suggest that this producer is adding water to his milk?

Assumptions: I have an SRS of milk from one producer The freezing temperature of milk is a normal distribution. (given)  is known SRS? Normal? How do you know? Do you know  ? H 0 :  = H a :  > where  is the true mean freezing temperature of milk What are your hypothesis statements? Is there a key word? Plug values into formula. p-value = normalcdf(1.9566,1E99)=.0252 Use normalcdf to calculate p-value.  =.05

Conclusion: Compare your p-value to  & make decision Since p-value < , I reject the null hypothesis. Write conclusion in context in terms of H a. There is sufficient evidence to suggest that the true mean freezing temperature is greater than This suggests that the producer is adding water to the milk.

Example 8: The Degree of Reading Power (DRP) is a test of the reading ability of children. Here are DRP scores for a random sample of 44 third-grade students in a suburban district: (data on note page) At the  =.1, is there sufficient evidence to suggest that this district’s third graders reading ability is different than the national mean of 34?

I have an SRS of third-graders Since the sample size is large, the sampling distribution is approximately normally distributed OR Since the histogram is unimodal with no outliers, the sampling distribution is approximately normally distributed  is unknown SRS? Normal? How do you know? Do you know  ? What are your hypothesis statements? Is there a key word? Plug values into formula. p-value = tcdf(.6467,1E99,43)=.2606(2)=.5212 Use tcdf to calculate p-value.  =.1 H 0 :  = 34where  is the true mean reading H a :  = 34 ability of the district’s third-graders

Conclusion: Compare your p-value to  & make decision Since p-value > , I fail to reject the null hypothesis. Write conclusion in context in terms of H a. There is not sufficient evidence to suggest that the true mean reading ability of the district’s third-graders is different than the national mean of 34.

Example 9: The Wall Street Journal (January 27, 1994) reported that based on sales in a chain of Midwestern grocery stores, President’s Choice Chocolate Chip Cookies were selling at a mean rate of $1323 per week. Suppose a random sample of 30 weeks in 1995 in the same stores showed that the cookies were selling at the average rate of $1208 with standard deviation of $275. Does this indicate that the sales of the cookies is different from the earlier figure?

Assume: Have an SRS of weeks Distribution of sales is approximately normal due to large sample size  unknown H0:  = 1323 where  is the true mean cookie sales per Ha:  ≠ 1323week Since p-value <  of 0.05, I reject the null hypothesis. There is sufficient evidence to suggest that the sales of cookies are different from the earlier figure.

Example 9 Cont.: President’s Choice Chocolate Chip Cookies were selling at a mean rate of $1323 per week. Suppose a random sample of 30 weeks in 1995 in the same stores showed that the cookies were selling at the average rate of $1208 with standard deviation of $275. Compute a 95% confidence interval for the mean weekly sales rate. CI = ($ , $ ) Based on this interval, is the mean weekly sales rate statistically different from the reported $1323?

What do you notice about the decision from the confidence interval & the hypothesis test? You expect that the significance test would support the confidence interval and visa versa. That means that if you reject the null hypothesis, you would expect that the null hypothesis value to be outside of your calculated interval.

Matched Pairs Test A special type of t- inference

Matched Pairs – two forms ► Pair individuals by certain characteristics ► Randomly select treatment for individual A ► Individual B is assigned to other treatment ► Assignment of B is dependent on assignment of A ► Individual persons or items receive both treatments ► Order of treatments are randomly assigned or before & after measurements are taken ► The two measures are dependent on the individual

Is this an example of matched pairs? 1)A college wants to see if there’s a difference in time it took last year’s class to find a job after graduation and the time it took the class from five years ago to find work after graduation. Researchers take a random sample from both classes and measure the number of days between graduation and first day of employment No, there is no pairing of individuals, you have two independent samples

Is this an example of matched pairs? 2) In a taste test, a researcher asks people in a random sample to taste a certain brand of spring water and rate it. Another random sample of people is asked to taste a different brand of water and rate it. The researcher wants to compare these samples No, there is no pairing of individuals, you have two independent samples – If you would have the same people taste both brands in random order, then it would bean example of matched pairs.

Is this an example of matched pairs? 3) A pharmaceutical company wants to test its new weight-loss drug. Before giving the drug to a random sample, company researchers take a weight measurement on each person. After a month of using the drug, each person’s weight is measured again. Yes, you have two measurements that are dependent on each individual.

A whale-watching company noticed that many customers wanted to know whether it was better to book an excursion in the morning or the afternoon. To test this question, the company collected the following data on 15 randomly selected days over the past month. (Note: days were not consecutive.) Day Morning After- noon First, you must find the differences for each day. Since you have two values for each day, they are dependent on the day – making this data matched pairs You may subtract either way – just be careful when writing H a

Day Morning After- noon Differen ces Assumptions: Have an SRS of days for whale-watching  unknown Since the normal probability plot is approximately linear, the distribution of difference is approximately normal. I subtracted: Morning – afternoon You could subtract the other way! You need to state assumptions using the differences! Notice the granularity in this plot, it is still displays a nice linear relationship!

Differen ces Is there sufficient evidence that more whales are sighted in the afternoon? Be careful writing your H a ! Think about how you subtracted: M-A If afternoon is more should the differences be + or -? Don’t look at numbers!!!! H 0 :  D = 0 H a :  D < 0 Where  D is the true mean difference in whale sightings from morning minus afternoon Notice we used  D for differences & it equals 0 since the null should be that there is NO difference. If you subtract afternoon – morning; then H a :  D >0

Differen ces finishing the hypothesis test: Since p-value > , I fail to reject H 0. There is insufficient evidence to suggest that more whales are sighted in the afternoon than in the morning. Notice that if you subtracted A-M, then your test statistic t = +.945, but p- value would be the same In your calculator, perform a t-test using the differences (L3)