Introduction to Hypothesis Testing Raymond J. Carroll Department of Statistics Faculty of Nutrition Texas A&M University

Slides:



Advertisements
Similar presentations
Statistics Versus Parameters
Advertisements

Chapter 8: Estimating with Confidence
Introduction to Confidence Intervals using Population Parameters Chapter 10.1 & 10.3.
Inference Sampling distributions Hypothesis testing.
Statistical Studies: Statistical Investigations
Inference1 Data Analysis Inferential Statistics Research Methods Gail Johnson.
Chapter 10: Hypothesis Testing
Copyright (c) Bani Mallick1 Stat 651 Lecture 5. Copyright (c) Bani Mallick2 Topics in Lecture #5 Confidence intervals for a population mean  when the.
QM Spring 2002 Business Statistics Introduction to Inference: Hypothesis Testing.
1 Trust and divorce Separated or Divorced trust | No Yes | Total Low | | 247 | |
1 The Basics of Regression Regression is a statistical technique that can ultimately be used for forecasting.
1 Hypothesis Testing In this section I want to review a few things and then introduce hypothesis testing.
Copyright (c) Bani Mallick1 Lecture 4 Stat 651. Copyright (c) Bani Mallick2 Topics in Lecture #4 Probability The bell-shaped (normal) curve Normal probability.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #15.
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
Chapter 3 Hypothesis Testing. Curriculum Object Specified the problem based the form of hypothesis Student can arrange for hypothesis step Analyze a problem.
Lecture Slides Elementary Statistics Twelfth Edition
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
Experimental Evaluation
Inferences About Process Quality
Confidence Intervals for Proportions
Comparing Means.  Comparing two means is not very different from comparing two proportions.  This time the parameter of interest is the difference between.
Inferential Statistics
Psy B07 Chapter 8Slide 1 POWER. Psy B07 Chapter 8Slide 2 Chapter 4 flashback  Type I error is the probability of rejecting the null hypothesis when it.
Significance Tests …and their significance. Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from.
June 18, 2008Stat Lecture 11 - Confidence Intervals 1 Introduction to Inference Sampling Distributions, Confidence Intervals and Hypothesis Testing.
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
8.1 Inference for a Single Proportion
Ch 8 Estimating with Confidence. Today’s Objectives ✓ I can interpret a confidence level. ✓ I can interpret a confidence interval in context. ✓ I can.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
M33 Confidence intervals 1  Department of ISM, University of Alabama, Confidence Intervals Estimation.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Hypothesis Testing for Proportions
Sampling Distribution ● Tells what values a sample statistic (such as sample proportion) takes and how often it takes those values in repeated sampling.
Significance Tests: THE BASICS Could it happen by chance alone?
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Chapter 23 Inference for One- Sample Means. Steps for doing a confidence interval: 1)State the parameter 2)Conditions 1) The sample should be chosen randomly.
Confidence Intervals: The Basics BPS chapter 14 © 2006 W.H. Freeman and Company.
Determination of Sample Size: A Review of Statistical Theory
Section 10.1 Confidence Intervals
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
HYPOTHESIS TESTING. A hypothesis test is a procedure for deciding if a null hypothesis should be accepted or rejected in favour of an alternative hypothesis.
Warm up On slide.
Chapter 10: Introduction to Statistical Inference.
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Introduction to Confidence Intervals using Population Parameters Chapter 10.1 & 10.3.
1 CHAPTER 4 CHAPTER 4 WHAT IS A CONFIDENCE INTERVAL? WHAT IS A CONFIDENCE INTERVAL? confidence interval A confidence interval estimates a population parameter.
Section Estimating a Proportion with Confidence Objectives: 1.To find a confidence interval graphically 2.Understand a confidence interval as consisting.
26134 Business Statistics Tutorial 11: Hypothesis Testing Introduction: Key concepts in this tutorial are listed below 1. Difference.
AP Statistics Section 11.1 B More on Significance Tests.
Inen 460 Lecture 2. Estimation (ch. 6,7) and Hypothesis Testing (ch.8) Two Important Aspects of Statistical Inference Point Estimation – Estimate an unknown.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
© Copyright McGraw-Hill 2004
One Sample Mean Inference (Chapter 5)
Confidence Interval Estimation For statistical inference in decision making: Chapter 9.
Copyright © 2009 Pearson Education, Inc. 8.1 Sampling Distributions LEARNING GOAL Understand the fundamental ideas of sampling distributions and how the.
INFERENTIAL STATISTICS DOING STATS WITH CONFIDENCE.
Point Estimates. Remember….. Population  It is the set of all objects being studied Sample  It is a subset of the population.
Section 10.2: Tests of Significance Hypothesis Testing Null and Alternative Hypothesis P-value Statistically Significant.
Today: Hypothesis testing. Example: Am I Cheating? If each of you pick a card from the four, and I make a guess of the card that you picked. What proportion.
Section 9.1 First Day The idea of a significance test What is a p-value?
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
Tutorial 11: Hypothesis Testing
CHAPTER 6 Random Variables
Chapter 4. Inference about Process Quality
Statistics and Data Analysis
Chapter Nine Part 1 (Sections 9.1 & 9.2) Hypothesis Testing
EXAMPLE: The weight of a can of Coca Cola is supposed to have mean = 12 oz with std. dev.= 0.82 oz. The product is declared underweight if it weighs.
Confidence Intervals.
Presentation transcript:

Introduction to Hypothesis Testing Raymond J. Carroll Department of Statistics Faculty of Nutrition Texas A&M University

Outline Series of Examples Data Collection for Examples

Example #1 My Hypothesis: Texas A&M Students simply guess when they are asked whether they are drinking diet Pepsi or diet Coke The Experiment: Blind taste test. You are asked which cup you drink is diet Coke Our Goal: Test this hypothesis, using statistical principles and probability statements

A Warning Yes or No: No statistician will ever answer a question “yes” or “no” Probabilities: We always say things like “the chance is less than 5% that your hypothesis is correct”

Example #1 Data Model: The data model is –Normal? –Gamma? –Binomial? –Poisson?

Example #1 Data Model: The data model is –Normal? –Gamma? –Binomial? Because each outcome is yes or no, success or failure –Poisson?

Example #1 My Hypothesis in terms of population parameters: I have claimed that you can do no better than guess Each of you is a Binomial(1,p) or Binomial(1  When I say you are guessing, what am I saying about the population?

Example #1 My Hypothesis in terms of population parameters: I have claimed that you can do no better than guess Each of you is a Binomial(1,p) or Binomial(1  When I say you are guessing, what am I saying about the population? That the proportion of successes is p =  = ½

Example #2 My Hypotheses: Keebler used to advertise 17 chocolate chip per cookie More chocolate chips than another brand The Experiment: Get a cookie of each type, count the number of chips, criticize the experiment Our Goal: Test these hypotheses, using statistical principles and probability statements

Example #2 Data Model: The data model is –Normal? –Gamma? –Binomial? –Poisson?

Example #2 Data Model:The data model is –Normal? –Gamma? –Binomial? –Poisson? It could be Poisson or normal. Poisson is the better choice, because it is a count We’ll use the central limit theorem to make inferences

Example #2 My Hypothesis in terms of population parameters: Keebler has claimed that it gives you 17 chips per cookie, on average Each of you is a Poisson with mean When I say Keebler is correct, what am I saying about the population?

Example #2 My Hypothesis in terms of population parameters: Keebler has claimed that it gives you 17 chips per cookie, on average Each of you is a Poisson with mean When I say Keebler is correct, what am I saying about the population? That the population mean number of chips is 17

Example #3 My Hypotheses: The percentage of regular M&M’s that are green is the same as the percentage of peanut M&M’s that are green The Experiment: Compute the percentage of green M&M’s in each bag Our Goal: Test these hypotheses, using statistical principles and probability statements

Example #3 Data Model: The data model is –Normal? –Gamma? –Binomial? –Poisson?

Example #3 Data Model:The data model is –Normal? –Gamma? –Binomial? –Poisson? Roughly normal, since each data point is a percentage We’ll use the central limit theorem to make inferences

Example #3 My Hypothesis in terms of population parameters: The %-green M&M’s does not depend on the type of M&M’s What am I saying about the two populations?

Example #3 My Hypothesis in terms of population parameters: The %-green M&M’s does not depend on the type of M&M’s What am I saying about the two populations? That they have the same population mean.

Example #4 My Hypotheses: Women who keep track of their diet by diaries or PDA do not lower their caloric intake in a 6-day period The Experiment: The WISH Study at the National Cancer Institute, with 400 women The data appear to contradict my hypothesis

Typical (Median) Values of Reported Caloric Intake Over 6 Diary Days: WISH Study A major point of STAT211 is to prepare you to answer the question as to whether these data, which look convincing, really are convincing in terms of probability statements.

Example #4 Data Model: The data model is –Normal? –Gamma? –Binomial? –Poisson?

Example #4 Data Model:The data model is –Normal? –Gamma? –Binomial? –Poisson? Lognormal, so most people take logarithms of caloric intake and analyze them as normal

Example #4 Data Model: The data that we use is the difference between Day 1 and Day 6, i.e., Day 1 – Day 6

Example #4 My Hypothesis in terms of population parameters: What am I saying about the population, when I claim that writing down diets will not lead to a change in reported caloric intake?

Example #4 My Hypothesis in terms of population parameters: What am I saying about the population, when I claim that writing down diets will not lead to a change in reported caloric intake? That the population mean difference between Day 1 and Day 6 = 0

Some Final Comments Formulating statistical hypothesis testing is really intuitive Don’t let the formulae obscure the fact that all we are doing is –Asking questions about population parameters –Constructing confidence intervals for population parameters –Using these confidence intervals to answer the question

The WISH Data I computed a 99% confidence interval for the population mean change in the WISH data. This interval was entirely above 0, and ranged roughly from 75 to 375 In other words, with 99% confidence, Day 1 reported between 75 and 375 more calories than Day 6. Is the hypothesis true?