Lecture 3 Outline: Tues, Jan 20 Chapter 1.3 Probability model for 2-group randomized experiment. Hypothesis testing review Randomization test p-value Principle.

Slides:



Advertisements
Similar presentations
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Advertisements

Testing Hypotheses About Proportions
Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
Lecture 3 Outline: Thurs, Sept 11 Chapters Probability model for 2-group randomized experiment Randomization test p-value Probability model for.
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
INFERENCE: SIGNIFICANCE TESTS ABOUT HYPOTHESES Chapter 9.
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
Introduction to Hypothesis Testing
Section 7.1 Hypothesis Testing: Hypothesis: Null Hypothesis (H 0 ): Alternative Hypothesis (H 1 ): a statistical analysis used to decide which of two competing.
Lecture 2: Thu, Jan 16 Hypothesis Testing – Introduction (Ch 11)
Stat 112 – Notes 3 Homework 1 is due at the beginning of class next Thursday.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
Chapter 9 Hypothesis Testing.
BCOR 1020 Business Statistics
Lecture 4 Outline Chapter 1.3, 1.5 Control in Experimental Design Causal Inference in Observational Studies Summarizing Data –Numerical methods –Graphical.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 9 Hypothesis Testing.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Descriptive statistics Inferential statistics
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Fundamentals of Hypothesis Testing: One-Sample Tests
Testing Hypotheses About Proportions
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 20 Testing Hypotheses About Proportions.
Hypothesis Testing.
Testing Hypotheses Tuesday, October 28. Objectives: Understand the logic of hypothesis testing and following related concepts Sidedness of a test (left-,
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Chapter 8 Introduction to Hypothesis Testing
Lecture 7 Introduction to Hypothesis Testing. Lecture Goals After completing this lecture, you should be able to: Formulate null and alternative hypotheses.
LECTURE 19 THURSDAY, 14 April STA 291 Spring
Stat 112 Notes 3 Today: –Finish Chapter 3.3 (Hypothesis Testing). Homework 1 due next Thursday.
Chapter 20 Testing hypotheses about proportions
Lecture 16 Dustin Lueker.  Charlie claims that the average commute of his coworkers is 15 miles. Stu believes it is greater than that so he decides to.
CHAPTER 17: Tests of Significance: The Basics
No criminal on the run The concept of test of significance FETP India.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 20 Testing Hypotheses About Proportions.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Chapter 221 What Is a Test of Significance?. Chapter 222 Thought Question 1 The defendant in a court case is either guilty or innocent. Which of these.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Chapter 20 Testing Hypothesis about proportions
Lecture 18 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
CHAPTER 9 Testing a Claim
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Rejecting Chance – Testing Hypotheses in Research Thought Questions 1. Want to test a claim about the proportion of a population who have a certain trait.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Logic and Vocabulary of Hypothesis Tests Chapter 13.
Introduction Suppose that a pharmaceutical company is concerned that the mean potency  of an antibiotic meet the minimum government potency standards.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
STA Lecture 221 !! DRAFT !! STA 291 Lecture 22 Chapter 11 Testing Hypothesis – Concepts of Hypothesis Testing.
AP Statistics Chapter 11 Notes. Significance Test & Hypothesis Significance test: a formal procedure for comparing observed data with a hypothesis whose.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
Today: Hypothesis testing p-value Example: Paul the Octopus In 2008, Paul the Octopus predicted 8 World Cup games, and predicted them all correctly Is.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 9 Testing a Claim 9.1 Significance Tests:
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Slide 20-1 Copyright © 2004 Pearson Education, Inc.
Chapter 20 Testing Hypotheses About Proportions. confidence intervals and hypothesis tests go hand in hand:  A confidence interval shows us the range.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
Statistics 20 Testing Hypothesis and Proportions.
Chapter 9 Testing A Claim 9.1 SIGNIFICANT TESTS: THE BASICS OUTCOME: I WILL STATE THE NULL AND ALTERNATIVE HYPOTHESES FOR A SIGNIFICANCE TEST ABOUT A POPULATION.
Chapter Nine Hypothesis Testing.
Module 10 Hypothesis Tests for One Population Mean
Testing Hypotheses About Proportions
Testing Hypotheses about Proportions
Daniela Stan Raicu School of CTI, DePaul University
Testing Hypotheses About Proportions
STA 291 Spring 2008 Lecture 17 Dustin Lueker.
Presentation transcript:

Lecture 3 Outline: Tues, Jan 20 Chapter 1.3 Probability model for 2-group randomized experiment. Hypothesis testing review Randomization test p-value Principle of control in experimental design

Vocabulary of Experiments A study is an experiment when we actually do something to people, animals or objects to observe the response. Experimental units are the things to which treatments are applied, e.g., people, rats, samples of materials or pieces of land. When units are human beings, they are called subjects. A specific experimental condition applied to the units is called a treatment. The “control” refers to a treatment that is considered a baseline for comparing all other treatments. Creativity study: Experimental units? Treatments?

Probability Model for 2-treatment Randomized Experiment Creativity Study –Chance mechanism for randomizing units to treatment groups ensures that every subset of 24 subjects gets the same chance of becoming intrinsic group –For example, 23 red and 24 black cards could be shuffled and dealt, one to each subject and the subjects with black cards would be the intrinsic group. –Tables of random numbers can be used to assign units to groups (assign the units with the 24 highest numbers to group 1).

Potential Outcomes Each subject has two potential outcomes -- Y i and Y i * -- that are the outcomes that would be observed if the subject was exposed to the extrinsic questionnaire and the intrinsic questionnaire respectively. We only see one potential outcome for each subject – the potential outcome for the questionnaire to which the subject was assigned. Another example of potential outcomes. Let the two treatments be do not attend college and attend college and the outcome be lifetime earnings. Y i is subject i’s lifetime earnings if she does not attend college and Y i * is subject i’s lifetime earnings if she does attend college.

Additive Treatment Effect Model Additive Treatment Effect Model: For every subject, Y*=Y+ is a parameter – an unknown constant that describes a key feature in model for answering questions of interest. Meaning of : – >0: Intrinsic questionnaire improves creativity. – =0 : Intrinsic questionnaire (treatment) makes no difference. – <0: Intrinsic questionnaire makes creativity worse.

Hypothesis Testing A key question in the additive treatment effect model is whether or not the taking the intrinsic vs. extrinsic questionnaire has a causal effect on outcome, i.e., does ? The goal of hypothesis testing is to decide which of two hypotheses, H 0 (null hypothesis) or H a (alternative or research hypothesis), is true based on the sample. Null hypothesis: Typically a simpler state of affairs; our focus is on asking whether there is enough evidence to disprove the null hypothesis. H 0 :, H a :

Risks of Hypothesis Testing Two types of errors are possible in hypothesis testing: –Type I error: Reject the null hypothesis when it is true –Type II error: Accept the null hypothesis when it is false. Probability of Type I error when H 0 is true = significance level of test, denoted by Probability of making correct decision when H a is true ( = 1-Prob. of Type II error) = power of test

Hypothesis Testing in the Courtroom Null hypothesis: The defendant is innocent Alternative hypothesis: The defendant is guilty The goal of the procedure is to determine whether there is enough evidence to conclude that the alternative hypothesis is true. The burden of proof is on the alternative hypothesis. Two types of errors: –Type I error: Reject null hypothesis when null hypothesis is true (convict an innocent defendant) –Type II error: Do not reject null hypothesis when null is false (fail to convict a guilty defendant)

Hypothesis Testing in Statistics Use test statistic that summarizes information about parameter in sample. Accept H 0 if the test statistic falls in a range of values that would be plausible if H 0 were true. Reject H 0 if the test statistic falls in a range of values that would be implausible if H 0 were true. Choose the rejection region so that the probability of rejecting H 0 if H 0 is true equals (most commonly 0.05) The p-value is the probability that the test statistic would be at least as implausible as it actually was if H 0 were true

Logic of Hypothesis Testing We assume H 0 is true and see what samples are most plausible. If the sample is implausible under H 0, we doubt that H 0 is true. Example: You are playing 5-card poker and the dealer sits down and draws 3 royal straight flushes in a row. The null hypothesis is that this is a fair poker game and the dealer is not cheating. The chance of a royal straight flush in a fair game is In a fair game, the chance of getting three hands as good as royal straight flushes in a row is * * =3.6* The p-value is 3.6* and we doubt H 0 is true.

Test Statistic for two group randomized experiment Test statistic for 2-group randomized exp.: –Let be the sample mean of the outcome for units assigned to group 1. –Let be the sample mean of the outcome for units assigned to group 2. –Test statistic: T= Test statistic’s value should be “close” to zero if is true and “far” from zero if is true. For motivation-creativity study, T=4.14. Is this “close” to zero or “far” from zero? Even if there is no treatment effect, T will not necessarily equal 0 because the random assignment can result in an uneven mix of abilities in the two groups.

Randomization Test p-value The observed value of the test statistic can be extreme (far from zero) because –(a) there is an effect of the treatment –(b) the random assignment resulted in an uneven mix A randomization test p-value is the probability associated with explanation (b) The smaller the p-value, the less believable (b) is as an explanation.

Exact Calculation of the p-value The p-value is the probability that |T|>=4.14 if, in fact, there is no treatment effect (and based on the random assignment of units to groups) Important starting point: If there is no treatment effect, then the creativity score for an individual would have been the same had they been assigned to the other group. Exact Calculation of p-value –Calculate T for every possible grouping of the 47 numbers into groups of size 23 and 24 –The p-value is the proportion of regroupings with |T|>=4.14.

Example Suppose the creativity study had just six students. Suppose the three students assigned to the intrinsic group had scores of 12, 20 and 28 and the three students assigned to the extrinsic group had scores of 10, 18 and 26. Calculate the p-value for testing if there is a treatment effect.

P-value for Creativity Study For the actual creativity study, using a computer program, the p-value is Conclusion: either –(i) there is no treatment effect and we happened to get an uneven randomization –(ii) there is a treatment effect. The probability associated with (i) is.011. So either there is a treatment effect or we obtained an unusual (one-in-a-hundred) randomization. A p-value of around.01 is considered strong evidence against the null hypothesis, see pg. 47.

Approximating the p-value For the creativity study, there are 1.6*10 13 different groupings. Approximating the randomization test p-value. –(i) Monte Carlo simulation: Randomly choose many groupings. Approximate the randomization distribution by the histogram of the test statistic for the randomly chosen groupings –(ii) (Chapter 2). The randomization distribution of the “t-statistic” is approximated by the “t”-distribution.

Decisions based on p-values In order to decide whether to accept H 0 or accept H a (reject H 0 ), we choose a significance level (prob. of type I error) Our decision rule is: Accept H 0 if p-value Reject H 0 if p-value By convention is often chosen to be Best to report the p-value itself rather than just whether H 0 was accepted or rejected because it provides more information (Sect ). Display 2.12 provides a chart for interpreting the p-value.

One-sided vs. Two-sided Tests For some problems, we might know that the treatment effect is >=0 or <=0 and want to use a one-sided alternative hypothesis –(i) H a : or –(ii) H a : For (i), large values of are considered implausible under H 0 and for (ii), small values of T are considered implausible under H 0. p-value for (i) is probability that T would be at least as large as it actually was if H 0 is true. Scientists mostly report two-sided tests.

Scope of Inference The conclusion that the intrinsic questionnaire causes a difference in creativity (i.e., ) strictly applies only to the subjects in the study. If the subjects were obtained by a random sample, then we could conclude that the intrinsic questionnaire has causal effects for a larger population (see Display 1.5).

The meaning of the causal inference In the motivation-creativity study, we concluded that there is a strong evidence that the “intrinsic questionnaire” treatment caused a difference in creativity compared to the “extrinsic questionnaire” treatment. This difference could be caused by anything that differs between the two treatments, e.g, the actual questionnaire, the order in which the poems were judged, the relative preferences of the judges for the two treatments.

Control in Experimental Design The principle of control in experimental design is to make sure that all other factors besides the intended treatments are kept the same in the different groups. Then we conclude that the intended treatment causes a difference between the groups. Examples of control: – Use a placebo for the control group. – Double blinding – Judge poems in random order.

Experimental Design Example: Salk Vaccine Field Trial In the first half of the 20 th century, polio was one of the most frightening diseases, striking hardest at young children and leaving many helpless cripples. By the 1950s, Jonas Salk developed a vaccine for polio that had proved promising in laboratory experiments but it was necessary to try it in the real world before releasing it for general use.

Designs for Salk Vaccine Field Trial Historical Control Approach: Distribute the vaccine as widely as possible, through the schools, to see whether the rate of reported polio was appreciably less than usual during the subsequent season. Observed Control Approach: Offer vaccination to all children in the second grade of participating schools and follow the polio experience not only in these children but in the first and third grade children. Placebo Control Approach: Choose the control group from the same population as the treatment group – children whose parents consented to vaccination. Assign the treatment randomly. Give a placebo to control group. Do not tell doctors which group children belong to.