Synthesis and Review 2/20/12 Hypothesis Tests: the big picture Randomization distributions Connecting intervals and tests Review of major topics Open Q+A.

Slides:



Advertisements
Similar presentations
Panel at 2013 Joint Mathematics Meetings
Advertisements

STAT 101 Dr. Kari Lock Morgan
Hypothesis Testing: Intervals and Tests
Hypothesis Testing I 2/8/12 More on bootstrapping Random chance
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: p-value STAT 250 Dr. Kari Lock Morgan SECTION 4.2 Randomization distribution p-value.
Hypothesis Testing: Hypotheses
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 9/6/12 Describing Data: One Variable SECTIONS 2.1, 2.2, 2.3, 2.4 One categorical.
Copyright ©2011 Brooks/Cole, Cengage Learning Testing Hypotheses about Means Chapter 13.
Describing Data: One Quantitative Variable
Stat 301 – Day 15 Comparing Groups. Statistical Inference Making statements about the “world” based on observing a sample of data, with an indication.
Stat Day 16 Observations (Topic 16 and Topic 14)
Monday, 4/29/02, Slide #1 MA 102 Statistical Controversies Monday, 4/29/02 Today: CLOSING CEREMONIES!  Discuss HW #3  Review for final exam  Evaluations.
Stat 512 – Lecture 12 Two sample comparisons (Ch. 7) Experiments revisited.
Stat 512 – Lecture 13 Chi-Square Analysis (Ch. 8).
Chapter Sampling Distributions and Hypothesis Testing.
Stat 217 – Day 25 Regression. Last Time - ANOVA When?  Comparing 2 or means (one categorical and one quantitative variable) Research question  Null.
STAT 101 Dr. Kari Lock Morgan Exam 2 Review.
Section 4.4 Creating Randomization Distributions.
Chapter 9 Hypothesis Testing.
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Simple Linear Regression Least squares line Interpreting coefficients Prediction Cautions The formal model Section 2.6, 9.1, 9.2 Professor Kari Lock Morgan.
Confidence Intervals and Hypothesis Tests
Statistics: Unlocking the Power of Data Lock 5 1 in 8 women (12.5%) of women get breast cancer, so P(breast cancer if female) = in 800 (0.125%)
ANOVA 3/19/12 Mini Review of simulation versus formulas and theoretical distributions Analysis of Variance (ANOVA) to compare means: testing for a difference.
Hypothesis Testing III 2/15/12 Statistical significance Errors Power Significance and sample size Section 4.3 Professor Kari Lock Morgan Duke University.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 101 Dr. Kari Lock Morgan SECTION 4.1 Statistical test Null and alternative.
Synthesis and Review 3/26/12 Multiple Comparisons Review of Concepts Review of Methods - Prezi Essential Synthesis 3 Professor Kari Lock Morgan Duke University.
More Randomization Distributions, Connections
Statistics: Unlocking the Power of Data Lock 5 Normal Distribution STAT 250 Dr. Kari Lock Morgan Chapter 5 Normal distribution Central limit theorem Normal.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Nathaniel Cannon Describing Data: Categorical Variables SECTIONS 2.1 One categorical variable Two.
Confidence Intervals I 2/1/12 Correlation (continued) Population parameter versus sample statistic Uncertainty in estimates Sampling distribution Confidence.
Essential Synthesis SECTION 4.4, 4.5, ES A, ES B
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
Statistics: Unlocking the Power of Data Lock 5 Synthesis STAT 250 Dr. Kari Lock Morgan SECTIONS 4.4, 4.5 Connecting bootstrapping and randomization (4.4)
Using Lock5 Statistics: Unlocking the Power of Data
Statistics: Unlocking the Power of Data Lock 5 Afternoon Session Using Lock5 Statistics: Unlocking the Power of Data Patti Frazer Lock University of Kentucky.
Estimation: Sampling Distribution
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 9/18/12 Confidence Intervals: Bootstrap Distribution SECTIONS 3.3, 3.4 Bootstrap.
LECTURE 19 THURSDAY, 14 April STA 291 Spring
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 9/11/12 Describing Data: Two Variables SECTIONS 2.1, 2.4, 2.5 Two categorical.
Statistics: Unlocking the Power of Data Lock 5 Normal Distribution STAT 101 Dr. Kari Lock Morgan 10/18/12 Chapter 5 Normal distribution Central limit theorem.
Agresti/Franklin Statistics, 1 of 122 Chapter 8 Statistical inference: Significance Tests About Hypotheses Learn …. To use an inferential method called.
Confidence intervals are one of the two most common types of statistical inference. Use a confidence interval when your goal is to estimate a population.
Inference We want to know how often students in a medium-size college go to the mall in a given year. We interview an SRS of n = 10. If we interviewed.
CHAPTER 17: Tests of Significance: The Basics
Review of Chapters 1- 6 We review some important themes from the first 6 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
CHAPTER 9 Testing a Claim
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
Introducing Inference with Bootstrapping and Randomization Kari Lock Morgan Department of Statistical Science, Duke University with.
Statistics: Unlocking the Power of Data Lock 5 Bootstrap Intervals Dr. Kari Lock Morgan PSU /12/14.
Bayesian Inference, Review 4/25/12 Frequentist inference Bayesian inference Review The Bayesian Heresy (pdf)pdf Professor Kari Lock Morgan Duke University.
Statistics: Unlocking the Power of Data Lock 5 Exam 2 Review STAT 101 Dr. Kari Lock Morgan 11/13/12 Review of Chapters 5-9.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 12/6/12 Synthesis Big Picture Essential Synthesis Bayesian Inference (continued)
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Describing Data: One Quantitative Variable SECTIONS 2.2, 2.3 One quantitative.
Statistics: Unlocking the Power of Data Lock 5 Hypothesis Testing: Hypotheses STAT 250 Dr. Kari Lock Morgan SECTION 4.1 Hypothesis test Null and alternative.
Statistics: Unlocking the Power of Data Lock 5 Inference for Means STAT 250 Dr. Kari Lock Morgan Sections 6.4, 6.5, 6.6, 6.10, 6.11, 6.12, 6.13 t-distribution.
Statistical Inference Drawing conclusions (“to infer”) about a population based upon data from a sample. Drawing conclusions (“to infer”) about a population.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Synthesis and Review for Exam 2.
Statistics Nik Bobrovitz BHSc, MSc PhD Student University of Oxford December 2015
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Synthesis and Review for Exam 1.
Describing Data: Two Variables
Synthesis and Review for Exam 1
Simulation-Based Approach for Comparing Two Means
Chapter 9: Hypothesis Tests Based on a Single Sample
Presentation transcript:

Synthesis and Review 2/20/12 Hypothesis Tests: the big picture Randomization distributions Connecting intervals and tests Review of major topics Open Q+A Section 4.4, 4.5, ES 2 Professor Kari Lock Morgan Duke University

Make one double-sided page of notes for in- class exam WORK PRACTICE PROBLEMS! (old exams and solutions to review questions under Documents on course website) Read sections corresponding to anything you are still confused about Practice using technology to summarize, visualize, and perform inference on data To Do

Today, 3 – 4 pm (Prof Morgan) Today, 4 – 6 pm (Christine) Tuesday, 3 – 6 pm (Prof Morgan) Tuesday, 6 – 8 pm (Yue) Tuesday, 8 – 9 pm (Michael) (My office hours this week have been moved to Monday and Tuesday to answer questions before the exam) Office Hours this Week

Hypothesis Testing 1.Define the parameter(s) of interest 2.State your hypotheses 3.Set significance level,  (usually 0.05 if unspecified) 4.(Collect your data) 5.Plot your data 6.Calculate the observed sample statistic 7.Create a randomization distribution 8.Calculate the p-value 9.Assess the strength of evidence against H 0 10.Make a formal decision based on the significance level 11.Interpret the conclusion in context

Exercise and Gender Among college students, does one gender exercise more than the other?

Exercise and Gender

p-value =  Little evidence against H 0  Do not reject H 0  This study does not provide evidence that there is any association between gender and exercise times among college students Conclusion: Results this extreme would happen about 22% of the time just by random chance if H 0 were true, so this study does not provide adequate evidence against H 0 Think:

A randomization distribution is the distribution of statistics that would be observed, just by random chance, if the null hypothesis were true 1.Simulate randomizations assuming the null hypothesis is true 2.Calculate the statistic for each simulated randomization Randomization Distribution

In a randomized experiment the “randomness” is the random allocation of cases to treatment groups If the null hypothesis is true, it doesn’t make any difference which treatment group you get placed in Simulate randomizations assuming H 0 is true by reallocate units to treatment groups, and keeping the response values the same Randomized Experiments

In observational studies, there is no random allocation to treatment groups In observational studies, what does “by random chance” even mean? What is random??? How could we generate a randomization distribution for observational studies? Observational Studies

When data is collected by random sampling, without random allocation between groups, we can bootstrap to see what would happen by random chance Bootstrapping (resampling with replacement) simulates the distribution of the sample statistic that we would observe when taking many random samples of the population Bootstrapping

For a randomization distribution however, we need to know the distribution of the sample statistic, when the null hypothesis is true How could we bootstrap assuming the null hypothesis is true? Add/subtract values to each unit first to make the null hypothesis true (“shift the distribution”) Bootstrapping

Reallocating versus Resampling What is random?How do we simulate “random chance”? Randomized Experiments Random assignment to treatment groups Reallocate (rerandomize) Observational Studies Random sampling from the population Resample (bootstrap) In both cases, we need to make the null hypothesis true for a randomization distribution

Was the exercise by gender data collected via a randomized experiment? (a)Yes (b) No (c) There is no way to tell Exercise by Gender

The randomness is not who is which gender (as with randomized experiments), but who is selected to be a part of the study Male sample mean: 12.4 hours Female sample mean: 9.4 hours Add 3 hours to all the females, and then resample using bootstrapping Exercise by Gender

Reallocating and resampling usually give similar answers in terms of a p-value For this class, it is fine to just use reallocating for tests, even if it is not actually a randomized experiment The point is to understand the reason for generating a randomization distribution Method of Randomization

Let’s return to the body temperature data Using bootstrapping, we found a 95% confidence interval for the mean body temperature to be (98.05 ,  ) Let’s do a hypothesis test to see how much evidence this data provides against  = 98.6  H 0 :  = 98.6  H a :  ≠ 98.6  Body Temperatures

How would we create a randomization distribution? The sample mean is . Add 0.34  to each unit so we can sample with replacement mimicking sampling from a population with mean 98.6  Take many bootstrap samples to create a randomization distribution Body Temperatures

Randomization Distribution p-value = 0.002

Two Distributions

If a (1 – α)% confidence interval does not contain the value of the null hypothesis, then a two-sided hypothesis test will reject the null hypothesis using significance level α Intervals provide a range of plausible values for the population parameter, tests are designed to assess evidence against a null hypothesis Intervals and Tests

Using bootstrapping, we found a 95% confidence interval for the mean body temperature to be (98.05 ,  ) H 0 :  = 98.6  H a :  ≠ 98.6  At α = 0.05, we would reject H 0 Body Temperatures

REVIEW

Population Sample Sampling Statistical Inference The Big Picture Exploratory Data Analysis

Was the sample randomly selected? Possible to generalize to the population Yes Should not generalize to the population No Was the explanatory variable randomly assigned? Possible to make conclusions about causality Yes Can not make conclusions about causality No Data Collection

Variable(s)VisualizationSummary Statistics Categoricalbar chart, pie chart frequency table, relative frequency table, proportion Quantitativedotplot, histogram, boxplot mean, median, max, min, standard deviation, z-score, range, IQR, five number summary Categorical vs Categorical side-by-side bar chart, segmented bar chart, mosaic plot two-way table, proportions Quantitative vs Categorical side-by-side boxplotsstatistics by group Quantitative vs Quantitative scatterplotcorrelation

Descriptive Statistics Think of a topic or question you would like to use data to help you answer. – What would the cases be? – What would the variables be? (Limit to one or two variables)

Descriptive Statistics How would you visualize and summarize the variable or relationship between variables? a)bar chart/pie chart, proportions, frequency table/relative frequency table b)dotplot/histogram/boxplot, mean/median, sd/range/IQR, five number summary c)side-by-side or segmented bar plots/mosaic plots, difference in proportions, two-way table d)side-by-side boxplot, stats by group e)scatterplot, correlation

Statistic vs Parameter A sample statistic is a number computed from sample data. A population parameter is a number that describes some aspect of a population

Sampling Distribution A sampling distribution is the distribution of statistics computed for different samples of the same size taken from the same population The spread of the sampling distribution helps us to assess the uncertainty in the sample statistic In real life, we rarely get to see the sampling distribution – we usually only have one sample

A bootstrap sample is a random sample taken with replacement from the original sample, of the same size as the original sample A bootstrap statistic is the statistic computed on the bootstrap sample A bootstrap distribution is the distribution of many bootstrap statistics Bootstrap

Original Sample Bootstrap Sample Bootstrap Statistic Sample Statistic Bootstrap Statistic Bootstrap Distribution

Confidence Interval A confidence interval for a parameter is an interval computed from sample data by a method that will capture the parameter for a specified proportion of all samples A 95% confidence interval will contain the true parameter for 95% of all samples

Standard Error The standard error (SE) is the standard deviation of the sample statistic The SE can be estimated by the standard deviation of the bootstrap distribution For symmetric, bell-shaped distributions, a 95% confidence interval is

Percentile Method If the bootstrap distribution is approximately symmetric, a P% confidence interval can be gotten by taking the middle P% of a bootstrap distribution

Bootstrap Distribution

How unusual would it be to get results as extreme (or more extreme) than those observed, if the null hypothesis is true? If it would be very unusual, then the null hypothesis is probably not true! If it would not be very unusual, then there is not evidence against the null hypothesis Hypothesis Testing

The p-value is the probability of getting a statistic as extreme (or more extreme) as that observed, just by random chance, if the null hypothesis is true The p-value measures evidence against the null hypothesis p-value

Hypothesis Testing

A randomization distribution is the distribution of sample statistics we would observe, just by random chance, if the null hypothesis were true The p-value is calculated by finding the proportion of statistics in the randomization distribution that fall beyond the observed statistic Randomization Distribution

Statistical Conclusions Strength of evidence against H 0 : Formal decision of hypothesis test, based on  = 0.05 :

Formal Decisions For a given significance level, , p-value <   Reject H o p-value >   Do not Reject H o

Errors Reject H 0 Do not reject H 0 H 0 true H 0 false TYPE I ERROR TYPE II ERROR Truth Decision

QUESTIONS???