Review bootstrap and permutation

Slides:



Advertisements
Similar presentations
Probability models- the Normal especially.
Advertisements

Inference in the Simple Regression Model
“Students” t-test.
Chapter 16 Inferential Statistics
Recap of confidence intervals If the 95%CI does not include the hypothesized μ, we conclude that our sample is statistically different from the assumed.
Hypothesis testing and confidence intervals by resampling by J. Kárász.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Hypothesis Testing I 2/8/12 More on bootstrapping Random chance
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
Lecture 5 Outline – Tues., Jan. 27 Miscellanea from Lecture 4 Case Study Chapter 2.2 –Probability model for random sampling (see also chapter 1.4.1)
The Basics of Regression continued
8 - 10: Intro to Statistical Inference
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Bootstrap spatobotp ttaoospbr Hesterberger & Moore, chapter 16 1.
Statistical hypothesis testing – Inferential statistics I.
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
Overview of Statistical Hypothesis Testing: The z-Test
Hypothesis Testing (Statistical Significance). Hypothesis Testing Goal: Make statement(s) regarding unknown population parameter values based on sample.
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Given a sample from some population: What is a good “summary” value which well describes the sample? We will look at: Average (arithmetic mean) Median.
1 Tests with two+ groups We have examined tests of means for a single group, and for a difference if we have a matched sample (as in husbands and wives)
Education 793 Class Notes T-tests 29 October 2003.
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
+ Chapter 9 Summary. + Section 9.1 Significance Tests: The Basics After this section, you should be able to… STATE correct hypotheses for a significance.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Today’s lesson Confidence intervals for the expected value of a random variable. Determining the sample size needed to have a specified probability of.
PARAMETRIC STATISTICAL INFERENCE
Comparing two sample means Dr David Field. Comparing two samples Researchers often begin with a hypothesis that two sample means will be different from.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
9 Mar 2007 EMBnet Course – Introduction to Statistics for Biologists Nonparametric tests, Bootstrapping
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
S-012 Testing statistical hypotheses The CI approach The NHST approach.
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
1 9 Tests of Hypotheses for a Single Sample. © John Wiley & Sons, Inc. Applied Statistics and Probability for Engineers, by Montgomery and Runger. 9-1.
Limits to Statistical Theory Bootstrap analysis ESM April 2006.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Mystery 1Mystery 2Mystery 3.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
1 Probability and Statistics Confidence Intervals.
1 Section 8.2 Basics of Hypothesis Testing Objective For a population parameter (p, µ, σ) we wish to test whether a predicted value is close to the actual.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Hypothesis Testing Steps for the Rejection Region Method State H 1 and State H 0 State the Test Statistic and its sampling distribution (normal or t) Determine.
BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
BPS - 5th Ed. Chapter 231 Inference for Regression.
Chapter 9 Sampling Distributions 9.1 Sampling Distributions.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 FINAL EXAMINATION STUDY MATERIAL III A ADDITIONAL READING MATERIAL – INTRO STATS 3 RD EDITION.
Bias-Variance Analysis in Regression  True function is y = f(x) +  where  is normally distributed with zero mean and standard deviation .  Given a.
Simple Linear Regression and Correlation (Continue..,) Reference: Chapter 17 of Statistics for Management and Economics, 7 th Edition, Gerald Keller. 1.
15 Inferential Statistics.
More on Inference.
Chapter 4. Inference about Process Quality
Unit 5: Hypothesis Testing
CHAPTER 10 Comparing Two Populations or Groups
Lecture 8 Preview: Interval Estimates and Hypothesis Testing
CHAPTER 12 More About Regression
CHAPTER 10 Comparing Two Populations or Groups
When we free ourselves of desire,
More on Inference.
Bootstrap Confidence Intervals using Percentiles
Lecture 10/24/ Tests of Significance
CHAPTER 12 More About Regression
CHAPTER 12 Inference for Proportions
STA 291 Spring 2008 Lecture 18 Dustin Lueker.
CHAPTER 12 Inference for Proportions
CHAPTER 10 Comparing Two Populations or Groups
Confidence Interval.
CHAPTER 12 More About Regression
Comparing Two Proportions
Statistical Power.
Presentation transcript:

Review bootstrap and permutation

Main points Definition of confidence intervals Definition of p-value Why use bootstrap, why use permutation tests? What are the differences between bootstrap and permutation tests? How to use bootstrap? How to use permutation tests?

Definition of confidence intervals A confidence interval represents the precision of the estimation of a test statistic If the same experiment was replicated a hundred times, the 95% CI would, on average, contain the estimated TS in 95 of these samples.

Definition of p-value A p-value represents the probability of observing a TS this extreme or more extreme if the null hypothesis is true

Why use bootstrap, why use permutation tests? Test statistic Theoretical hypothesis Operational hypothesis Null hypothesis P-value OR CI Underlying distribution If one of those is unusual or unknown, bootstrap or permutation is useful 5

Test statistics Is not implemented in standard software Is estimated by a single value in the whole sample Is a relationship between two TS

Null hypothesis Is different from zero in a way that is hard to quantify Example: If you are expecting a certain variable to explain more than 50 percent of a different variable, such that the r squared is greater than 50. We would want to bootstrap the r-statistics and see whether the confidence interval includes 50, or values below 50. If the confidence interval does not, then we can say that our hypothesis is supported. If it does, then we cannot reject the null.

Underlying distribution Is unknown because TS was unknown Is unknown because conditions of applications for parametric tests do not seem to be met Is known for the null hypothesis but seems likely to be different for the alternative hypothesis p-value should be correct but CI will be incorrect.

Differences between bootstrap and permutation tests estimates confidence interval, bias and standard error Simulates data under the alternative hypothesis Sampling is done with replacement of subjects Many bootstrap samples because of replacement Permutation tests estimates p-value and distribution under the null. Simulates data under the null hypothesis Sampling is done without replacement of subjects Finite number of potential permutation samples

Main points: how to How to bootstrap a test statistics Determine the test statistic of interest (must be a single value) What is randomly sampled? How many subjects in the bootstrap samples (with or without replacement)? How many bootstrap samples Examine histogram of TS* with TS (the observed TS), average of TS*, and boundaries of percentile and bca CI Interpret results

Main points: how to How to use a permutation test Determine the test statistic of interest (must be a single value) What must be shuffled in order to simulate what happens under the null hypothesis? How many subjects in the permutation samples (with or without replacement)? How many permutation samples? Examine histogram of TS* with TS (the observed TS) Compute 2 p-values if possible: one-tailed, two-tailed Interpret results

Extra assignment Read the article by Zentner et al. (2007). If an hypothesis is tested by either bootstrap or permutation test, describe in details: the hypothesis, How the hypothesis was operationalized The procedure, The results The interpretation of the results.