Statistical Bootstrapping Peter D. Christenson Biostatistician January 20, 2005.

Slides:



Advertisements
Similar presentations
1 Probabilistic Uncertainty Bounding in Output Error Models with Unmodelled Dynamics 2006 American Control Conference, June 2006, Minneapolis, Minnesota.
Advertisements

Objectives 10.1 Simple linear regression
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
The Simple Linear Regression Model: Specification and Estimation
Class 5: Thurs., Sep. 23 Example of using regression to make predictions and understand the likely errors in the predictions: salaries of teachers and.
8 - 10: Intro to Statistical Inference
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
Bootstrapping LING 572 Fei Xia 1/31/06.
Getting Started with Hypothesis Testing The Single Sample.
Bootstrapping applied to t-tests
Standard Error of the Mean
Inference for regression - Simple linear regression
Prediction concerning Y variable. Three different research questions What is the mean response, E(Y h ), for a given level, X h, of the predictor variable?
Statistics for clinical research An introductory course.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
Biostatistics in Clinical Research Peter D. Christenson Biostatistician January 12, 2005IMSD U*STAR RISE.
 The situation in a statistical problem is that there is a population of interest, and a quantity or aspect of that population that is of interest. This.
Dan Piett STAT West Virginia University
Populations, Samples, Standard errors, confidence intervals Dr. Omar Al Jadaan.
Comparing Two Proportions
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Lecture 14 Sections 7.1 – 7.2 Objectives:
Understanding the Variability of Your Data: Dependent Variable Two "Sources" of Variability in DV (Response Variable) –Independent (Predictor/Explanatory)
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
Bootstrapping (And other statistical trickery). Reminder Of What We Do In Statistics Null Hypothesis Statistical Test Logic – Assume that the “no effect”
PARAMETRIC STATISTICAL INFERENCE
Probability, contd. Learning Objectives By the end of this lecture, you should be able to: – Describe the difference between discrete random variables.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 5: Methods for Assessing Associations.
Why Draw A Sample? Why not just the get the whole enchilada? – Pragmatic reasons – The true population is typically “unknowable” When done right, a small.
Biostatistics in Practice Peter D. Christenson Biostatistician LABioMed.org /Biostat Session 6: Case Study.
Biostatistics Case Studies 2007 Peter D. Christenson Biostatistician Session 3: Incomplete Data in Longitudinal Studies.
Using Resampling Techniques to Measure the Effectiveness of Providers in Workers’ Compensation Insurance David Speights Senior Research Statistician HNC.
Slides to accompany Weathington, Cunningham & Pittenger (2010), Chapter 7: Sampling 1.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 8-3 Testing a Claim About a Proportion.
Statistical estimation, confidence intervals
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 2: Using the Bootstrap: Throw Out Those Messy.
The Central Limit Theorem © Christine Crisp “Teach A Level Maths” Statistics 1.
Chapter 7 Probability and Samples: The Distribution of Sample Means.
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
Stat 112: Notes 2 Today’s class: Section 3.3. –Full description of simple linear regression model. –Checking the assumptions of the simple linear regression.
Sampling And Resampling Risk Analysis for Water Resources Planning and Management Institute for Water Resources May 2007.
Copyright © 2010 Pearson Education, Inc. Chapter 22 Comparing Two Proportions.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 6: Case Study.
Introduction to Inference: Confidence Intervals and Hypothesis Testing Presentation 4 First Part.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 3: Testing Hypotheses.
Chapter 7 Sampling Distributions. Sampling Distribution of the Mean Inferential statistics –conclusions about population Distributions –if you examined.
Biostatistics in Practice Peter D. Christenson Biostatistician Session 4: Study Size for Precision or Power.
Bootstrap Event Study Tests Peter Westfall ISQS Dept. Joint work with Scott Hein, Finance.
Session 1 Probability to confidence intervals By : Allan Chang.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. PPSS The situation in a statistical problem is that there is a population of interest, and a quantity or.
ISMT253a Tutorial 1 By Kris PAN Skewness:  a measure of the asymmetry of the probability distribution of a real-valued random variable 
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Math 3680 Lecture #15 Confidence Intervals. Review: Suppose that E(X) =  and SD(X) = . Recall the following two facts about the average of n observations.
Biostatistics in Practice Peter D. Christenson Biostatistician LABioMed.org /Biostat Session 3: Testing Hypotheses.
Sampling Design and Analysis MTH 494 Lecture-21 Ossam Chohan Assistant Professor CIIT Abbottabad.
1 Probability and Statistics Confidence Intervals.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
Selecting a Sample. outline Difference between sampling in quantitative & qualitative research.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.
Bootstrapping James G. Anderson, Ph.D. Purdue University.
Confidence Interval Estimation
Statistical Methods For Engineers
Simulation: Sensitivity, Bootstrap, and Power
Statistical inference for the slope and intercept in SLR
Density Curves Normal Distribution Area under the curve
Presentation transcript:

Statistical Bootstrapping Peter D. Christenson Biostatistician January 20, 2005

Outline Example of bootstrap in a previous paper Basics: Precision of estimates Practical difficulties Bootstrap concept Particular use in our example Other uses Good bootstrap reference with tutorial pdf* and software: *Source for most figures here.

Paper Using Bootstrapping Outcome: Labor progression was estimated by the duration of labor for each cm of cervical dilation using serial vaginal exams. Predictor: Classification as overweight, obese or normal weight. Many possible confounding factors, e.g. fetal size.

Paper: Major Results p-value is based on the precision of the estimated durations of 6.20 and 7.52 hours. p<0.01 ↔ the 99% CI centered at is above 0 (so it is unlikely that there is no difference).

Paper: From statistical methods Survival analysis methods were needed to estimate the median duration from, say 4 to 5 cm dilation, since the exact times were not known. Nevertheless, they did not just use the p-values from the software output, but used “bootstrapping”. Why?

Basics: Precision vs. Normal Range A random sample of 100 women shows a 4-10 cm dilation duration mean±SD of 7.0±1.25. Normal (95%) range is ~ 7±2(1.25) = 4.5 to 9.5 hours. With no other info about a patient, we predict she will have between 4.5 and 9.5 hours, based on SD=variation among individuals. But, how well (precise) did the study estimate the mean duration of all women? We only have one mean, but want 7.0±2(SD of means).

Estimating Precision from Theory “All” pregnant women To get SD of mean, conceptually take many samples: Of course, we don’t have the luxury of more than 1 sample. From math theory, SD of a mean of N is SD/√N = SEM: “All” pregnant women Mean of X-bars SD of X-bars

Extensions of the Theory The theory has been extended beyond means (SEM) to SEs for more complex measures, such as predictions from regression: Blue bands are “normal ranges” and red bands are CIs, showing precision. But, the relation is not just a factor of √N, as with means.

Difficulties For most situations, standard errors (SE) have been developed based on theory. They may not be accurate in some circumstances: The sample may not be a simple random one that is required for standard SE formulas. There may be non-sampling sources of variation, e.g., using estimated results from one analysis in further analyses. There may be approximations or assumptions required for the formulas based on theory that are known to not hold.

The Bootstrap Standard Error Obtain a single sample of size N. Then: 1.Create thousands of samples with replacement of size N, called “bootstrap samples” or “resamples” from the original sample. 2.Calculate the quantity of interest, the bootstrap estimate, for each sample. 3.Find the bootstrap distribution of these quantities, and in particular their SD, which is the bootstrap SE: M M* M*s are the bootstrap estimates of M

The Bootstrap SE: Concept Consider a sample of N=6 with 3 bootstrap samples: Mean±SD of original sample = 4.46±7.54. SEM = 7.54/√6 = 3.08 Bootstrap SEM is SD(4.13,4.64,1.74) = 1.55 Here, bootstrap SE is awful since only 3 samples were drawn. Typically, thousands are used.

Back to Labor Progression Paper Why was bootstrapping used in the paper? The design used a stratified random sample, not a simple random sample: There are SE formulas for some quantities from studies that over-sample some groups, as was done here, but perhaps not for these adjusted medians.

Other Bootstrap Advantages Typically fewer assumptions. Very general and reliable: can use the same software code for many estimation problems that have different formulas. Some studies show more accuracy than classical methods. Can model the entire estimation process, not just sampling error. See next 2 slides.

Labor Progression Paper: Covariate Adjustment Method Recall that durations were adjusted for several covariates, including oxytocin use and fetal size. The oversampling of heavier women was accounted for with bootstrapping so that these adjustments were unbiased. A single set of covariates was used for all bootstrap samples.

Labor Progression Paper: Alternative Covariate Adjustment Recall that durations were adjusted for several covariates, including oxytocin use and fetal size. The entire process of selecting covariates could have been performed separately with each bootstrap sample. This could better incorporate the uncertainty of choosing which factors need to be used for adjustment.

Conclusions Bootstrapping can avoid the requirement of unnecessary assumptions. It is not needed in most applications. It is needed for studies w/o simple random sampling, unless software for other sampling designs is used. For our paper here, it probably had a small impact, but could have been used to gain further advantages. In general, it is of relatively minor importance for most studies. Just excluding a confounder would typically have a greater impact.