Based on “An Introduction to the Bootstrap” (Efron and Tibshirani)

Slides:



Advertisements
Similar presentations
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Advertisements

Happiness comes not from material wealth but less desire. 1.
Uncertainty and confidence intervals Statistical estimation methods, Finse Friday , 12.45–14.05 Andreas Lindén.
1 Bootstrap Confidence Intervals for Three-way Component Methods Henk A.L. Kiers University of Groningen The Netherlands.
Chapter 11- Confidence Intervals for Univariate Data Math 22 Introductory Statistics.
Today: Quizz 11: review. Last quizz! Wednesday: Guest lecture – Multivariate Analysis Friday: last lecture: review – Bring questions DEC 8 – 9am FINAL.
3 pivot quantities on which to base bootstrap confidence intervals Note that the first has a t(n-1) distribution when sampling from a normal population.
Resampling techniques
2008 Chingchun 1 Bootstrap Chingchun Huang ( 黃敬群 ) Vision Lab, NCTU.
Bootstrapping LING 572 Fei Xia 1/31/06.
1 Confidence Intervals for Means. 2 When the sample size n< 30 case1-1. the underlying distribution is normal with known variance case1-2. the underlying.
Bootstrapping applied to t-tests
Bootstrap spatobotp ttaoospbr Hesterberger & Moore, chapter 16 1.
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
AM Recitation 2/10/11.
SECTION 6.4 Confidence Intervals for Variance and Standard Deviation Larson/Farber 4th ed 1.
Applications of bootstrap method to finance Chin-Ping King.
Today’s lesson Confidence intervals for the expected value of a random variable. Determining the sample size needed to have a specified probability of.
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 11 th Edition.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Estimation Bias, Standard Error and Sampling Distribution Estimation Bias, Standard Error and Sampling Distribution Topic 9.
Estimating a Population Variance
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
9 Mar 2007 EMBnet Course – Introduction to Statistics for Biologists Nonparametric tests, Bootstrapping
Estimating Incremental Cost- Effectiveness Ratios from Cluster Randomized Intervention Trials M. Ashraf Chaudhary & M. Shoukri.
1 Nonparametric Methods II Henry Horng-Shing Lu Institute of Statistics National Chiao Tung University
Chapter 7 Process Capability. Introduction A “capable” process is one for which the distributions of the process characteristics do lie almost entirely.
Confidence Intervals Lecture 3. Confidence Intervals for the Population Mean (or percentage) For studies with large samples, “approximately 95% of the.
Limits to Statistical Theory Bootstrap analysis ESM April 2006.
Nonparametric Methods II 1 Henry Horng-Shing Lu Institute of Statistics National Chiao Tung University
Lecture 4 Confidence Intervals. Lecture Summary Last lecture, we talked about summary statistics and how “good” they were in estimating the parameters.
Project Plan Task 8 and VERSUS2 Installation problems Anatoly Myravyev and Anastasia Bundel, Hydrometcenter of Russia March 2010.
Quantifying Uncertainty
Bootstrapping James G. Anderson, Ph.D. Purdue University.
Confidence Intervals. Point Estimate u A specific numerical value estimate of a parameter. u The best point estimate for the population mean is the sample.
Estimating standard error using bootstrap
Chapter 3 INTERVAL ESTIMATES
Cross Tabulation with Chi Square
Confidence Intervals and Sample Size
Inference for the Mean of a Population
Standard Errors Beside reporting a value of a point estimate we should consider some indication of its precision. For this we usually quote standard error.
Application of the Bootstrap Estimating a Population Mean
Confidence Interval Estimation
Hypotheses and test procedures
Chapter 3 INTERVAL ESTIMATES
Point and interval estimations of parameters of the normally up-diffused sign. Concept of statistical evaluation.
Copyright (c) 2005 John Wiley & Sons, Inc.
Confidence Intervals Copyright  2008 by The McGraw-Hill Companies. This material is intended for educational purposes by licensed users of LearningStats.
Goodness of Fit x² -Test
Statistics in SPSS Lecture 7
Chapter 8 Confidence Interval Estimation.
Elementary Statistics
Estimates of Bias & The Jackknife
Section 6-4 – Confidence Intervals for the Population Variance and Standard Deviation Estimating Population Parameters.
Quantifying uncertainty using the bootstrap
Elementary Statistics
Bootstrap - Example Suppose we have an estimator of a parameter and we want to express its accuracy by its standard error but its sampling distribution.
M A R I O F. T R I O L A Estimating Population Proportions Section 6-5
BOOTSTRAPPING: LEARNING FROM THE SAMPLE
Chapter 6 Confidence Intervals.
Confidence Interval Estimation
Statistical Process Control
Bootstrapping Jackknifing
Chapter 6 Confidence Intervals.
Introduction to the t Test
Simulation Berlin Chen
Generalized Additive Model
Techniques for the Computing-Capable Statistician
Introductory Statistics
Presentation transcript:

Based on “An Introduction to the Bootstrap” (Efron and Tibshirani) Nir Keret Based on “An Introduction to the Bootstrap” (Efron and Tibshirani)

The Percentile Interval

The Percentile Interval

The Percentile Interval

The Percentile Interval Looks good… where does it go wrong? Perhaps the assumption that such a transformation “m” exists is too strong an assumption? We require that “m” is both normalizing and variance-stabilizing. Such a transformation need not necessarily exist. In addition, we would like to correct for potential bias in the plug-in estimate. The percentile method does not deal with that.

The BCa Interval The name BCa stands for “Bias-Correction and Acceleration”. It relaxes the assumption that the transformation “m” is variance-stabilizing, and in addition it corrects for potential bias in the estimate. A motivational example:

The BCa Interval

The BCa Interval How do BCa and ABC perform compared with the percentile?

The BCa Interval The exact CI: is taking into account the fact that the mean had to be estimated, hence uses n-1 degrees of freedom for the chi-squared distribution. In contrast, the sample variance, which we used as the plug-in estimate in the bootstrap CIs is biased. How come the BCa and ABC do not seem biased but the percentile is? In addition, note that the shape of the percentile CI is nowhere near the exact. The right part of the CI should be about 2.5 times longer than the left part.

The BCa Interval There is no golden standard for the non-parametric case, but we can see that there are big differences.

The BCa Interval The BCa endpoints are also based on the bootstrap distribution percentiles, but not necessarily the same ones as the regular percentile interval. The BCa percentiles depend on two numbers: responsible for correcting non-stable variance (“acceleration”) and bias.

The BCa Interval

The BCa Interval Where does this formula come from?

The BCa Interval

The BCa Interval

The BCa Interval

The BCa Interval - Bias Correction

The BCa Interval - Bias Correction

The BCa Interval - Bias Correction Rationale for that formula:

The BCa Interval - Acceleration One-Parameter Parametric Bootstrap

The BCa Interval - Acceleration Parametric bootstrap with nuisance parameters In most cases we will have more than one unknown parameter, and the former formula will not apply. The solution is to reduce the problem to a one-parameter parametric family, and apply the formula on it. How do we do that?

The BCa Interval - Acceleration Least Favorable Family

The BCa Interval - Acceleration

The BCa Interval - Acceleration

The BCa Interval - Acceleration Nonparametric Bootstrap

The BCa Interval - Acceleration

Geometrical interpretation

The BCa Interval - Influence Function Gâteaux Derivative The Gâteaux derivative of T at F in the direction G is defined by Mathematically speaking, the Gâteaux derivative is a generalization of the concept of directional derivative. Statistically speaking, it measures the rate of change in a statistical functional upon a small amount of contamination by another distribution G.

The BCa Interval - Influence Function The Influence Function

The BCa Interval - Influence Function Example: the mean

The BCa Interval - Influence Function Example: the variance We can take advantage of the chain rule:

The BCa Interval - Influence Function Example: the variance:

The BCa Interval - Influence Function The influence components behave a lot like the score values: For a parametric model:

The BCa Interval - Influence Function In practice, in order to make the method general and not functional-specific, we can take the corresponding jackknife value as an approximation for the influence component. The jackknife practically replaces epsilon with The jackknife approximation yields:

ABC Intervals Stands for “Approximate Bootstrap CIs” It tries to mimic the BCa intervals in an analytic fashion - without actually performing resampling at all. It requires second derivatives, hence works for smoothly defined parameters in exponential families or smoothly defined nonparametric problems.

ABC Intervals

ABC Intervals

ABC Intervals

ABC Intervals

Convergence Order

Convergence Order Correctness at a given level Accuracy at the same level

ABC BCa Percentile Bootstrap t Second-order correct Transformation - respecting

Thanks!