Computational statistics, lecture3 Resampling and the bootstrap  Generating random processes  The bootstrap  Some examples of bootstrap techniques.

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Hypothesis testing and confidence intervals by resampling by J. Kárász.
STATISTICAL INFERENCE PART V CONFIDENCE INTERVALS 1.
Sampling Distributions (§ )
Objectives (BPS chapter 24)
Confidence Intervals Underlying model: Unknown parameter We know how to calculate point estimates E.g. regression analysis But different data would change.
Chapter 13 Conducting & Reading Research Baumgartner et al Data Analysis.
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
Descriptive statistics Experiment  Data  Sample Statistics Sample mean Sample variance Normalize sample variance by N-1 Standard deviation goes as square-root.
Time series analysis - lecture 5
Chapter 6 Introduction to Sampling Distributions
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
BCOR 1020 Business Statistics Lecture 28 – May 1, 2008.
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
2008 Chingchun 1 Bootstrap Chingchun Huang ( 黃敬群 ) Vision Lab, NCTU.
Analysis of Simulation Input.. Simulation Machine n Simulation can be considered as an Engine with input and output as follows: Simulation Engine Input.
T T07-01 Sample Size Effect – Normal Distribution Purpose Allows the analyst to analyze the effect that sample size has on a sampling distribution.
Bagging LING 572 Fei Xia 1/24/06. Ensemble methods So far, we have covered several learning methods: FSA, HMM, DT, DL, TBL. Question: how to improve results?
Quantitative Business Analysis for Decision Making Simple Linear Regression.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Business Statistics - QBM117 Statistical inference for regression.
Chapter 12 Section 1 Inference for Linear Regression.
Inference for regression - Simple linear regression
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
Applications of bootstrap method to finance Chin-Ping King.
Stats for Engineers Lecture 9. Summary From Last Time Confidence Intervals for the mean t-tables Q Student t-distribution.
1 Introduction to Estimation Chapter Concepts of Estimation The objective of estimation is to determine the value of a population parameter on the.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Biostatistics IV An introduction to bootstrap. 2 Getting something from nothing? In Rudolph Erich Raspe's tale, Baron Munchausen had, in one of his many.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
1 Statistical Distribution Fitting Dr. Jason Merrick.
AP Statistics Chapter 15 Notes. Inference for a Regression Line Goal: To determine if there is a relationship between two quantitative variables. Goal:
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
AP Statistics Chapter 15 Notes. Inference for a Regression Line Goal: To determine if there is a relationship between two quantitative variables. –i.e.
Resampling techniques
Limits to Statistical Theory Bootstrap analysis ESM April 2006.
28. Multiple regression The Practice of Statistics in the Life Sciences Second Edition.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Statistical Data Analysis 2011/2012 M. de Gunst Lecture 3.
Statistical Data Analysis 2011/2012 M. de Gunst Lecture 4.
Quantifying Uncertainty
Bootstrapping James G. Anderson, Ph.D. Purdue University.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
1 Simple Linear Regression Chapter Introduction In Chapters 17 to 19 we examine the relationship between interval variables via a mathematical.
Estimating standard error using bootstrap
CHAPTER 12 More About Regression
Application of the Bootstrap Estimating a Population Mean
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
Inference: Conclusion with Confidence
CHAPTER 12 More About Regression
Lecture 19.
CHAPTER 29: Multiple Regression*
Quantifying uncertainty using the bootstrap
Linear Regression.
Bootstrap - Example Suppose we have an estimator of a parameter and we want to express its accuracy by its standard error but its sampling distribution.
Regression Models - Introduction
Sampling Distribution
Sampling Distribution
BOOTSTRAPPING: LEARNING FROM THE SAMPLE
CHAPTER 12 More About Regression
CHAPTER 12 More About Regression
Sampling Distributions (§ )
Inference for Regression
Simulation Berlin Chen
Bootstrapping and Bootstrapping Regression Models
Introductory Statistics
Uncertainty Propagation
Presentation transcript:

Computational statistics, lecture3 Resampling and the bootstrap  Generating random processes  The bootstrap  Some examples of bootstrap techniques

Computational statistics, lecture3 Process-based model of the flow of nitrogen from land to sea Coastal model Anthropo- genic inputs Primary outputs (nutrient concentrations, chlorophyll, oxygen, etc.) Open-sea boundary conditions Watershed model Physio- graphic inputs Meteoro- logical forcings Atmos- pheric inputs Physio- graphic inputs Waterborne inputs Derived outputs Meteoro- logical forcings

Computational statistics, lecture3 Decomposing outputs of process-based models driven by meteorological inputs Observed forcing Weather-dependent model output Synthetic forcing Synthetic model output Weather-normalised mean output Weather-specific (random) component of the model output How can we use resampling to better understand model outputs?

Computational statistics, lecture3 Resampling daily temperatures  Split observed data into periods of duration one month  Generate new temperature series by resampling 1-month pieces and combining them so that the seasonal pattern is preserved

Computational statistics, lecture3 Observed and resampled daily temperatures Observed dataResampled data

Computational statistics, lecture 3 Data-driven inference - inference based on resampling observed data Sampling with replacement Resampled data Observed data

Computational statistics, lecture3 Nonparametric bootstrap - empirical cdf

Computational statistics, lecture3 The bootstrap  Let (X 1, …, X n ) be a sample and  a parameter of the underlying distribution  Suppose  is estimated by  The underlying idea of the bootstrap is to first use the sample to estimate the unknown distribution F of the data. Then this estimated distribution F* is used in place of the unknown true distribution in calculating the distribution of

Computational statistics, lecture3 Nonparametric bootstrap - histogram of sample means of bootstrap samples

Computational statistics, lecture3 Nonparametric bootstrap - histogram of sample means of bootstrap samples

Computational statistics, lecture3 Nonparametric bootstrap - histogram of standard deviations of bootstrap samples

Computational statistics, lecture3 Nonparametric bootstrap - confidence intervals by computing percentiles

Computational statistics, lecture3 Parametric bootstrap - empirical cdf  Assume that a sample is drawn from an exponential distribution with cdf F( , x) = 1 – exp(-  x)  Use the estimator  Determine the distribution of using the estimated distribution

Computational statistics, lecture3 Residual resampling  Consider the linear regression model  Estimate the beta coefficients and determine the residuals  Generate new bootstrap samples  Make inference about the model parameters by fitting linear regression models to bootstrap samples