Quantifying Uncertainty

Slides:



Advertisements
Similar presentations
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. A PowerPoint Presentation Package to Accompany Applied Statistics.
Advertisements

Probability models- the Normal especially.
Review bootstrap and permutation
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Hypothesis testing and confidence intervals by resampling by J. Kárász.
Sampling Distributions (§ )
Confidence Intervals Underlying model: Unknown parameter We know how to calculate point estimates E.g. regression analysis But different data would change.
Ch 6 Introduction to Formal Statistical Inference.
Psychology 202b Advanced Psychological Statistics, II February 15, 2011.
Estimation A major purpose of statistics is to estimate some characteristics of a population. Take a sample from the population under study and Compute.
Stat Today: More Chapter 3. Assessing Effect Significance For replicated experiments, can use regression to determine important effects Can also.
Environmental Data Analysis with MatLab Lecture 24: Confidence Limits of Spectra; Bootstraps.
2008 Chingchun 1 Bootstrap Chingchun Huang ( 黃敬群 ) Vision Lab, NCTU.
Bootstrapping LING 572 Fei Xia 1/31/06.
8-1 Introduction In the previous chapter we illustrated how a parameter can be estimated from sample data. However, it is important to understand how.
Stat 301 – Day 37 Bootstrapping, cont (5.5). Last Time - Bootstrapping A simulation tool for exploring the sampling distribution of a statistic, using.
1 Confidence Intervals for Means. 2 When the sample size n< 30 case1-1. the underlying distribution is normal with known variance case1-2. the underlying.
Standard error of estimate & Confidence interval.
Introduction to Bootstrapping
Bootstrap spatobotp ttaoospbr Hesterberger & Moore, chapter 16 1.
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
Empirical Research Methods in Computer Science Lecture 2, Part 1 October 19, 2005 Noah Smith.
Bootstrapping – the neglected approach to uncertainty European Real Estate Society Conference Eindhoven, Nederlands, June 2011 Paul Kershaw University.
Biostatistics IV An introduction to bootstrap. 2 Getting something from nothing? In Rudolph Erich Raspe's tale, Baron Munchausen had, in one of his many.
● Final exam Wednesday, 6/10, 11:30-2:30. ● Bring your own blue books ● Closed book. Calculators and 2-page cheat sheet allowed. No cell phone/computer.
Estimating Incremental Cost- Effectiveness Ratios from Cluster Randomized Intervention Trials M. Ashraf Chaudhary & M. Shoukri.
1 Nonparametric Methods II Henry Horng-Shing Lu Institute of Statistics National Chiao Tung University
Ch 6 Introduction to Formal Statistical Inference
Chapter 14 Inference for Regression AP Statistics 14.1 – Inference about the Model 14.2 – Predictions and Conditions.
Module 7: Comparing Datasets and Comparing a Dataset with a Standard How different is enough?
Resampling techniques
Limits to Statistical Theory Bootstrap analysis ESM April 2006.
Deutscher Wetterdienst Bootstrapping – using different methods to estimate statistical differences between model errors Ulrich Damrath COSMO GM Rome 2011.
Computational statistics, lecture3 Resampling and the bootstrap  Generating random processes  The bootstrap  Some examples of bootstrap techniques.
Ledolter & Hogg: Applied Statistics Section 6.2: Other Inferences in One-Factor Experiments (ANOVA, continued) 1.
Analysis of Chromium Emissions Data Nagaraj Neerchal and Justin Newcomer, UMBC and OIAA/OEI and Mohamed Seregeldin, Office of Air Quality Planning and.
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
Mystery 1Mystery 2Mystery 3.
Nonparametric Methods II 1 Henry Horng-Shing Lu Institute of Statistics National Chiao Tung University
Case Selection and Resampling Lucila Ohno-Machado HST951.
Analysis of Experimental Data; Introduction
Beginning Statistics Table of Contents HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc.
Statistical Data Analysis 2011/2012 M. de Gunst Lecture 4.
10.1 – Estimating with Confidence. Recall: The Law of Large Numbers says the sample mean from a large SRS will be close to the unknown population mean.
Project Plan Task 8 and VERSUS2 Installation problems Anatoly Myravyev and Anastasia Bundel, Hydrometcenter of Russia March 2010.
Univariate Point Estimation Confidence Interval Estimation Bivariate: Linear Regression Multivariate: Multiple Regression 1 Chapter 4: Statistical Approaches.
Comparing Datasets and Comparing a Dataset with a Standard How different is enough?
Bias-Variance Analysis in Regression  True function is y = f(x) +  where  is normally distributed with zero mean and standard deviation .  Given a.
Based on “An Introduction to the Bootstrap” (Efron and Tibshirani)
Statistical Inference
Concepts in Probability, Statistics and Stochastic Modeling
Application of the Bootstrap Estimating a Population Mean
Prepared by Lloyd R. Jaisingh
Virtual COMSATS Inferential Statistics Lecture-26
When we free ourselves of desire,
CI for μ When σ is Unknown
Statistical Inference for the Mean Confidence Interval
Quantifying uncertainty using the bootstrap
Alafia river: Autocorrelation Autocorrelation of standardized flow.
Bootstrap Confidence Intervals using Percentiles
Introduction to Inference
Bootstrap - Example Suppose we have an estimator of a parameter and we want to express its accuracy by its standard error but its sampling distribution.
BOOTSTRAPPING: LEARNING FROM THE SAMPLE
QQ Plot Quantile to Quantile Plot Quantile: QQ Plot:
Chapter 14 Inference for Regression
Sampling Distributions (§ )
Techniques for the Computing-Capable Statistician
Bootstrapping and Bootstrapping Regression Models
Introductory Statistics
Presentation transcript:

Quantifying Uncertainty Two approaches Use statistical theory Bootstrapping

Statistical Theory Uncorrelated Correlated Effective sample size

Significance Statistics Significance statistics use the standard error: Confidence Intervals:

Bootstrapping Motivated by the absence of equations for other accuracy measures (bias, prediction error, confidence intervals) for statistics of interest (correlation, regressions, ACF) Definition: “The bootstrap is a data-based simulation method for statistical inference.” Principle: resample with replacement from data. After Efron and Tibshirani, An Introduction to the Bootstrap, 1993

Bootstrapping REAL WORLD BOOTSTRAP WORLD Sampling with replacement F x = {x1, x2, …, xn} BOOTSTRAP WORLD F * x * = {x*1, x * 2, …, x *n} Empirical Distribution Bootstrap Sample Bootstrap Replication Unknown Probability Distribution Observed Random Sample Sampling with replacement Statistic of Interest After Efron and Tibshirani, An Introduction to the Bootstrap, 1993

95% CI and interquartile range from 500 bootstrap samples Hillsborough River at Zephyr Hills, September flows Mean = 8621 mgal S = 8194 mgal N = 31 Uncertainty on estimates of the mean One and two standard errors 95% CI and interquartile range from 500 bootstrap samples Millions of gallons

Box-Cox Normality Plot for Monthly September Flows on Alafia R. Using Kolmogorov-Smirnov (KS) Statistic Peak at  = -0.39 What is the range of uncertainty on this?

Example for the ks.test How? Produce 500 new datasets (x*) of the same length as x by sampling with replacement from x Find the optimal  value for each Determine the 10th and 90th percentiles to cover 80% of the  values calculated.

80% confidence interval 10% 50% 90% -0.425 -0.250 -0.068 Look back at the original plot and verify that the original “optimal” value was at the far left of the broad top, which is reflected in this confidence interval. 80% confidence interval 10% 50% 90% -0.425 -0.250 -0.068