Presentation is loading. Please wait.

Presentation is loading. Please wait.

Higher-order Confidence Intervals for Stochastic Programming using Bootstrapping Cosmin G. Petra Joint work with Mihai Anitescu Mathematics and Computer.

Similar presentations


Presentation on theme: "Higher-order Confidence Intervals for Stochastic Programming using Bootstrapping Cosmin G. Petra Joint work with Mihai Anitescu Mathematics and Computer."— Presentation transcript:

1 Higher-order Confidence Intervals for Stochastic Programming using Bootstrapping Cosmin G. Petra Joint work with Mihai Anitescu Mathematics and Computer Science Division Argonne National Laboratory petra@mcs.anl.gov INFORMS ANNUAL MEETING 2012

2 Outline  Confidence intervals  Motivation –statistical inference for the stochastic optimization of power grid  Our statistical estimator for the optimal value  Bootstrapping  Second-order bootstrapped confidence intervals  Numerical example 2

3 Confidence intervals (CIs) for a statistic 3  Want an interval [L,U] where resides with high probability  Need the knowledge of the probability distribution  Example: Confidence intervals for the mean of Gaussian (normal) random variable Normal distribution, also called Gaussian or "bell curve“ distribution. Image source: Wikipedia.

4 Approximating CIs 4  In many cases the distribution function is not known.  Such intervals are approximated based on the central limit theorem (CLT)  Normal approximation for equal-tailed 95% CI  Notation

5 Optimal value in stochastic programming  Monotonically shrinking negative bias:  Consistency  Arbitrary slow convergence  Non-normal bias 5 Sample average approximation (SAA) Stochastic programming (SP) problem Properties

6 Stochastic unit commitment with wind power  Wind Forecast – WRF(Weather Research and Forecasting) Model –Real-time grid-nested 24h simulation –30 samples require 1h on 500 CPUs (Jazz@Argonne) 6 Slide courtesy of V. Zavala & E. Constantinescu Wind farm Thermal generator

7 The specific of stochastic optimization of energy systems 7 SAA discrete continuous Sampling Statistical inference uncertainty is expensive Only a small number of samples are available.

8 Standard methodology for stochastic programming – Linderoth, Shapiro, Wright (2004) 8  Lower bound CI  CI for based on M batches of N samples  Upper bound CI  CI for (obtained similarly)  Needs a relatively large number of samples (2MN)  First-order correct and therefore unreliable for small number of samples Correctness of a CI – order k if

9 Our approach for SP with low-size samples 9 1. Novel estimator 2. Bootstrapping  Converges one order faster than –Excepting for a set whose measure converges exponentially to 0.  Allows the construction of reliable CIs in the low-size samples situation.  Bootstrap CIs are second-order correct M. Anitescu, C. Petra: “Higher-Order Confidence Intervals for Stochastic Programming using Bootstrapping”, submitted to Math. Prog.

10 The estimator 10  L is the Lagrangian of SP and J is the Jacobian of the constraints  is the solution of the SAA problem – obtained using N samples  Intended for nonlinear recourse terms Theorem 1: (Anitescu & P.) Under some regularity and smoothness conditions Proof: based on the theory of large deviations.  CIs constructed for are based on a second batch of N samples.  A total of 2N sample needed when using bootstrapping

11 Bootstrapping – a textbook example  1. 1930 population = 1920 population X mean of the ratios  2. needs the distribution of the ratios - not enough samples -> Bootstrapping –Sample the existing samples (with replacement) –For each sample compute the mean –Bootstrapping distribution is obtained –Build CIs based on the bootstrapping distribution 11 Histogram for the ratio of 1930 and 1920 populations for N=49 US cities “Bootstrapped” distribution clearly not a Gaussian Bootstrap CIs outperform normal CIs. US population known in 1920. 1930 population of 49 cities known Want 1. estimation of the 1930 population 2. CIs for the estimation Solution

12 The methodology of bootstrapping  BCa (bias corrected and accelerated) confidence intervals –second-order correct –the method of choice when an accurate estimate of the variance is not available 12

13 What does bootstrapping do?  Edgeworth expansions for cdfs  Bootstrapping accounts also for the second term in the expansion  The quantiles are also second-order correct (Cornish-Fisher inverse expansions)  (Some) Bootstrapped CIs are second-order correct 13 Reference: Peter Hall, “The Bootstrap and Edgeworth Expansion”, 1994.

14 Bootstrapping the estimator 14 Theorem 2: (Anitescu & P.) Let be a second order bootstrapping confidence interval for. Then for any

15 Numerical order of correctness 15 Correctness order 0.32 Correctness order 0.82 Correctness order 1.14 Correctness order 2.11 Observed order of correctness

16 Coverage for small number of samples 16

17 Concluding remarks and future work  Proposed and analyzed a novel statistical estimator for the optimal solution of nonlinear stochastic optimization  Almost second order correct confidence intervals using bootstrapping  Theoretical properties confirmed by numerical testing  Some assumptions are rather strict and can/should be relaxed  Parallelization of the CI computations for large problems needed 17

18 Thank you for your attention! Questions? 18

19 Bootstrapping - theory 19 Edgeworth expansions for pdfs Bootstrapping also accounts for the second term of in the expansion.  Cornish-Fisher expansion for quantiles (inverting Edgeworth expansion)  Bootstrapped quantiles possess similar expansion  But  (Some) Bootstrap CIs are second-order correct (Hall’s book is really detailed on this)


Download ppt "Higher-order Confidence Intervals for Stochastic Programming using Bootstrapping Cosmin G. Petra Joint work with Mihai Anitescu Mathematics and Computer."

Similar presentations


Ads by Google