Uncertainty Quantification In Geosciences with Computationally Expensive Simulation Models (with last minute modification to Include Sustainability/Energy.

Slides:



Advertisements
Similar presentations
Advanced Piloting Cruise Plot.
Advertisements

Climate Prediction Applications Science Workshop
Weather Forecasting This chapter discusses: 1.Various weather forecasting methods, their tools, and forecasting accuracy and skill 2.Images for the forecasting.
Copyright © 2002 Pearson Education, Inc. Slide 1.
Chapter 1 The Study of Body Function Image PowerPoint
STATISTICS Joint and Conditional Distributions
Flood Risk Analysis – the USACE Approach
STATISTICS HYPOTHESES TEST (III) Nonparametric Goodness-of-fit (GOF) tests Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering.
1 Bayesian CTS FDA/Industry Workshop September 18, 2003Copyright Pharsight Case Study in the Use of Bayesian Hierarchical Modeling and Simulation.
1 Superior Safety in Noninferiority Trials David R. Bristol To appear in Biometrical Journal, 2005.
Sales Forecasting using Dynamic Bayesian Networks Steve Djajasaputra SNN Nijmegen The Netherlands.
Pacific Gas and Electric Company Long Term Procurement Plan Proceeding Renewable Integration Model Results and Model Demonstration October 22, 2010 Workshop.
Curtis Johnson Process Control Instrumentation Technology, 8e]
R E 1 Some Experiences with Data and Models in International (Water) Assessments Ton Bresser National Institute of Public Health and Environment Frans.
1. 2 Why are Result & Impact Indicators Needed? To better understand the positive/negative results of EC aid. The main questions are: 1.What change is.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
1 -Classification: Internal Uncertainty in petroleum reservoirs.
Evaluating Provider Reliability in Risk-aware Grid Brokering Iain Gourlay.
The Reality of Climate Change and its Impact on Water Production of Hydro systems in Iran Prof. H.Sedghi Member of Academy 1.
Solve Multi-step Equations
MA Metal Finishing Forum Tools and Techniques for Optimizing Metal Finishing Process/Environmental MA Metal Finishing Forum Kevin L. Klink, P.E.
Spoofing State Estimation
1 The tiling algorithm Learning in feedforward layered networks: the tiling algorithm writed by Marc M é zard and Jean-Pierre Nadal.
Water Distribution Systems – Part 1
1 SOURCE AND DISTRIBUTION OF WATER On completion of this segment you should be able to appreciate Water, an essential component Sources and application.
Randomized Algorithms Randomized Algorithms CS648 1.
Methods on Measuring Prices Links in the Fish Supply Chain Daniel V. Gordon Department of Economics University of Calgary FAO Workshop Value Chain Tokyo,
1 1 Mechanical Design and Production Dept, Faculty of Engineering, Zagazig University, Egypt. Mechanical Design and Production Dept, Faculty of Engineering,
5-1 Chapter 5 Theory & Problems of Probability & Statistics Murray R. Spiegel Sampling Theory.
Chapter 10 Project Cash Flows and Risk
Capacity Planning For Products and Services
Discrete Event (time) Simulation Kenneth.
Our Corporate Motto “WE DON’T PASS GAS”
VOORBLAD.
1 Breadth First Search s s Undiscovered Discovered Finished Queue: s Top of queue 2 1 Shortest path from s.
Hydrologic Statistics Reading: Chapter 11, Sections 12-1 and 12-2 of Applied Hydrology 04/04/2006.
Chapter 5 Microsoft Excel 2007 Window
Uncertainty and Sensitivity Analysis of Complex Computer Codes
© 2012 National Heart Foundation of Australia. Slide 2.
Formal models of design 1/28 Radford, A D and Gero J S (1988). Design by Optimization in Architecture, Building, and Construction, Van Nostrand Reinhold,
Understanding Generalist Practice, 5e, Kirst-Ashman/Hull
Chapter 5 Test Review Sections 5-1 through 5-4.
25 seconds left…...
Determining How Costs Behave
We will resume in: 25 Minutes.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
Chapter 8 Estimation Understandable Statistics Ninth Edition
Chapter 11: The t Test for Two Related Samples
Short-Term Financing 20 Chapter South-Western/Thomson Learning © 2003 See c20.xls for spreadsheets to accompany this chapter.c20.xls.
Simple Linear Regression Analysis
Multinational Capital Budgeting 14 Chapter South-Western/Thomson Learning © 2006 Slides by Yee-Tien (Ted) Fu.
Using a multivariate DOE method for congestion study under impacts of PEVs Hamed V. HAGHI M. A. GOLKAR K. N. Toosi University.
January Structure of the book Section 1 (Ch 1 – 10) Basic concepts and techniques Section 2 (Ch 11 – 15): Inference for quantitative outcomes Section.
1 McGill University Department of Civil Engineering and Applied Mechanics Montreal, Quebec, Canada.
Introduction into Simulation Basic Simulation Modeling.
Basics of Statistical Estimation
Surrogate Response Surface Methods for Optimization,
Advancements in Simulating Land Hydrologic Processes for Land Surface Modeling (LSM) Hua Su Presentation for Physical Climatology.
The Importance of Watershed Modeling for Conservation Policy Or What is an Economist Doing at a SWAT Workshop?
Representing Groundwater in Management Models Julien Harou University College London 2010 International Congress on Environmental Modelling and Software.
A Comparison of a SWAT model for the Cannonsville Watershed with and without Variable Source Area Hydrology Josh Woodbury Christine A. Shoemaker Dillon.
Working With Simple Models to Predict Contaminant Migration Matt Small U.S. EPA, Region 9, Underground Storage Tanks Program Office.
Environmental modeling application domains
Presentation transcript:

Uncertainty Quantification In Geosciences with Computationally Expensive Simulation Models (with last minute modification to Include Sustainability/Energy analysis. Christine A. Shoemaker School of Civil and Environmental Engineering and School of Operations Research and Information Engineering Cornell University

Uncertainty Quantification with Computationally Expensive Simulation Models Many problems in geoscience and environmental engineering are described by computationally expensive simulation models. Probably the main obstacle to doing rigorous statistical analysis of uncertainty for a given data set and model is computational time because: Simulation models typically need to be run many (e.g. thousands of ) times for uncertainty quantification Multiple types of uncertainty need to be incorporated including data error, model error, parameter error, randomness in model input (static and dynamic) I’m interested in developing new algorithms for this problem.

Examples of Static Geoscience Problems Requiring Uncertainty Quantification Examples of Goals: Determine the spatial distribution of different types of geologic materials in the subsurface Determine location of oil reservoirs or underground water Data sources: based on sound waves or radar (many spatial points but not highly accurate). Based on drilling into the subsurface (more accurate but very few spatial points because because of high cost of drilling each well.

Examples of Uncertainty Quantification Needs for Dynamic Geoscience Problems Examples of problems include: Forecasting man-made changes in fluid flow in geologic formations with effects on floods and contaminant transport Predicting the impact of Greenhouse gases on climate change over time. Forecasting dynamic response of multiple fluids in the subsurface to waste disposal (e.g. carbon sequestration)

Quantifying Uncertainty from Dynamic Model Predictions with Model Parameters based on Data Measured (Weather) Dynamic Input Predicted (Weather) Input-perhaps many scenarios to represent randomness Calibration Time Periods With Measured Static Input and Dynamic Output (flow, contamination, etc.) Forecasting Time Period Using Estimates of Static Input TIME Data is taken during Calibration time period and it can be used to establish parameter values (as deterministic values or parameters as random variables with pdf’s estimated based on data). We then want to forecast model output in the future using a probabilistic representation of model parameters and input. This forecast then has uncertainty including static and dynamic inputs, model error, and parameter error

Excess Phosphorous from Watershed would result in cost of How Do We Protect This Water From Pollution? This is New York City (NYC) water supply. Excess Phosphorous from Watershed would result in cost of $US 8 Billion Water Treatment Plant for NYC!

Study Area Cannonsville Reservoir Basin – agricultural basin New York City water supply P ‘restriction’ impedes economic growth of county Model incorporates over 20,000 data values available for this watershed. note eutrophication problem no filtration of drinking water here means that eutrophic reservoir causes taps to be shut off Filtration plant for this Basin is on the order of 8 billion dollars so avoidance is a priority!

Cow Waste (Manure) Is the Primary Source of Phosphorous Transported By Water to Lake Cow Waste (Manure) is Primary Source of Phosphorous Reaching River and Lake

SWAT2000: A Spatially Distributed Simulation Model with Subbasins, Land Use and Soil Type in “HRUs” Using a spatially distributed model helps us evaluate management options. Different than GWLF 43 subbasins 758 HRUs Avg HRU = 1.6 km2 SWAT model developed by USDA and is used worldwide.

Simulation Model Predictions of Annual Phosphorous Contamination: Repeating 1988-1999 Climates for 72 years By repeating 12 year blocks of climate, the trends in climate inputs are eliminated P load to reservoir simulated to increase over time! Increase is measured relative to first climate block = (38224-34297)/34297*100% = 11.4% increase in P load to reservoir over 72 yrs Data points in chart are the average of 12 years of total P loading. This is how future P loading trends will be assessed for other P management scenarios. *** In the absence of trends in climate, model predicts P loading to increase over time! Sept 12, 2003 10

Significance of Results on Phosphorous in Watershed The forecast of increase in phosphorous loading to the water (with no increase in human or animal activity) is very serious because: NY City’s drinking water quality will decline (and there might not be a replacement) Future cost could be Billions of dollars Steps should be taken now to stop the increase in phosphorous pollution Hence the rate of increase (11%/72 years) is important and we would like to know the uncertainty associated with it.

Sustainability/Energy Issues Development of renewable energy devices –fuel cells, photovoltaics, biofuels, etc. Wind Energy also has some of the same issues as the watershed analysis—you have a calibration period, but the real uncertainty is during the forecast period and is driven by variability in weather inputs (and fear of disasters). My group has started to work on Carbon Sequestration which is important in waste disposal and has some similarity to the problems that arise in nuclear waste disposal.

Carbon Sequestration Another Application where Uncertainty Quantification is Important Carbon sequestration: storage of super critical carbon dioxide in geological formations about 1 kilometer below the surface.

Uncertainty Quantification Importance for Carbon Sequestration The issue is that there are serious public health and environmental risks associated with the unexpected movement of CO2 upward to groundwater or to the surface. We would like to use a model plus monitoring data to generate an estimate of the CO2 plume in the ground to determine if the system is functioning OK and where the plume is going to move in the future.

Related Issues How do we efficiently quantify tail probabilities/rare events of high-impact? Can we augment the predictive ability by using experiments? How? Can we derive predictive reduced-order models and/or response surfaces? Likelihoods can be multimodal and/or rough.

Most Nonlinear Simulation Models will Lead to Likelihood Models with Multiple Local Minima Multi-Modal Problems have Multiple local minima F(x) Local minimum Global minimum X (parameter value)

Likelihood can be a Rough Surface because of Numerical Simulation Objective (likelihood) Function Versus Parameter Value

Our Approach to Uncertainty Quantification I have been working with statistician David Ruppert and others on quantification of uncertainty for computationally expensive simulation models. My approach has been to imbed both optimization search and response surfaces into the algorithms for uncertainty quantification to significantly reduce computational effort (by orders of magnitude). Our methods are designed to work with multi modal functions and rough surfaces.

Paper 1 on SOARS Blizniouk, N., D. Ruppert, C.A. Shoemaker, R. G. Regis, S. Wild, P. Mugunthan, “Bayesian Calibration of Computationally Expensive Models Using Optimization and Radial Basis Function Approximation.” Journal of Computational and Graphical Statistics, July 2008.

Notation Y0 - vector of observed data; η - parameters in the (joint) statistical model [list1|list2] - conditional density of random variables in list1 given list2, e.g. Is the conditional density of η given the data Y0 η= b in this example

Kernel estimates of the marginal posterior densities by a) (solid line) exact joint posterior obtained from conventional MCMC .Analysis with 10,000 function evaluations & b) (dashed lines) with our function approximation method with 150 function evaluations. One graph for each parameter.

OUTPUT UNCERTAINTY: Again we got excellent agreement between our approach with 150 evaluations and the conventional approach with 10,000 evaluations. Output Comparison

SWAT2000: A Spatially Distributed Simulation Model with Subbasins, Land Use and Soil Type in “HRUs” Using a spatially distributed model helps us evaluate management options. Different than GWLF 43 subbasins 758 HRUs Avg HRU = 1.6 km2 SWAT model developed by USDA and is used worldwide.

Marginal posterior Densities of the 5 parameters (βi) for watershed model This contains all the statistical information . Everything else is computed on the basis of this.

Joint Posterior Density of Six Different Model Outputs

Quantiles, Means, Standard Dev. These are based on statistically rigorous analysis with transformations to account for non normal data obtained with a small fraction of the number of simulations required by other methods including MCMC and GLUE

Questions?

Groundwater Simulation Model for Beijing Figure 1. Miyun-Huai-Shun groundwater aquifer with hydraulic head observation wells of the study area. The main aquifer covers 456 km2 of the basin area. This is primary water supply for Beijing. Upper layers of the Aquifer are contaminated, thereby significantly reducing Beijing’s water supply. Parameter estimation goal in order to understand effect of extracting water and potential for contamination. Model involves solving PDE’s.

Groundwater Aquifer is 3-dimensional. Below is a vertical cross section. Cross Hatched areas indicate different “conductivities”, which need to be estimated as parameters. Simulation model solves a systems of partial differential equations by finite difference method.