Groundwater. Notes on geostatistics Monica Riva, Alberto Guadagnini Politecnico di Milano, Italy Key reference: de Marsily, G. (1986), Quantitative Hydrogeology.

Slides:



Advertisements
Similar presentations
Spatial point patterns and Geostatistics an introduction
Advertisements

Spatial point patterns and Geostatistics an introduction
Approximate Analytical/Numerical Solutions to the Groundwater Transport Problem CWR 6536 Stochastic Subsurface Hydrology.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Sensitivity Analysis In deterministic analysis, single fixed values (typically, mean values) of representative samples or strength parameters or slope.
Upscaling and effective properties in saturated zone transport Wolfgang Kinzelbach IHW, ETH Zürich.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
Aspects of Conditional Simulation and estimation of hydraulic conductivity in coastal aquifers" Luit Jan Slooten.
Maximum likelihood (ML) and likelihood ratio (LR) test
Deterministic Solutions Geostatistical Solutions
Spatial Interpolation
Applied Geostatistics
Development of Empirical Models From Process Data
Prediction and model selection
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
Deterministic Solutions Geostatistical Solutions
Linear and generalised linear models
STAT 592A(UW) 526 (UBC-V) 890-4(SFU) Spatial Statistical Methods NRCSE.
Ordinary Kriging Process in ArcGIS
Derivation of the Gaussian plume model Distribution of pollutant concentration c in the flow field (velocity vector u ≡ u x, u y, u z ) in PBL can be generally.
Statistical Tools for Environmental Problems NRCSE.
Maximum likelihood (ML)
Applications in GIS (Kriging Interpolation)
Method of Soil Analysis 1. 5 Geostatistics Introduction 1. 5
Lecture II-2: Probability Review
Statistical Methods For Engineers ChE 477 (UO Lab) Larry Baxter & Stan Harding Brigham Young University.
Component Reliability Analysis
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Geo479/579: Geostatistics Ch17. Cokriging
BsysE595 Lecture Basic modeling approaches for engineering systems – Summary and Review Shulin Chen January 10, 2013.
Non-Linear Models. Non-Linear Growth models many models cannot be transformed into a linear model The Mechanistic Growth Model Equation: or (ignoring.
Geo479/579: Geostatistics Ch12. Ordinary Kriging (1)
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Module 1: Statistical Issues in Micro simulation Paul Sousa.
Formulation of the Problem of Upscaling of Solute Transport in Highly Heterogeneous Formations A. FIORI 1, I. JANKOVIC 2, G. DAGAN 3 1Dept. of Civil Engineering,
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.
Geo597 Geostatistics Ch9 Random Function Models.
Explorations in Geostatistical Simulation Deven Barnett Spring 2010.
Geog. 579: GIS and Spatial Analysis - Lecture 21 Overheads 1 Point Estimation: 3. Methods: 3.6 Ordinary Kriging Topics: Lecture 23: Spatial Interpolation.
Geographic Information Science
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
Spatial Statistics in Ecology: Continuous Data Lecture Three.
Approximate Analytical Solutions to the Groundwater Flow Problem CWR 6536 Stochastic Subsurface Hydrology.
Spatial Analysis & Geostatistics Methods of Interpolation Linear interpolation using an equation to compute z at any point on a triangle.
Review of Random Process Theory CWR 6536 Stochastic Subsurface Hydrology.
Introduction to kriging: The Best Linear Unbiased Estimator (BLUE) for space/time mapping.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Review of Probability. Important Topics 1 Random Variables and Probability Distributions 2 Expected Values, Mean, and Variance 3 Two Random Variables.
Short course on space-time modeling Instructors: Peter Guttorp Johan Lindström Paul Sampson.
Geology 6600/7600 Signal Analysis 04 Nov 2015 © A.R. Lowry 2015 Last time(s): Discussed Becker et al. (in press):  Wavelength-dependent squared correlation.
(Z&B) Steps in Transport Modeling Calibration step (calibrate flow & transport model) Adjust parameter values Design conceptual model Assess uncertainty.
Lab for Remote Sensing Hydrology and Spatial Modeling Dept of Bioenvironmental Systems Engineering National Taiwan University 1/45 GEOSTATISTICS INTRODUCTION.
Lecture 3: MLE, Bayes Learning, and Maximum Entropy
Stochastic Analysis of Groundwater Flow Processes CWR 6536 Stochastic Subsurface Hydrology.
Lecture II-3: Interpolation and Variational Methods Lecture Outline: The Interpolation Problem, Estimation Options Regression Methods –Linear –Nonlinear.
Stats Term Test 4 Solutions. c) d) An alternative solution is to use the probability mass function and.
Stochastic Hydrology Random Field Simulation Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
Geostatistics GLY 560: GIS for Earth Scientists. 2/22/2016UB Geology GLY560: GIS Introduction Premise: One cannot obtain error-free estimates of unknowns.
Geo479/579: Geostatistics Ch12. Ordinary Kriging (2)
Spatial Point Processes Eric Feigelson Institut d’Astrophysique April 2014.
A Framework and Methods for Characterizing Uncertainty in Geologic Maps Donald A. Keefer Illinois State Geological Survey.
Approximate Analytical Solutions to the Groundwater Flow Problem CWR 6536 Stochastic Subsurface Hydrology.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
CWR 6536 Stochastic Subsurface Hydrology Optimal Estimation of Hydrologic Parameters.
Chapter 7. Classification and Prediction
Ch9 Random Function Models (II)
Inference for Geostatistical Data: Kriging for Spatial Interpolation
Paul D. Sampson Peter Guttorp
Stochastic Hydrology Random Field Simulation
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Presentation transcript:

Groundwater. Notes on geostatistics Monica Riva, Alberto Guadagnini Politecnico di Milano, Italy Key reference: de Marsily, G. (1986), Quantitative Hydrogeology. Academic Press, New York, 440 pp

In practice: random spatial variability of hydrogeologic medium properties, and stochastic nature of corresponding flow (hydraulic head, fluid flux and velocity) and transport (solute concentration, solute flux and velocity) variables, are often ignored. Instead, the common approach has been to analyse flow and transport in multiscale, randomly heterogeneous soils and rocks deterministically. Yet with increasing frequency, the popular deterministic approach to hydrogeologic analysis is proving to be inadequate.    Modelling flow and transport in heterogenous media motivation and general idea

Understanding the role of heterogeneity Jan 2000 editorial "It's the Heterogeneity!“ (Wood, W.W., It’s the Heterogeneity!, Editorial, Ground Water, 38(1), 1, 2000): heterogeneity of chemical, biological, and flow conditions should be a major concern in any remediation scenario. Many in the groundwater community either failed to "get" the message or were forced by political considerations to provide rapid, untested, site- specific active remediation technology. "It's the heterogeneity," and it is the Editor's guess that the natural system is so complex that it will be many years before one can effectively deal with heterogeneity on societally important scales. Panel of experts (DOE/RL-97-49, April 1997): As flow and transport are poorly understood, previous and ongoing computer modelling efforts are inadequate and based on unrealistic and sometimes optimistic assumptions, which render their output unreliable.

Flow and Transport in Multiscale Fields (conceptual) conductivities & dispersivities varyscale of observation Field & laboratory-derive conductivities & dispersivities appear to vary continuously with the scale of observation (conductivity support, plume travel distance). Anomalous transport. theorieslinkscale-dependencemultiscale structure Recent theories attempt to link such scale-dependence to multiscale structure of Y = ln K. Predicteffect of domain size Predict observed effect of domain size on apparent variance and integral scale of Y. Predictsupra linear growth rate of dispersivity Predict observed supra linear growth rate of dispersivity with mean travel distance (time). Major challenge Major challenge: develop more powerful/general stochastic theories/models for multiscale random media, and back them with lab/field observation.

Shed some light Conceptual difficulty: Data deduced by means of deterministic Fickian models from laboratory and field tracer tests in a variety of porous and fractured media, under varied flow and transport regimes. Linear regression: a La  s 1.5 Supra-linear growth Neuman S.P., On advective transport in fractal permeability and velocity fields, Water Res. Res., 31(6), , 1995.

Natural Variability. Geostatistics revisited Introduction: Few field findings about spatial variability Regionalized variables Interpolation methods Simulation methods

AVRA VALLEY Clifton and Neuman, 1982 Clifton, P.M., and S.P. Neuman, Effects of Kriging and Inverse Modeling on Conditional Simulation of the Avra Valley Aquifer in southern Arizona, Water Resour. Res., 18(4), , Regional Scale

Columbus Air Force [Adams and Gelhar, 1992] Aquifer Scale

Mt. Simon aquifer Bakr, 1976 Local Scale

Summary: Variability is present at all scales But, what happens if we ignore it? We will see in this class that this would lead to interpretation problems in both groundwater flow and solute transport phenomena Examples in transport: - Scale effects in dispersion - New processes arising Heterogeneous parameters: ALL (T, K,, S, v (q), BC,...) Most relevant one: T (2D), or K (3D), as they have been shown to vary orders of magnitude in an apparently homogeneous aquifer

Variability in T and/or K Summary of data from many different places in the world. Careful though! Data are not always obtained with rigorous procedures, and moreover, as we will see throughout the course, data depend on interpretation method and scale of regularization Data given in terms of mean and variance (dispersion around the mean value)

Variability in T and/or K Almost always σ lnT (or σ lnK ) < 2 (and in most cases <1) This can be questioned, but OK by now Correlation scales (very important concept later!!)

But, what is the correct treatment for natural heterogeneity? First of all, what do we know? - real data at (few) selected points - Statistical parameters - A huge uncertainty related to the lack of data in most part of the aquifer. If parameter continuous (of course they are), then the number of locations without data is infinity Note: The value of K at any point DOES EXIST. The problem is we do not know it (we could if we measured it, but we could never be exhaustive anyway) Stochastic approach: K at any given point is RANDOM, coming from a predefined (maybe known, maybe not) pdf, and spatially correlated REGIONALIZED VARIABLE

Regionalized Variables T(x,ω) is a Spatial Random Function iif: -If ω = ω 0 then T(x,ω 0 ) is a spatial function (continuity?, differentiability?) -If x = x 0 then T(x 0 ) (actually T(x 0, ω)) is a random function Thus, as a random function, T(x 0 ) has a univariate distribution (log-normal according to Law, 1944; Freeze, 1975)

Hoeksema and Kitanidis, 1985

Hoeksema & Kitanidis, 1985 Log-T normal, log-K normal Both consolidated and unconsolidated deposits

Now we look at T(x), so we are interested in the multivariate distribution of T(x 1 ), T(x 2 ),... T(x n ): Most frequent hypothesis: Y=(Y(x 1 ), Y(x 2 ),... Y(x n ))=(ln T(x 1 ), ln T(x 2 ),... ln T(x n )) Is multinormal with But most important: NO INDEPENDENCE

What if independent? and then we are in classical statistics But here we are not, so we need some way to characterize dependency of one variable at some point with the SAME variable at a DIFFERENT point. This is the concept of the SEMIVARIOGRAM (or VARIOGRAM)

Classification of SRF Second order stationary E[Z(x)]=const C(x, y) is not a function of location ( only of separation distance, h ) Particular case: isotropic RSF; C(h) = C(h) Anisotropic covariance: different correlation scales along different directions Most important property: if multinormal distribution, first and second order moments are enough to fully characterize the SRF multivariate distribution

Relaxing the stationary assumption 1. The assumption of second-order stationarity with finite variance, C(0), might not be satisfied (The experimental variance tends to increase with domain size) 2. Less stringent assumption: INTRINSIC HYPOTHESIS The variance of the first-order increments is finite AND these increments are themselves second-order stationary. Very simple example: hydraulic heads ARE non intrinsic SRF E[Y(x + h) – Y(x)] = m(h) var[Y(x + h) – Y(x)] =  (h) Independent of x; only function of h Usually: m(h) = 0; if not, just define a new function, Y(x) – m(x), which satisfies this consition Definition of variogram,  (h) E[Y(x + h) – Y(x)] = 0  (h) = (1/2) var[Y(x + h) – Y(x)] = (1/2) E[(Y(x + h) – Y(x)) 2 ]

Variogram v. Covariance 1. The variogram is the mean quadratic increment of Y between two points separated by h. 2. Compare the INTRINSIC HYPOTHESIS with SECOND-ORDER STATIONARITY E[Y(x)] = m = constant  (h) = (1/2) E[(Y(x + h) – Y(x)) 2 ] = = (1/2) ( E[Y(x + h) 2 ] + E[Y(x) 2 ] – 2 m 2 – 2 E[Y(x + h) Y(x)] + 2 m 2 ) = = C(0) – C(h) covariance variogram h

The variogram The definition of the Semi-Variogram is usually given by the following probabilistic formula When dealing with real data the semi-variogram is estimated by the Experimental Semi-Variogram. For a given separation vector, h, there is a set of observation pairs that are approximately separated by this distance. Let the number of pairs in this set be N(h). The experimental semi-variogram is given by:

Some comments on the variogram If Z(x) and Z(x+h) are totally independent, then If Z(x) and Z(x+h) are totally dependent, then One particular case is when x = x+h. Therefore, by definition In the stationary case:

Variogram Models DEFINITIONS: Nugget Sill Range Integral distance or correlation scale Models: Pure Nugget Spherical Exponential Gaussian Power

Correlation scales: Larger in T than in K. Larger in horizontal than in vertical. Fraction of the domain of interest

Additional comments Second order stationary E[Z(x)]=constant  (h) is not a function of location Particular case: isotropic RSF  (h) =  (h) Anisotropic variograms: two types of anisotropy depending on correlation scale or sill value Important property:  (h) =  2 – C(h) Most important property: if multinormal distribution, first and second order moments are enough to fully characterize the SRF multivariate distribution

Estimation vs. Simulation Problem: Few data available, maybe we know mean, variance and variogram Alternatives: (1) Estimation (interpolation) problems: KRIGING Kriging – BLUE Extremely smooth Many possible krigings Alternative: cokriging

The kriging equations - 1 We want to predict the value, Z(x 0 ), at an unsampled location, x 0, using a weighted average of the observed values at N neighboring locations, {Z(x 1 ), Z(x 2 ),..., Z(x N )}. Let Z*(x 0 ) represent the predicted value; a weighted average estimator be written as The associated estimation error is In general, we do not know the (constant) mean, m, in the intrinsic hypothesis. We impose the additional condition of equivalence between the mathematical expectation of Z* and Z 0.

The kriging equations - 2 This condition allows obtaining an unbiased estimator. Unknown mathematical expectation of the process Z.

The kriging equations - 3 We wish to determine the set of weights. IMPOSE the condition

The kriging equations - 4 We then use the definition of variogram THEN: Which I will use into:

The kriging equations - 5 By substitution We finally obtain: Noting that:

The kriging equations - 6 This is a constrained optimization problem. To solve it we use the method of Lagrange Multipliers from the calculus of variation. The Lagrangian objective function is To minimize this we must take the partial derivative of the Lagrangian with respect to each of the weights and with respect to the Lagrange multiplier, and set the resulting expressions equal to zero, yielding a system of linear equations

The kriging equations - 7 Minimize this: and get (N+1) linear equations with (N+1) unknowns

The kriging equations - 8 The complete system can be written as: A = b

The kriging equations - 9 We finally get the Variance of the Estimation Error Hint: just replace into

Estimation vs. Simulation (ii) (2) Simulations: try to reproduce the “look” of the heterogeneous variable Important when extreme values are important Many (actually infinite) solutions, all of them equilikely (and with probability = 0 to be correct) For each potential application we are interested in one or the other

Estimation. 1 AVRA VALLEY. Regional Scale - Clifton, P.M., and S.P. Neuman, Effects of Kriging and Inverse Modeling on Conditional Simulation of the Avra Valley Aquifer in southern Arizona, Water Resour. Res., 18(4), , 1982.

Estimation. 2 AVRA VALLEY. Regional Scale - Clifton, P.M., and S.P. Neuman, Effects of Kriging and Inverse Modeling on Conditional Simulation of the Avra Valley Aquifer in southern Arizona, Water Resour. Res., 18(4), , 1982.

Estimation. 3 AVRA VALLEY. Regional Scale - Clifton, P.M., and S.P. Neuman, Effects of Kriging and Inverse Modeling on Conditional Simulation of the Avra Valley Aquifer in southern Arizona, Water Resour. Res., 18(4), , 1982.

Estimation. 4 AVRA VALLEY. Regional Scale - Clifton, P.M., and S.P. Neuman, Effects of Kriging and Inverse Modeling on Conditional Simulation of the Avra Valley Aquifer in southern Arizona, Water Resour. Res., 18(4), , 1982.

Estimation. 5 AVRA VALLEY. Regional Scale - Clifton, P.M., and S.P. Neuman, Effects of Kriging and Inverse Modeling on Conditional Simulation of the Avra Valley Aquifer in southern Arizona, Water Resour. Res., 18(4), , 1982.

Monte Carlo approach h1h1h1h1 h2h2h2h2 h simulations Statistical CONDITIONAL moments, first and second order CONDITIONAL CROSS-CORRELATED FIELDS Y = lnT

NUMERICAL ANALYSIS - MONTE CARLO Simple to understand Applicable to a wide range of linear and nonlinear problems High heterogeneities Conditioning  Heavy calculations  Fine computational grids  Reliable convergence criteria (?)  Evaluation of key statistics of medium parameters (K, porosity, …)  Synthetic generation of an ensemble of equally likely fields  Solution of flow/transport problems on each one of these  Ensemble statistics

Hydraulic head variance Number of Monte Carlo simulations Problems: reliable assessment of convergence – Ballio and Guadagnini [2004]