Design of a Sampling Network for an Estuary in the Colombian Caribbean, Using Geostatistical Methods. 1.INTRODUCTION In environmental statistics, model.

Slides:



Advertisements
Similar presentations
Spatial point patterns and Geostatistics an introduction
Advertisements

Spatial point patterns and Geostatistics an introduction
Brief introduction on Logistic Regression
Copyright © 2006 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 ~ Curve Fitting ~ Least Squares Regression Chapter.
Kriging.
1 12. Principles of Parameter Estimation The purpose of this lecture is to illustrate the usefulness of the various concepts introduced and studied in.
Likelihood Ratio, Wald, and Lagrange Multiplier (Score) Tests
SPATIAL DATA ANALYSIS Tony E. Smith University of Pennsylvania Point Pattern Analysis Spatial Regression Analysis Continuous Pattern Analysis.
Basic geostatistics Austin Troy.
How Many Samples are Enough? Theoretical Determination of the Critical Sampling Density for a Greek Clay Quarry. by K. Modis and S. Stavrou, Nat. Tech.
Aim Considerable relations have recently been given to the environmental hazardous caused by agricultural chemicals such as excess fertilizers. Evaluating.
Spatial prediction Stat 498A, Winter The geostatistical model Gaussian process  (s)=EZ(s) Var Z(s) < ∞ Z is strictly stationary if Z is weakly.
WFM 6202: Remote Sensing and GIS in Water Management © Dr. Akm Saiful IslamDr. Akm Saiful Islam WFM 6202: Remote Sensing and GIS in Water Management Akm.
Meet the professor Friday, January 23 at SFU 4:30 Beer and snacks reception.
Using the Maryland Biological Stream Survey Data to Test Spatial Statistical Models A Collaborative Approach to Analyzing Stream Network Data Andrew A.
Introduction This project deals with conversion of Vector based Probable Maximum Precipitation (PMP) data into Raster based PMP data using different interpolation.
Curve-Fitting Regression
Spatial Interpolation
Applied Geostatistics
Analysis of Experimental Data for Flow Thorough Fractures using Geostatistics DICMAN ALFRED Dr. ERWIN PUTRA Dr. DAVID SCHECHTER.
Deterministic Solutions Geostatistical Solutions
Bayesian kriging Instead of estimating the parameters, we put a prior distribution on them, and update the distribution using the data. Model: Matrix with.
Bayesian kriging Instead of estimating the parameters, we put a prior distribution on them, and update the distribution using the data. Model: Prior: Posterior:
Ordinary Kriging Process in ArcGIS
Mapping Chemical Contaminants in Oceanic Sediments Around Point Loma’s Treated Wastewater Outfall Kerry Ritter Ken Schiff N. Scott Urquhart Dawn Olson.
Statistical Tools for Environmental Problems NRCSE.
Spatial statistics 2 Stat 518 Sp 08. Ordinary kriging where and kriging variance.
Applications in GIS (Kriging Interpolation)
Method of Soil Analysis 1. 5 Geostatistics Introduction 1. 5
A PREDICTION APPROACH TO AGRICULTURAL LAND USE ESTIMATION Ambrosio L., Marín C., Iglesias L., Montañés J., Rubio L.A. Universidad Politécnica Madrid. Spain.
EXPLORING SPATIAL CORRELATION IN RIVERS by Joshua French.
Weed mapping tools and practical approaches – a review Prague February 2014 Weed mapping tools and practical approaches – a review Prague February 2014.
Spatial Interpolation of monthly precipitation by Kriging method
Spatial Statistics Jonathan Bossenbroek, PhD Dept of Env. Sciences Lake Erie Center University of Toledo.
Edoardo PIZZOLI, Chiara PICCINI NTTS New Techniques and Technologies for Statistics SPATIAL DATA REPRESENTATION: AN IMPROVEMENT OF STATISTICAL DISSEMINATION.
Explorations in Geostatistical Simulation Deven Barnett Spring 2010.
Geog. 579: GIS and Spatial Analysis - Lecture 21 Overheads 1 Point Estimation: 3. Methods: 3.6 Ordinary Kriging Topics: Lecture 23: Spatial Interpolation.
Geographic Information Science
Spatial Statistics in Ecology: Continuous Data Lecture Three.
A B S T R A C T The study presents the application of selected chemometric techniques to the pollution monitoring dataset, namely, cluster analysis,
Spatially Assessing Model Error Using Geographically Weighted Regression Shawn Laffan Geography Dept ANU.
PROBABILITY AND STATISTICS FOR ENGINEERING Hossein Sameti Department of Computer Engineering Sharif University of Technology Principles of Parameter Estimation.
Spatial Analysis & Geostatistics Methods of Interpolation Linear interpolation using an equation to compute z at any point on a triangle.
Vamsi Sundus Shawnalee. “Data collected under different conditions (i.e. treatments)  whether the conditions are different from each other and […] how.
Applied Quantitative Analysis and Practices LECTURE#31 By Dr. Osman Sadiq Paracha.
Lecture 6: Point Interpolation
Short course on space-time modeling Instructors: Peter Guttorp Johan Lindström Paul Sampson.
Interpolation and evaluation of probable Maximum Precipitation (PMP) patterns using different methods by: tarun gill.
Geology 6600/7600 Signal Analysis 04 Nov 2015 © A.R. Lowry 2015 Last time(s): Discussed Becker et al. (in press):  Wavelength-dependent squared correlation.
Geo479/579: Geostatistics Ch7. Spatial Continuity.
INTEGRATING SATELLITE AND MONITORING DATA TO RETROSPECTIVELY ESTIMATE MONTHLY PM 2.5 CONCENTRATIONS IN THE EASTERN U.S. Christopher J. Paciorek 1 and Yang.
Stochastic Hydrology Random Field Simulation Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
Geostatistics GLY 560: GIS for Earth Scientists. 2/22/2016UB Geology GLY560: GIS Introduction Premise: One cannot obtain error-free estimates of unknowns.
Geo479/579: Geostatistics Ch12. Ordinary Kriging (2)
By Russ Frith University of Alaska at Anchorage Civil Engineering Department Estimating Alaska Snow Loads.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Exposure Prediction and Measurement Error in Air Pollution and Health Studies Lianne Sheppard Adam A. Szpiro, Sun-Young Kim University of Washington CMAS.
12. Principles of Parameter Estimation
Ch3: Model Building through Regression
Statistical Methods for Model Evaluation – Moving Beyond the Comparison of Matched Observations and Output for Model Grid Cells Kristen M. Foley1, Jenise.
NRCSE 2. Covariances.
International ‘EURASIA Natural Nutrition and Healthy Life Summit’ 2017
Inference for Geostatistical Data: Kriging for Spatial Interpolation
2. Stratified Random Sampling.
Paul D. Sampson Peter Guttorp
Stochastic Hydrology Random Field Simulation
2. Stratified Random Sampling.
Spatial Data Analysis: Intro to Spatial Statistical Concepts
Problem: Interpolation of soil properties
12. Principles of Parameter Estimation
Presentation transcript:

Design of a Sampling Network for an Estuary in the Colombian Caribbean, Using Geostatistical Methods. 1.INTRODUCTION In environmental statistics, model based and design based approaches are used to solving the problem of estimating the size of the sample and the location of the sampling sites (Caselton & Zidek, 1984; Aldworth & Cressie, 1999; Groenigen, 2000; Caeiro et al., 2003). For spatial-mean predicition over the local region, ordinary kriging predictor (model based approach) is better than classical design-based estimators, when an appropiate model is choice (Aldworth & Cressie, 1999). In this situation, good designs tend to spread point uniformly in the region (Mc Bratney et al., 1981; Olea, 1984; Cox et al., 1997). Accordingly, the problem of design sampling networks for local estimation is limited to establishing for sampling networks of different size, with a regular grid, the relationship between the maximum prediction variances and their associated costs. As the kriging variances are influences by spatial correlation, it is very important to have good estimates of the semivariogram parameters (Groenigen, 2000). When model based geostatistics (Diggle & Ribeiro, 2000) is assumed, maximum likelihood (ML) would be preferred in order to estimate the parameters of the spatial correlation model instead of ordinary least squares (Stein, 1999) Ramón Giraldo H 1,2 1 Ph. D. Student. Statistical and Operational Research. Polytechnic University of Catalonia, Barcelona, Spain Associated Professor. Statistics Department. National University of Colombia, Bogotá, Colombia. . REFERENCES 1. Aldworth, J. & N. Cressie Sampling designs and prediction methods for gaussian spatial processes. In "Multivariate analysis, design of experiments and survey sampling" (S. Ghosh, ed.), pp Marcel Dekker Inc, New York, USA 2.Caeiro, S., Painho, M., Goovaerts, P., Costa, H. and S. Sousa Spatial sampling design for sediment quality assessment in estuaries. Environ. Mod. & Soft., 18: Caselton, W. & J. Zidek Optimal monitoring networks designs. Statistics & Probability Letters, 2: Cox, D., Cox, L & K. Ensor Spatial sampling and the environment: some issues and directions. Environ. Ecol. Stat., 4: Diggle, P. & P. Ribeiro Model based geostatistics. 14 Sinape, Brasil. 6.Groenigen, J The influence of variogram parameters on optimal sampling schemes for mapping by kriging. Geoderma, 97: McBratney, A., Webster, R. & T. Burgess The design of optimal sampling schemes for local estimation and mapping of regionalized variables I. Computers and Geosciences, 7(4): Olea, R Sampling design optimization for spatial functions. Math. Geol., 16: Ribeiro, P & P. Diggle geoR. Package for geostatistical data analysis. R- NEWS, Vol 1, No 2, Stein, M Interpolation of spatial data. Some theory for kriging. Springer I ABSTRACT. A network for monitoring physical chemistry and biological variables in the Ciénaga Grande de Santa Marta (CGSM) estuary, located in the Caribbean coast of Colombia, was designed. Initially a set of 114 sampling points was chosen to measure the considered variables (Fig. 1.a). Based on the data, a spatial auto-correlation structure for each variable was estimated, using the Matérn model and maximum likelihood. Some variables were assumed Gaussian. In other cases it was necessary to transform the variables in order to obtain Gaussian processes. Later, for different size networks, the kriging prediction variances were calculated, taking the adjusted autocorrelation models as a basis. The comparison among the prediction variances for the different networks and their associated costs allowed establishing a set of sampling sites, that at a reasonable cost, substantially diminishes the prediction error for the variables of interest. Key Words: Estuary, geostatistics, Gaussian processes, ML estimation, sampling networks. 3. RESULTS. The adjusted Matern models (table 1) show strong spatial dependence for some variables (temperature, nitrites, salinity) in the area. The ranges are relatively high because the distance between the extreme north and south of the system (the longest distance) is not more than 20 km. The nugget was not greater than 50% of the sill. This, is recommendable for the spatial correlation model describe adequately reality (Caeiro et al.,2003) 2. MATERIALS AND METHODS The information used for the analysis was obtained in March 1997, at the CGSM (Fig. 1). Water samples from the surface of the water column were analyzed for the following variables: temperature (Cº), salinity, total suspended solids (mg l -1 ), depth (m), silicates (  mol l - 1 ), chlorophyll “a” (  g l -1 ), dissolved oxygen (mg l -1 ), nitrites (  mol l -1 ) and chlorophyll “c” (  g l -1 ). Between 103 and 114 observations were obtained for each variable. The data was taken throughout the system by systematic samples of squares of 4 km 2. For each variable, the spatial auto-correlation structure was estimated by ML assuming Gaussian processes (dissolved oxygen and nitrites were log- transformed. Chlorophyll “c” was transformed with =0.35 using Box-Cox transformation) and correlation models of the Matérn Family (Diggle & Ribeiro, 2004). Sampling networks were simulated with distances of 2 (the observed), 3, 4, 5 and 6 km between points (Fig.1) and kriging prediction over 1000 unsampled points was carried out with each one. The corresponding mean prediction variances of each variable were estimated and related to the associated costs in each sampling density. The final decision on the proposed sampling network was based on practical criteria founded on the prediction variance-cost relationship. The analysis it was carried out using geoR package (Ribeiro & Diggle, 2001) Figure 2. Increase in precision (% reduction standard prediction error) each sampling network respect 6000 m network (least sampling points) Figure 3. Sampling cost for each variable on five sampling network 2  Chlorophyll “c” ( = 0.35) Chlorophyll “a” Silicates Log (Nitrites) Suspended Solids Log (Oxygen) Salinity Temperature Depth Variable k Table 1. ML estimation of spatial correlation Matérn model.  Salinity is the variable which the greatest gain in precision was obtained (35%) when changing to the less dense network to the densest (Fig. 2). Other variables such as temperature, dissolved oxygen, silicates, and chlorophyll “a” had increases in precision that varied between 15.9% and 23.8% (Fig. 2). Finally, for depth, nitrites, total suspended solids and chlorophyll “c”, the increase in precision was only in percentages between 5.7% and 10.1% (Fig. 2). Obviously, when comparing the intermediate networks, those with grid distances between 3, 4, and 5 km., with the 6 km network, the relative increase in precision was much less. The sampling costs associated each variable under each sampling density were different (Fig.3). Temperature, depth, and salinity had cost was much lower than other variables. For some of the variables (dissolved oxygen, silicates, and chlorophyll) going from a 3000 m network to a 2000 m the sampling, increased cost about US $240. Hence, for temperature and salinity, it would be more convenient to make an intense sampling (the densest network) as this would increase the efficiency in a considerable percentage (Fig. 2), with costs increased only about US $90 (Fig 3). For depth, even if the sampling costs are not significantly increased (Fig.3), is more recommendable to sample it in the less dense network, given that the efficiency is increased by a maximum of 7% in comparison with other networks (Fig. 2). For nitrite, total suspended solids and chlorophyll “c”, there is only a little increase in the efficiency with increasing network density; on the contrary, the costs, especially in the 2000 m network, increase considerably (Fig. 3). Hence the less dense networks (5000 m and 6000 m between sampling points) are the most adequate for the follow-up of these variables. With variables dissolved oxygen, silicates and chlorophyll “a” the decision is complex given that there are considerable increases in the costs (Fig. 3) and the efficiences with increasing samples (Fig. 2). A global analysis of the increases in cost and in efficiency show that the 2 km network is the least recommendable given that, compared to the 3 km one, there is a high increase in costs (more that 200%) but the relative efficiency increases in only 4.9%. While in relative terms, the change in efficiency and the costs going from one network to another with a greater number of points is similar (with the exception of the 2 km one), the networks with distances between sampling points of 4 km and 5 km should be considered to be the most advisable, given that they produce a greater efficiency that the one obtained in the 6 km, with slightly higher costs. The suggestion given in the foregoing paragraph about the optimum sampling arrangement to monitoring the variables considered in the ecosystem, are not absolute. In the final analysis, while comparing the functions of cost and of statistical efficiency, many purely empirical criteria have been used. Nevertheless it is considered that the agencies that make the final decision should have a tool that allows them to plan the most adequate monitoring strategy for the future. Figure 1. The sampling networks under which the estimation of the prediction variances were made for each variable The distances between the sampling points: a) 2 km; b) 3 km; c) 4 km; d) 5 km and e) 6 km