Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 of 27 How Many Samples do I Need? Part 2 Presenter: Sebastian Tindall (60 minutes) (5 minute “stretch” break) DQO Training Course Day 1 Module 5.

Similar presentations


Presentation on theme: "1 of 27 How Many Samples do I Need? Part 2 Presenter: Sebastian Tindall (60 minutes) (5 minute “stretch” break) DQO Training Course Day 1 Module 5."— Presentation transcript:

1 1 of 27 How Many Samples do I Need? Part 2 Presenter: Sebastian Tindall (60 minutes) (5 minute “stretch” break) DQO Training Course Day 1 Module 5

2 2 of 27 Analytical + Sampling & Sub-sampling + Natural heterogeneity of the site = Total Uncertainty Uncertainty is Additive! Remember the uncertainty is additive for all steps in sampling and analysis

3 3 of 27 What is the one phenomenon that causes ALL sampling error? HETEROGENEITY

4 4 of 27 The SYSTEM functions as if it believes that… { Data Uncertainty Automatically Managed Data Quality Prescriptive Analytical Methods = { Analytical Uncertainty Automatically Managed Decision Quality = { Decision Uncertainty Automatically Managed

5 5 of 27 Take-Home MessageNon- Representative Sample PerfectAnalyticalChemistry+ “BAD” DATA

6 6 of 27 Sample vs. Analytical Certainty 16 54 3 27 Analytical = 5% TOTAL ERROR Sampling = 95% 1,280 Onsite 1,220 Lab 331 Onsite 286 Lab 500 Onsite 416 Lab 164 Onsite 136 Lab 24,000 Onsite 27,700 Lab 27,800 Onsite 42,800 Lab 39,800 Onsite 41,400 Lab

7 7 of 27 Dilemma! Average is not specific to an area, or sub-sample size. None of the equations for the number of samples, or the average or standard deviation include size/area Some guidance suggests 1 sample/20 cu yd but this is indefensible Must decide on the scale of the decision or exposure unit to represent the population of interest Must sample within the scale of the decision unit

8 8 of 27 Typical Sampling Design EPA “ Methods for Evaluating the Attainment of Soil Cleanup Standards - Vol 1”, 1989 Equation 6.6 Wrong Often Usually way off or unknown Assumed Normal Distribution

9 9 of 27 n  5 1. Will usually fail to truly capture heterogeneity…. of population(s) 2. Results in large uncertainty which is seldom: - Identified - Quantified - or even Acknowledged Typical Sampling Design (cont.)

10 10 of 27 Uncertainty M o = M d = M n Normal Mo  Md  MnMo  Md  Mn Lognormal M 0 = mode M d = median M n = mean % of time when x <  is high, (when n is small)

11 11 of 27 Why Decisions are suspect n Failure to define population accurately n Failure to obtain representative data from the population of interest n Failure to accurately determine the frequency distribution of the COPCs n Failure to select the appropriate statistical method for generating adequate samples n Failure to use the appropriate UCL in making the decision

12 12 of 27 Definitions of Representativeness n A sample is representative when the mean square, r 2 SE, of Sampling Error (SE) is not larger than a certain standard of representativeness regarded as acceptable. n Representativeness is the sum of the square of the mean of SE (m SE ), and the variance of the SE (s 2 SE ). r 2 SE  m 2 SE + s 2 SE <= r 2 oSE Preparation of Soil Sampling Protocols: Sampling Techniques and Strategies, EPA/600/R-92/128, July 1992

13 13 of 27 Definitions of Representativeness n A sample collected in such a manner that the sampling error is less than a specified amount. n A sample of a universe or whole that can be expected to exhibit the average properties of the universe or whole (40 CFR 260.10). n A sample that answers a question about a population with a specified confidence Sampling for Environmental Activities, Envirostat, 2003

14 14 of 27 Definitions of Representativeness n Representativeness expresses the degree to which sample data accurately and precisely represents a characteristic of a population, parameter variations at a sampling point, or an environmental condition. Representativeness is a qualitative parameter which is most concerned with the proper design of the sampling program. The representativeness criterion is best satisfied by making certain that sampling locations are selected properly and a sufficient number of samples are collected. Representativeness is addressed by describing sampling techniques and the rational used to select sampling locations. DQOs for Remedial Response Activities: Development Process, US EPA 1987

15 15 of 27 Quantity of Data Matters. Why? WARNING: The Statistician General has determined that drawing conclusions from insufficient data may be hazardous to your decisions.

16 16 of 27 Sample Size Rules of Thumb “Samples of less than 10 are usually too small to rely on sample estimates even in ‘nice’ parametric cases.” “In many practical contexts, the number 30 is used as a ‘minimum’ sample size.” M.R. Chernick in Bootstrap Methods: A Practitioner's Guide, 1999, pp. 150, 151.

17 17 of 27 Sample Size Rules of Thumb “Although it is always dangerous to set ‘rules of thumb’ for sample sizes, I would suggest that in most cases it would be wise to take n ≥ 50.” M.R. Chernick in Bootstrap Methods: A Practitioner's Guide, 1999, p. 151.

18 18 of 27 Sample Size Rules of Thumb “For practical purposes it will be assumed here that a ‘too small number’ is less than 30, and a ‘large number’ is at least 50.” Pierre Gy in Sampling for Analytical Purposes, 1998, p. 70.

19 19 of 27 Sample Size Rules of Thumb “In practice, there appears to be no simple rule for determining how large n should be….If the distribution is highly skewed, an n of 50 or more may be required.” Richard Gilbert in Statistical Methods for Environmental Pollution Monitoring, 1987, p. 140.

20 20 of 27 Quantity of Data Matters. Why? “If the sample size is ‘large’ then most traditional estimators will yield the same conclusions and simple estimators suffice.” H. Lacayo, Jr. in Environmental Statistics: Handbook of Statistics Volume 12, 1994, p. 891.

21 21 of 27 “Lacking distribution information, it is impossible to devise an optimal sampling strategy.” - Jenkins, et. al. 1996. “Assessment of Sampling Error Associated with Collection and Analysis of Soil Samples at Explosives-Contaminated Sites” U.S. Army Corps of Engineers, Cold Regions Research & Engineering Laboratory, p. 1. http://www.crrel.usace.army.mil/techpub/CRREL_Reports/reports/SR96_15.pdf

22 22 of 27 How Many Samples do I Need? Begin with the End in Mind Optimal Sampling Design Alternative Sample Designs , , ,  Correct Equation for n (Statistical Method) Population Frequency Distribution Contaminant Concentrations in the Spatial Distribution of the Population The end DATA

23 23 of 27 Q: Where do you obtain the contaminant distribution information in order to select the correct sampling design to ensure representativeness, etc? A: From sampling data. Q: How much sampling data do you need? A: Depends upon the consequences of making the wrong decision.

24 24 of 27 Sample Representativeness Are we honestly addressing Heterogeneity (sampling uncertainty)? n Now we are finally able to address this issue, defensibly and affordably ! n Use cheaper analytical technologies that allow you to increase sample density n Use real-time measurements at the site of the sample to support real-time decision-making n IF willing to honestly balance analytical uncertainty against overall data uncertainty

25 25 of 27 The TRIAD Approach Systematic Planning Dynamic Work Plans Real-Time Measurement Technologies

26 26 of 27 Unifying Concept for TRIAD: Managing Uncertainty n Systematic planning –Identify decision goals w/ tolerable overall uncertainty –Identify major uncertainties (cause decision error) –Identify strategy to manage each major uncertainty n Use the Field Analytical Method (FAM) and a Dynamic Work Plan (DWP) to effectively manage sampling uncertainty (sample representativeness) n Use various strategies to manage analytical uncertainty when using FAM

27 27 of 27 End of Module 5 Thank you Questions? We will now take a 5-minute Afternoon “Stretch” Break. Please be back in 5 minutes


Download ppt "1 of 27 How Many Samples do I Need? Part 2 Presenter: Sebastian Tindall (60 minutes) (5 minute “stretch” break) DQO Training Course Day 1 Module 5."

Similar presentations


Ads by Google