1 Evaluation of Standards data collected from probabilistic sampling programs Eric P. Smith Y. Duan, Z. Li, K. Ye Statistics Dept., Virginia Tech Presented.

Slides:



Advertisements
Similar presentations
Dept of Bioenvironmental Systems Engineering National Taiwan University Lab for Remote Sensing Hydrology and Spatial Modeling STATISTICS Hypotheses Test.
Advertisements

Sampling Design, Spatial Allocation, and Proposed Analyses Don Stevens Department of Statistics Oregon State University.
Autocorrelation Functions and ARIMA Modelling
Multiple Regression.
Designs for estimating variability structure and implications for detecting watershed restoration effectiveness David P. Larsen –Western Ecology Division,
Probabilistic models Haixu Tang School of Informatics.
Lecture XXIII.  In general there are two kinds of hypotheses: one concerns the form of the probability distribution (i.e. is the random variable normally.
Chapter 9 Hypothesis Testing Understandable Statistics Ninth Edition
MPS Research UnitCHEBS Workshop - April Anne Whitehead Medical and Pharmaceutical Statistics Research Unit The University of Reading Sample size.
Chap 9: Testing Hypotheses & Assessing Goodness of Fit Section 9.1: INTRODUCTION In section 8.2, we fitted a Poisson dist’n to counts. This chapter will.
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
Chapter 7: Statistical Applications in Traffic Engineering
Chapter 7(7b): Statistical Applications in Traffic Engineering Chapter objectives: By the end of these chapters the student will be able to (We spend 3.
Chapter Seventeen HYPOTHESIS TESTING
PSY 307 – Statistics for the Behavioral Sciences
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Discrete Event Simulation How to generate RV according to a specified distribution? geometric Poisson etc. Example of a DEVS: repair problem.
Analysis of Variance. Experimental Design u Investigator controls one or more independent variables –Called treatment variables or factors –Contain two.
Statistics Are Fun! Analysis of Variance
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Characterizing Baseline Water Body Conditions. What? Confirm impairments and identify problems Statistical summary Spatial analysis Temporal analysis.
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
BCOR 1020 Business Statistics Lecture 18 – March 20, 2008.
Statistics for the Social Sciences
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
Today Concepts underlying inferential statistics
One Sample  M ean μ, Variance σ 2, Proportion π Two Samples  M eans, Variances, Proportions μ1 vs. μ2 σ12 vs. σ22 π1 vs. π Multiple.
1 Spatial and Spatio-temporal modeling of the abundance of spawning coho salmon on the Oregon coast R Ruben Smith Don L. Stevens Jr. September.
Testing Hypotheses I Lesson 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics n Inferential Statistics.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
STATISTICS HYPOTHESES TEST (I) Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National Taiwan University.
Chapter 8 Introduction to Hypothesis Testing
Means Tests Hypothesis Testing Assumptions Testing (Normality)
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
Chapter 8 Introduction to Hypothesis Testing
A Beginner’s Guide to Bayesian Modelling Peter England, PhD EMB GIRO 2002.
Developing Monitoring Programs to Detect NPS Load Reductions.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Learning Geographical Preferences for Point-of-Interest Recommendation Author(s): Bin Liu Yanjie Fu, Zijun Yao, Hui Xiong [KDD-2013]
Bayesian vs. frequentist inference frequentist: 1) Deductive hypothesis testing of Popper--ruling out alternative explanations Falsification: can prove.
4 Hypothesis & Testing. CHAPTER OUTLINE 4-1 STATISTICAL INFERENCE 4-2 POINT ESTIMATION 4-3 HYPOTHESIS TESTING Statistical Hypotheses Testing.
Statistics for the Social Sciences Psychology 340 Fall 2012 Analysis of Variance (ANOVA)
Research Seminars in IT in Education (MIT6003) Quantitative Educational Research Design 2 Dr Jacky Pow.
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
Univariate Linear Regression Problem Model: Y=  0 +  1 X+  Test: H 0 : β 1 =0. Alternative: H 1 : β 1 >0. The distribution of Y is normal under both.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Three Frameworks for Statistical Analysis. Sample Design Forest, N=6 Field, N=4 Count ant nests per quadrat.
Introduction to Statistical Inference Jianan Hui 10/22/2014.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
"Classical" Inference. Two simple inference scenarios Question 1: Are we in world A or world B?
Hypothesis Testing Errors. Hypothesis Testing Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean.
Bayes Theorem. Prior Probabilities On way to party, you ask “Has Karl already had too many beers?” Your prior probabilities are 20% yes, 80% no.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
Inference ConceptsSlide #1 1-sample Z-test H o :  =  o (where  o = specific value) Statistic: Test Statistic: Assume: –  is known – n is “large” (so.
REVISIONS TO THE FEDERAL WATER QUALITY STANDARDS RULE JILL CSEKITZ, TECHNICAL SPECIALIST TEXAS COMMISSION ON ENVIRONMENTAL QUALITY.
Created by Erin Hodgess, Houston, Texas Section 7-1 & 7-2 Overview and Basics of Hypothesis Testing.
1 of 48 The EPA 7-Step DQO Process Step 6 - Specify Error Tolerances 3:00 PM - 3:30 PM (30 minutes) Presenter: Sebastian Tindall Day 2 DQO Training Course.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Chapter Nine Hypothesis Testing.
STATISTICS HYPOTHESES TEST (I)
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Linear Mixed Models in JMP Pro
CONCEPTS OF HYPOTHESIS TESTING
Statistical Process Control
Mark Rothmann U.S. Food and Drug Administration September 14, 2018
From GLM to HLM Working with Continuous Outcomes
CHAPTER 6 Statistical Inference & Hypothesis Testing
Presentation transcript:

1 Evaluation of Standards data collected from probabilistic sampling programs Eric P. Smith Y. Duan, Z. Li, K. Ye Statistics Dept., Virginia Tech Presented at the Monitoring Science and Technology Symposium, Denver, CO Sept

2 Outline  Background Standards assessments  Single site analysis  Regional analysis Mixed model approach Bayesian approach  Upshot: need models that allow for additional information to be used in assessments

3 320

4 Standards assessment – 303d  Clean Water Act section 303d mandates states in US to monitor and assess condition of streams  Site impaired – list site, start TMDL process (Total Max Daily Loading)  Impaired means site does not meet usability criteria

5 Linkages in 303(d) Set goals and WQS Implement strategies [NPDES, 319, SRF, etc] Conduct monitoring Develop strategies [TMDLs] Yes No Meeting WQS? Apply Antidegradation 303(d) List standards tests Local to regional Sampling plan

6 Impaired sites  Site impaired if standards not met  Standards – defined through numerical criteria Involve frequency, duration, magnitude  –Old method Site impaired if >10% of samples exceed criteria Implicit statistical decision process- error rates

7 Test of impairment

8 Some newer approaches  Frequency: Binomial method Test p<0.1  Magnitude Acceptance sampling by variables Tolerance interval on percentile Test criteria by computing mean for the distribution of measurements and comparing with what is expected given the percentile criteria

9 Problems  Approach is local Limited sampling budget; many stations means small sample sizes per station Impairment may occur over a region Modeling must be relatively simple (hard to account for seasonality, temporal effects) Does not complement current approaches to sampling Site history is ignored Not linked to TMDL analysis (regional) and 305 reporting

10 Probabilistic sampling schemes  Randomly selected sites  Rotating panel surveys Some sites sampled at all possible times Other sites sampled on rotational basis Sites in second group may be randomly selected

11 Making the assessment regional  fixed effects (time, covariates)  random ones (site, location) Y = mean + site Y = mean + time + site General model Y = X + Z = fixed effect model + random effects

12 Regional Mixed Model  Allows for covariates  Allows for a variety of error structures Temporal, spatial, both  Does not require equal sample sizes etc  Allows estimation of means for sites with small sample sizes Improves estimation by borrowing information from other sites

13 Simple model  Testing is based on estimate and variance of mean for site i ( i )  Can also test for regional impairment using distribution of grand mean Random site effect Error term allows for modeling of temporal or spatial correlation

14 Error and stochastic components  Covariance Structure without correlation (one random effect model)  Spatial Covariance Structure Random site effect Error term allows for modeling of temporal or spatial correlation

15 Test based on OLS estimations for each site i  Baseline is the numeric criterion. For DO, we use 5, and for PH 6.  Model based: same idea but mean and variance may be estimated from model

16 Simulation results: different means, variance=1, normal 3 sites-12 obs – 6.28 is the mean for the boundary , , , , Site Site Site Expect.05 Two bad sites Pull third site One badAll good Two border One good

17 Located in SW Virginia Good bass fishing

18 DO data collected at four stations of PHILPOTT RESERVOIR (years 2000, 2001 & 2002)

19 Evaluation based on Do data of PHILPOTT RESERVIOR ( ) 4ASRE046.90Model based4ASRE ASRE n Sample mean Sample variance % excceding Binomial p-value Test statistic critical value conclusion Fail to rejectreject Single site analysis

20 Bayesian approach  a is a random site effect  Error term may include temporal correlation or spatial  Priors on parameters Mean –uniform a is normal (random effect) variance has prior Produces results similar to first approach

21 Alternative: Using historical data  Power prior – Chen, Ibrahim, Shao 2000  Use likelihood from the previous assessment (D 0 ). Basic idea: weight new data by prior data  Power term,, determines influence of historical data.  Modification to work with Winbugs

22 Incorporate Historical Data using Power Priors  Make random, and assign a prior on it. The joint posterior of becomes where D is current data and D 0 is past data  Advantage: Improve the precision of estimates.

23

24 PH data collected at four stations: use past information to build prior

25 Evaluate site impairment based on PH data with power priors Note – log transformation applied to improve normality

26  If multiple historical data sets are available, assign a different for each historical data set. where  Data collected at adjacent stations could be used as “historical” data. Power Priors with Multiple Historical Data Sets

27 DO data collected at four stations of PHILPOTT RESERVOIR (years 2000, 2001 & 2002)

28 Evaluate site impairment based on DO data collected at four stations of PHILPOTT RESERVOIR (years 2000, 2001 & 2002)

29 DO data collected at four stations of MOOMAW RESERVOIR (years 2000 & 2001)

30 Evaluate site impairment based on DO data collected at four stations of MOOMAW RESERVOIR (years 2000 & 2001)

31 Comments  Advantages Greater flexibility in modeling Allows for site history to be included Can include spatial and temporal components Can better connect to TMDL analysis and probabilistic sampling  Disadvantage Requires more commitment to the modeling process Greater emphasis on the distributional assumptions  tml

32 Needs  More applications to evaluate  Temporal/spatial modeling  Evaluation of error rates  Bayesian modeling and null and alternative hypotheses

33 Sponsor RD This talk was not subjected to USEPA review. The conclusion and opinions are soley those of the authors and not the views of the Agency.