Comparability of Biological Assessments Derived from Predictive Models of Increasing Geographic Scope Peter Ode California Department of Fish and Game.

Slides:



Advertisements
Similar presentations
O/E: a standardized way to make site-specific assessments of biological condition Chuck Hawkins Western Center for Monitoring and Assessment of Freshwater.
Advertisements

ANCOVA Workings of ANOVA & ANCOVA ANCOVA, Semi-Partial correlations, statistical control Using model plotting to think about ANCOVA & Statistical control.
Literature Review Kathryn Westerman Oliver Smith Enrique Hernandez Megan Fowler.
THE DISTRIBUTION OF SAMPLE MEANS How samples can tell us about populations.
We processed six samples in triplicate using 11 different array platforms at one or two laboratories. we obtained measures of array signal variability.
Designing Effective Monitoring Programs for Fish Population Response to Habitat Restoration John Sweka USFWS – Northeast Fishery Center Lamar, PA.
Statistics 100 Lecture Set 7. Chapters 13 and 14 in this lecture set Please read these, you are responsible for all material Will be doing chapters
Statistical Issues in Research Planning and Evaluation
Module 36: Correlation Pitfalls Effect Size and Correlations Larger sample sizes require a smaller correlation coefficient to reach statistical significance.
Comparable Biological Assessments from Different Methods and Analyses David B. Herbst 1 and Erik L. Silldorff 2 1 Sierra Nevada Aquatic Research Laboratory,
Down-scaling climate data for microclimate models and forecasts Securing the Conservation of biodiversity across Administrative Levels and spatial, temporal.
Simple Regression correlation vs. prediction research prediction and relationship strength interpreting regression formulas –quantitative vs. binary predictor.
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 6: Correlation.
Stat 217 – Day 25 Regression. Last Time - ANOVA When?  Comparing 2 or means (one categorical and one quantitative variable) Research question  Null.
Today Concepts underlying inferential statistics
Using Statistics in Research Psych 231: Research Methods in Psychology.
Chapter 7 Correlational Research Gay, Mills, and Airasian
Richard M. Jacobs, OSA, Ph.D.
So are how the computer determines the size of the intercept and the slope respectively in an OLS regression The OLS equations give a nice, clear intuitive.
Chapter 9: Correlational Research. Chapter 9. Correlational Research Chapter Objectives  Distinguish between positive and negative bivariate correlations,
Community Ordination and Gamma Diversity Techniques James A. Danoff-Burg Dept. Ecol., Evol., & Envir. Biol. Columbia University.
The Water Withdrawal Assessment Process Context within regional water policy discussions Context within regional water policy discussions –Aquatic ecosystems.
THIS IS REALLY IMPORTANT STUFF. YOU MUST LOOK AT THE MARKS ALLOCATED – is it a quick answer (1 mark) or does it need developing to a certain degree? IF.
Chapter 4 Hypothesis Testing, Power, and Control: A Review of the Basics.
CENTRE FOR INNOVATION, RESEARCH AND COMPETENCE IN THE LEARNING ECONOMY Session 2: Basic techniques for innovation data analysis. Part I: Statistical inferences.
Theory testing Part of what differentiates science from non-science is the process of theory testing. When a theory has been articulated carefully, it.
Covariance and correlation
Macroinvertebrate Bioassessment Tools Aquatic Life/Nutrient Workgroup August 11, 2008.
Agronomic Spatial Variability and Resolution What is it? How do we describe it? What does it imply for precision management?
CHAPTER 16: Inference in Practice. Chapter 16 Concepts 2  Conditions for Inference in Practice  Cautions About Confidence Intervals  Cautions About.
Developing O/E (Observed-to-Expected) Models for Assessing Biological Condition Chuck Hawkins Western Center for Monitoring and Assessment of Freshwater.
Understanding the Variability of Your Data: Dependent Variable Two "Sources" of Variability in DV (Response Variable) –Independent (Predictor/Explanatory)
Agronomic Spatial Variability and Resolution What is it? How do we describe it? What does it imply for precision management?
Slide 1 Estimating Performance Below the National Level Applying Simulation Methods to TIMSS Fourth Annual IES Research Conference Dan Sherman, Ph.D. American.
POSC 202A: Lecture 12/10 Announcements: “Lab” Tomorrow; Final ed out tomorrow or Friday. I will make it due Wed, 5pm. Aren’t I tender? Lecture: Substantive.
June 19, 2007 GRIDDED MOS STARTS WITH POINT (STATION) MOS STARTS WITH POINT (STATION) MOS –Essentially the same MOS that is in text bulletins –Number and.
Mercury Concentrations in Stream Fish Throughout 12 Western States in the USA Alan Herlihy and Robert Hughes Dept. of Fisheries & Wildlife, Oregon State.
EPA’s Bioassessment Performance and Comparability “Guidance”
Steps 3 & 4: Evaluating types of evidence for the Truckee River case study.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
Objectives: 1.Enhance the data archive for these estuaries with remotely sensed and time-series information 2.Exploit detailed knowledge of ecosystem structure.
1 Do traits of freshwater species predict vulnerability to climate change? Bruce Chessman Climate Change Science, NSW DECCW.
Objectives 2.1Scatterplots  Scatterplots  Explanatory and response variables  Interpreting scatterplots  Outliers Adapted from authors’ slides © 2012.
Simon Power Managing Consultant John Rae Director Understanding Communities Through PayCheck
Chapter 16 Data Analysis: Testing for Associations.
Experiences in assessing deposition model uncertainty and the consequences for policy application Rognvald I Smith Centre for Ecology and Hydrology, Edinburgh.
CWWUC Presentation April 8, 2009 Application of the Integrated Impact Analysis Tool.
Fish O/E Modeling Aquatic Life/Nutrient Workgroup August 11, 2008.
1 Defining Least-Impacted Reference Condition for the National Wadeable Streams Assessment Alan Alan Herlihy (Oregon State University), John Stoddard (U.S.
Introducing Communication Research 2e © 2014 SAGE Publications Chapter Seven Generalizing From Research Results: Inferential Statistics.
Correlation They go together like salt and pepper… like oil and vinegar… like bread and butter… etc.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Seasonal and elevational variation of surface water  18 O and  2 H in the Willamette River basin J. Renée Brooks 1, Parker J. Wigington 1, Jr., Carol.
Statistical Techniques
Detecting Ecological Effects of Development in the Wappingers and Fishkill Watersheds Karin Limburg, Karen Stainbrook, Bongghi Hong SUNY College of Environmental.
Agronomic Spatial Variability and Resolution What is it? How do we describe it? What does it imply for precision management?
WINTER OROGRAPHIC- PRECIPITATION PATTERNS IN THE SIERRA NEVADA— CLIMATIC UNDERPINNINGS & HYDROLOGIC CONSEQUENCES Mike Dettinger 1, Kelly Redmond 2, & Dan.
Böhmer, J. Birk, S., Schöll, F. Intercalibration of large river assessment methods.
Using Regional Models to Assess the Relative Effects of Stressors Lester L. Yuan National Center for Environmental Assessment U.S. Environmental Protection.
Data Analysis. Qualitative vs. Quantitative Data collection methods can be roughly divided into two groups. It is essential to understand the difference.
L Berkley Davis Copyright 2009 MER301: Engineering Reliability Lecture 8 1 MER301: Engineering Reliability LECTURE 8: Chapter 4: Statistical Inference,
THE USE OF BENTHIC MACROINVERTEBRATE TRAITS TO ASSESS CLIMATE CHANGE RESPONSES AND VULNERABILITIES Anna Hamilton (Tetra Tech), Britta Bierwagen (US EPA),
Predicting effects of climate change on stream ecosystems in the conterminous United States: results from a pilot study in California Charles P. Hawkins.
Aquatic, Watershed, and Earth Resources
POSC 202A: Lecture Lecture: Substantive Significance, Relationship between Variables 1.
Review for Exam 2 Some important themes from Chapters 6-9
Inferential Statistics
CHAPTER 16: Inference in Practice
More on Maxent Env. Variable importance:
Presentation transcript:

Comparability of Biological Assessments Derived from Predictive Models of Increasing Geographic Scope Peter Ode California Department of Fish and Game Charles Hawkins Utah State University Alan Herlihy Oregon State University John Van Sickle US EPA-ORD Daren Carlisle USGS-NAWQA Lester Yuan US EPA-ORD

Now that we have predictive models that cover large geographic areas like WSA and EMAP’s models … …can we use these large models equally well for local assessments? APPROACH: compare the scores derived from larger models with scores from recently created CA models, using set of test sites from CA This potential is very attractive, but larger models have some limitations that may restrict their value for regional assessments… QUESTION: What are the consequences of using models derived at different spatial scales… – for regional assessments? – for site-specific assessments?

Model 1. National Wadeable Streams Assessment (WSA -3 Models) One western model applies to CA (519 sites)

Models 2a, 2b. Environmental Monitoring and Assessment Program, Western Pilot (WEMAP, 629 sites-209 in CA) 5 separate models (represented by colors) gave better performance than a single western model (4 apply to California) Two WEMAP versions: 1.WEMAP (null): no division of reference sites within the 5 models 2. WEMAP (full): Some models with reference sites divided into clusters

Model 3. California Models (206 sites) 3 models performed better than a single statewide model

WSA Spatial Relationships of Model Reference Sites As we expand the spatial scale of the models, we’re expanding the geographic area from which we combine reference sites into the clusters used to predict “E”. The larger models contain a smaller proportion of CA reference sites and thus, are increasingly influenced by the characteristics of sites outside the state. Increasing geographic range WSA WEMAP CA

Scale Example: Wadeable Streams Assessment (WSA) Great variability in geographic range of clusters…. Western Model = 30 clusters (24 in CA)

National Wadeable Streams Assessment (WSA, western model) Most clusters are widespread…

National Wadeable Streams Assessment (WSA –western model) … but a few clusters are restricted to CA

California Models Model 3 Cold and Mesic Model 1 Cool and Wet Model 2 Dry and Warm Clusters occur at much smaller spatial scales than in the larger models… …tends to increase similarity in taxonomic composition and predictor variables

Predictor variables have a lot of overlap, but each sub-model has unique combinations California Predictors Group 1 Watershed Area Longitude Latitude Temperature Group 2 Longitude Precipitation Group 3 Watershed Area Temperature WEMAP Predictors Group 2 Watershed Area Longitude Elevation Precipitation All other CA groups use null models (no predictors) WSA Predictors Watershed Area Longitude Day of Year Min. Temperature Elevation Precipitation Slope

… do these factors affect performance? 1. Models vary in both in the specific predictors and in the geographic range of the predictor gradients…. 2. Predictors that work for larger geographic areas may miss or under-represent regionally important gradients… 3. Variation in predictor association within a taxonomic group (e.g., species within a genus) that occurs across the geographic range of a model can also influence model accuracy and precision... Predictors are a key to understanding model differences

The Test Dataset: 128 CA non-reference sites (none used in model development) 107 WEMAP Probabilistic Sites + 21 USFS Sites ANALYSES: Use test set to compare larger models to CA models Assume CA models represent the “truth” Score sites with all 4 models Compare precision, sensitivity, accuracy and bias

“how sensitive are the models?” (standard deviations) CaliforniaWEMAP (null)WEMAP (full)WSA Group Group Group Western0.198 Group 20.17Group Group Group 30.16Group Group 3*0.200 Group Group 4*0.220 Group Group 5*0.170 California models are more precise than either WEMAP or WSA WEMAP (full) slightly more precise than WEMAP (null) or WSA

r 2 =0.364 r 2 =0.388r 2 =0.338 “do the models give sites the same score?” (correlations with CA Models) Weak correlations, some evidence of bias toward higher scores under WSA model

r 2 =0.595 r 2 =0.625 Correlations between the two larger models Stronger correlations, but prominent bias in WSA model

“do differences in model scores vary with key environmental gradients ” (expect a flat line if score differences don’t vary with gradient) Difference between larger models and CA models tends to vary with increasing slope and % fastwater p=0.019 p=0.002 p=0.03 p=0.01 p=0.034p=0.012

Model Bias vs. Environmental Gradients (large models) … are both large models failing to account for these gradients or is CA getting it wrong? WSA model tends to score sites ~.15 units higher than WEMAP models, but no bias with gradients…

O/E scores for reference sites versus environmental gradients under the 4 models (expect a flat line if models account for gradients) CA models clearly not affected by these gradients p=0.010 p=0.005 p=0.0473p= ns Both WSA and WEMAP (null, but not full) had significant relationships with slope and % fastwater that were not accounted for by the models

Impairment Decisions: Impaired ( I ) or Not Impaired ( NI ) Use an impairment threshold ( mean – 2sd for all models ) to compare number of false negatives and false positives relative to the expectation of the CA models CA Model 1 (n=59) CA Model 2 (n=44) CA Model 3 (n=25) TOTALS (n=128) ALL INII I I CA (“truth”) I I NI NI WSA I I NI NI WEMAP (null) I I NI NI WEMAP (full) I I NI NI WSA generally very forgiving (misses 2/3 of impaired sites, few false +) WEMAP misses 1/3 of impaired sites, but ~ same number of false + as false -

Summary of Results 1.WEMAP models were similar -Full models had slightly better correlations with other models -Slightly less bias in impairment decisions 2.Measures of Precision/Sensitivity: -Larger models appear to have similar precision ( weak correlations with CA models, but OK with each other) -Both models have higher sd values than CA (takes more species loss to detect impairment) -Assessments: WSA model strongly underestimates impairment, WEMAP (null) slightly underestimates impairment

Summary of Results (continued) 3. Measures of Accuracy/ Bias: -WSA tends to overestimate site quality relative to all others (sometimes by a lot) -WSA and WEMAP tend to increasingly underscore sites with increasing slope and increasing fast-water habitats relative to CA models

Does it matter? Site Specific Monitoring… where both accuracy and precision are important, this is pretty strong evidence that we still need local models … WSA and WEMAP models are not appropriate because they get it wrong too often Regional Assessments- Accuracy (lack of bias) is more important than precision… (we can make up for low precision by looking at large numbers of samples) - WEMAP and WSA may be interchangeable (with a correction factor?) - Whether larger models are appropriate for smaller assessments (e.g., state condition assessments) will depend on the strength of the bias with environmental gradients “ it depends on what question you’re asking”

Choice of models should reflect the degree to which the region of interest is represented by the reference sites used to build the models For example, CA models will probably work well for assessments in CA’s north coast (where reference sites were plentiful), but we may need new models for regions where reference sites are sparse (like the Central Valley)

Acknowledgements Funding and Data Sources USFS, Joseph Furnish - CA models EPA-Office of Research and Development, WEMAP and WSA Field Crews: EMAP-West USFS/ Utah State University CA Aquatic Bioassessment Laboratory (ABL, Shawn McBride, Michael Dawson, Jennifer York) Invertebrate Taxonomists: EcoAnalysts Utah State Bug Lab ABL (Doug Post, Dan Pickard, Joe Slusark, Brady Richards)