CWWUC Presentation April 8, 2009 Application of the Integrated Impact Analysis Tool.

Slides:



Advertisements
Similar presentations
Kin 304 Regression Linear Regression Least Sum of Squares
Advertisements

An Introduction to Multivariate Analysis
Statistics Measures of Regression and Prediction Intervals.
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
1 Multivariate Statistics ESM 206, 5/17/05. 2 WHAT IS MULTIVARIATE STATISTICS? A collection of techniques to help us understand patterns in and make predictions.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 14 Using Multivariate Design and Analysis.
General linear model and regression analysis. The general linear model: Y = μ + σ 2 (Age) + σ 2 (Sex) + σ 2 (Genotype) + σ 2 (Measurement) + σ 2 (Condition)
Lecture 25 Regression diagnostics for the multiple linear regression model Dealing with influential observations for multiple linear regression Interaction.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Regression and Correlation
Chapter Eighteen MEASURES OF ASSOCIATION
Multiple Regression MARE 250 Dr. Jason Turner.
Lecture 24: Thurs., April 8th
Probability & Statistics for Engineers & Scientists, by Walpole, Myers, Myers & Ye ~ Chapter 11 Notes Class notes for ISE 201 San Jose State University.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation.
Simple Linear Regression Analysis
Copyright ©2011 Pearson Education 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft Excel 6 th Global Edition.
Metric (Family Level) Standard Best Value (95 th or 5 th percentile) Worst Possible Value Expected Response to Degradation Total Taxa180 EPT Taxa120 %EPT91.90.
Factor Analysis Psy 524 Ainsworth.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 11 Regression.
Graph the linear function What is the domain and the range of f?
Chapter 11 Simple Regression
Macroinvertebrate Bioassessment Tools Aquatic Life/Nutrient Workgroup August 11, 2008.
Site Classification for Re-calibration of the Alabama Index of Stream Macroinvertebrate Condition Ben Jessup and Jen Stamp Tetra Tech, Inc. SWPBA November.
Chapter 12 Examining Relationships in Quantitative Research Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
1 Multivariate Linear Regression Models Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute of.
Examining Relationships in Quantitative Research
Stat 112 Notes 16 Today: –Outliers and influential points in multiple regression (Chapter 6.7)
Descriptive Statistics vs. Factor Analysis Descriptive statistics will inform on the prevalence of a phenomenon, among a given population, captured by.
Regression Lesson 11. The General Linear Model n Relationship b/n predictor & outcome variables form straight line l Correlation, regression, t-tests,
Chapter 16 Data Analysis: Testing for Associations.
Stream macroinvertebrate responses to landscape variables; an evaluation of rapid bioassessment techniques using a statistical modeling approach. Declan.
The Pearson Product-Moment Correlation Coefficient.
Rapid Bioassessment Protocols for low gradient streams) for species richness, composition and pollution tolerance, as well as a composite benthic macroinvertebrate.
Principal Components Analysis. Principal Components Analysis (PCA) A multivariate technique with the central aim of reducing the dimensionality of a multivariate.
 Relationship between education level, income, and length of time out of school  Our new regression equation: is the predicted value of the dependent.
Correlation.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 12 Multiple.
Lesson 14 - R Chapter 14 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.
Principal Component Analysis (PCA)
D/RS 1013 Data Screening/Cleaning/ Preparation for Analyses.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 10 th Edition.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
Tutorial I: Missing Value Analysis
Development of Nutrient Water Quality Standards for Rivers and Streams in Ohio Ohio EPA ORSANCO, October 20, 2009 George Elmaraghy, P.E., Chief.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Univariate Point Estimation Confidence Interval Estimation Bivariate: Linear Regression Multivariate: Multiple Regression 1 Chapter 4: Statistical Approaches.
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
Chapter 14 EXPLORATORY FACTOR ANALYSIS. Exploratory Factor Analysis  Statistical technique for dealing with multiple variables  Many variables are reduced.
Introduction Many problems in Engineering, Management, Health Sciences and other Sciences involve exploring the relationships between two or more variables.
Correlation & Simple Linear Regression Chung-Yi Li, PhD Dept. of Public Health, College of Med. NCKU 1.
Chapter 15 Multiple Regression Model Building
Regression Analysis AGEC 784.
Kin 304 Regression Linear Regression Least Sum of Squares
BPK 304W Regression Linear Regression Least Sum of Squares
BPK 304W Correlation.
CHAPTER 3 Describing Relationships
I271b Quantitative Methods
Descriptive Statistics vs. Factor Analysis
Correlation and Regression
Regression Forecasting and Model Building
Chad Larson - WDOE Daniel Marshalonis - EPA Region 10
Regression Analysis.
Presentation transcript:

CWWUC Presentation April 8, 2009 Application of the Integrated Impact Analysis Tool

Status of Division Work Site Classes  Set up Fuzzy Set Site Classes based on elevation, slope, and precipitation.  5 Site Classes 1. High-elevation, cold water sites 2. Mid-elevation, semi-cold, high gradient, drier 3. Low/mid-elevation, transitional temperature, low gradient, drier 4. Low-elevation, warm water, low gradient, dry 5. Mid-elevation, semi-cold, low gradient, moist

Status of Division Work Multi-Metric Index (MMI)  Based on benthic macroinvertebrates  MMI score between 0 and 100  High index (Site Classes 1-3) –Clinger Taxa –EPT Taxa –Predator/Shredder Taxa –HBI –Total Taxa

Status of Division Work  Low index (Site Classes 4-5) –Insect Taxa –Non-Insect Percent of Taxa –Coleoptera Percent –Predator/Shredder Taxa –Sensitive Plains Families –Clinger/Sprawler percent  Still figuring out thresholds

Status of Division Work Observed over Expected Taxa (O/E)  Presented some information on the Multivariate Predictive Model that would be used to develop the Expected portion of this ratio.  April 2009 Workgroup Meeting should address this in more detail.  Basic idea – the closer the ratio is to 0, the more impaired the stream.

Unanswered Questions  What happens once they have the MMI and O/E?  If they find that a segment (or portion of a segment) is below the MMI threshold or the O/E is less than 1, what does that mean to a discharger?  How can a person figure out what is causing the impact to aquatic life?

Integrated Impact Analysis  WERF project (partly funded by EPA) to distinguish the relative impact of chemistry and habitat on aquatic life.  Teamed with GEI (Chadwick) and Risk Sciences (Tim Moore).  Has been used in Santa Ana UAA, Arid West work, and is currently being implemented by the Southeastern Wisconsin Watershed Trust.

Integrated Impact Analysis Chemical Physical Biological + =

IIA – The Gist  Uses existing statistical methods –Principle Components Analysis –All Possible Regressions –Chi-Square Automatic Interaction Detection (CHAID)  Results identify key stressors and their relative impact.  So... if the state identifies a stream segment or site as impacted (through O/E or MMI), IIA can help determine what is causing that impact.

Basic Steps of IIA MODEL Apply Basic Statistics Identify Key Stressor and Response Variables Rank Variables According to Relative Impact Repeat Cycle if More Variables Are Needed Fit Equation to Describe Interactions Between Stressors and Response

Step 1 Apply Basic Statistics  Perform basic descriptive statistics and develop graphics  Normalize data as needed – develop new descriptive statistics and new graphics  Compile a correlation matrix

Water Chemistry Basic Statistics

Habitat Basic Statistics

Looking for correlations (≥0.6). Correlated variables can act as surrogates for each other.

Step 2 Identify Key Stressor and Response Variables  Identify key independent stressor variables and relationships between variables using: –Principle Components Analysis (PCA) –All Possible Regressions –Chi-square Automatic Interaction Detection (CHAID)  Iterative process that systematically removes variables from the larger pool of variables.

Principal Components Analysis  Useful in determining how variables relate to one another – how they move in space.  If variables generally move together, one variable can act as a surrogate for the other(s).

Principal Components Analysis Look at values with an absolute value ≥0.6. A component in PCA is a group of variables that move in the same direction. Generally, the variable with the highest score is identified as the surrogate. Rerun to limit number of stressors to 6 and responses to 2 or 3. strongest

Principal Components Analysis

All Possible Regressions  Combines one response with many stressor variables into models using all combinations of the stressor variables.  Look at all combinations to see what combinations explain the greatest amount of variance with the lowest error.  Look for lowest variable count that explains the most variance.

All Possible Regressions

Chi-Squared Automatic Interaction Detection (CHAID)  Identifies both linear and non-linear relationships between variables.  Non-parametric technique, so data should not be transformed.  Robust to missing data points.

NH3 range Macroinvertebrates P/Channel Range Macroinvertebrates

Step 3 Rank Variables According to Relative Impact  Develop matrix of key independent stressor variables and relationships found in Step 2  Repeat Steps 2 and 3 until the two most influential independent stressor variables are identified for each dependent response variable

Rank Variables According to Relative Impact  Develop a matrix of response variables and their corresponding “important” stressor variables for each of the 3 analyses.  Look for stressors identified by multiple analyses and sort by number of analyses in common.  To help with sorting, refer back to the analyses and how strong the relationships are between the stressor and response variables.

Matrix of Ranked Stressors for Each Response

Step 4 Fit Equation to Describe Interactions Between Stressors and Response  Use three-dimensional modeling program to identify specific non-linear relationship transformations

Fit Equation

Step 5 Repeat Cycle if More Variables Are Needed  Enter the residuals calculated for the response variable into a new column in dataset.  The residuals are the remainder of the of the response variable after the variability caused by the 2 stressor variables is removed.  Repeat steps 2-4 to identify the next 2 important stressor variables.

Residuals From Equation Fitting

IIA Gives You...  An ordered list of stressors that are causing an impact on the response variables.  A model to help predict how the response variables will change based on a change in the stressor.  An understanding of whether habitat is playing a role in limiting the response variable.  A means to make sense of what the O/E and MMI metrics are showing and how that could relate to your discharge.

Questions