Statistics and Nutrient Levels Julie Stahli Metro Wastewater Reclamation District March 2010.

Slides:



Advertisements
Similar presentations
Probability models- the Normal especially.
Advertisements

Assumptions underlying regression analysis
Objectives 10.1 Simple linear regression
BA 275 Quantitative Business Methods
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
AP Stats Review. Assume that the probability that a baseball player will get a hit in any one at-bat is Give an expression for the probability.
Copyright © 2009 Pearson Education, Inc. Chapter 29 Multiple Regression.
Inference for Regression
Correlation and regression Dr. Ghada Abo-Zaid
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Regression Analysis Notes. What is a simple linear relation? When one variable is associated with another variable in such a way that two numbers completely.
Ch11 Curve Fitting Dr. Deshi Ye
1 Quantifying Opinion about a Logistic Regression using Interactive Graphics Paul Garthwaite The Open University Joint work with Shafeeqah Al-Awadhi.
Class 15: Tuesday, Nov. 2 Multiple Regression (Chapter 11, Moore and McCabe).
1 BA 275 Quantitative Business Methods Residual Analysis Multiple Linear Regression Adjusted R-squared Prediction Dummy Variables Agenda.
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Lecture 23 Multiple Regression (Sections )
Lecture 16 – Thurs, Oct. 30 Inference for Regression (Sections ): –Hypothesis Tests and Confidence Intervals for Intercept and Slope –Confidence.
Business Statistics - QBM117 Interval estimation for the slope and y-intercept Hypothesis tests for regression.
Introduction to Probability and Statistics Linear Regression and Correlation.
Oscar Go, Areti Manola, Jyh-Ming Shoung and Stan Altan
Introduction to Regression Analysis, Chapter 13,
Simple Linear Regression Analysis
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
Forecasting Revenue: An Example of Regression Model Building Setting: Possibly a large set of predictor variables used to predict future quarterly revenues.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Quantitative Business Analysis for Decision Making Multiple Linear RegressionAnalysis.
Copyright © 2011 Pearson Education, Inc. Multiple Regression Chapter 23.
Inference for regression - Simple linear regression
STA291 Statistical Methods Lecture 27. Inference for Regression.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Regression Analysis (2)
Linear Regression: Test scores vs. HW scores. Positive/Negative Correlation.
Inference for Regression
Forecasting Revenue: An Example of Regression Model Building Setting: Possibly a large set of predictor variables used to predict future quarterly revenues.
AP Stats Review.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Regression Analysis. Scatter plots Regression analysis requires interval and ratio-level data. To see if your data fits the models of regression, it is.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan Multiple Regression SECTIONS 9.2, 10.1, 10.2 Multiple explanatory variables.
Correlation & Regression
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Chapter 14 Inference for Regression © 2011 Pearson Education, Inc. 1 Business Statistics: A First Course.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
Univariate Linear Regression Problem Model: Y=  0 +  1 X+  Test: H 0 : β 1 =0. Alternative: H 1 : β 1 >0. The distribution of Y is normal under both.
Lack of Fit (LOF) Test A formal F test for checking whether a specific type of regression function adequately fits the data.
Text Exercise 12.2 (a) (b) (c) Construct the completed ANOVA table below. Answer this part by indicating what the f test statistic value is, what the appropriate.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Stat 112 Notes 5 Today: –Chapter 3.7 (Cautions in interpreting regression results) –Normal Quantile Plots –Chapter 3.6 (Fitting a linear time trend to.
Chapter 14: Inference for Regression. A brief review of chapter 4... (Regression Analysis: Exploring Association BetweenVariables )  Bi-variate data.
Correlation & Regression Analysis
I271B QUANTITATIVE METHODS Regression and Diagnostics.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 14-1 Chapter 14 Introduction to Multiple Regression Basic Business Statistics 10 th Edition.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Introduction to Multiple Regression Lecture 11. The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & 2 or more.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 14-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Exam 4 Review. Correlation In a study examining the relation of math ability to belief that math ability was innate, belief was considered the predictor.
Inference for Regression
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Beginners statistics Assoc Prof Terry Haines. 5 simple steps 1.Understand the type of measurement you are dealing with 2.Understand the type of question.
What is normal anyway?! Disclaimer: I am not an expert!
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
CHAPTER 26: Inference for Regression
Multiple Regression Models
1.8 Critical Analysis: Outliers
Product moment correlation
Regression Forecasting and Model Building
Presentation transcript:

Statistics and Nutrient Levels Julie Stahli Metro Wastewater Reclamation District March 2010

MMI Version 3 lacks a lot of the problems that previous versions contained. Statistically, the model distinguishes reference sites from stressed sites. The level chosen as the level of impaired is statistically generous.

MMI

MMI Issues (based on Metro Data) Comments should focus on implementation Sites with low scores should be put on the M&E list and multiple samples should be taken. Implementation should be based on median of all MMI scores over the course of five years. We have seen substantial variation at a single site. Big rivers need alternative indexes when sites are found in the “yellow zone”.

Sample Data TypeYear31 st 64 th Up CC88 th 120 th – H124 th 160thFt LuptonCty Rd 18 Kick Kick Kick Kick Kick Kick Kick Kick Med Blue indicates attainment, red indicates impaired

Big Rivers

Lets assume that the MMI is valid and can be related to nutrient concentrations to form TP and TN limits….

Regression analysis Lots of types of regression analysis linear Multi-linear regression logistic regression etc

Regression analysis Each type has restrictions on how it is used based on the assumptions we can make about the data. Evaluate the assumptions to determine whether or not the model is valid or appropriate. Example assumptions The relationship is linear in nature. The errors are independent and normally distributed. The model is the best available for describing the relationship.

Regression analysis After the model is developed, we evaluate the model to see how useful it is Tools for evaluation: R 2 value Goodness of fit for data Confidence intervals Residual mean-squared error Ideally, a model on which decisions are based are both valid and useful.

Quantile Regression Breaks up the data into quantiles and forms an individual regression for each one. Used with the wedge plot.

Quantile Regression Typically, no assumptions are necessary. Lots of decisions are made during the modeling process. Typically, goodness of fit is evaluated in two ways: examining whether or not the slope is different from zero The P value for the model (is it significant) Results are not predictive. Can’t really determine whether or not the model is valid or useful.

Quantile Regression

Used to explore patterns in data that cannot be seen using more traditional methods. Designed to be used in exactly the same way that CDPHE is using it – to illuminate an effect that may be muddied by lots of other variables. There is no way to evaluate whether or not the model is right or wrong, good or bad and should be considered as part of a weight of evidence approach.

Quantile Regression What else do we use? We don’t believe there is a better tool to solve the nutrient problem. Especially for Phosphorus.

What??!? The uncertainty in the model needs to be acknowledged. Using this method should not set precedent for future usage where other options may be available. In repeating their methods, we found that choices were made that are not scientifically valid.

CDPHE Review

Quantile Regression Slope

Problems Using log scale. Using only the warm data to define the slope. Using the 90 th percentile without biological reason (85% is more commonly used). Nitrogen is difficult because of lack of data. Unless otherwise noted, all limits would apply to Warm streams and rivers.

Log Scale Issues Normal regression Using a log-scale QuantileSlope Does Not Cross Zero Good P-value Final Value mg/L YesNoN/A Yes Yes Yes Yes Yes Yes QuantileSlope Does Not Cross Zero Good P-value Final Value mg/L YesNoN/A Yes Yes Yes Yes Yes NoYesN/A

Using only warm data Using all data Using warm data ***Both using log scales to make comparable QuantileSlope Does Not Cross Zero Good P-value Final Value mg/L YesNoN/A Yes Yes Yes Yes Yes Yes QuantileSlope Does Not Cross Zero Good P- value Final Value mg/L No N/A No N/A YesNoN/A Yes ** did not runN/A Yes Yes 0.155

Cold data QuantileSlope Does Not Cross Zero Good P- value Final Value mg/L No N/A No N/A No N/A No N/A Yes No N/A No N/A Warm – Yes All – Yes

Recommendation QuantileSlope Does Not Cross Zero Good P- value Final Value mg/L YesNoN/A Yes Yes Yes Yes Yes NoYesN/A

Nitrogen

Valid models with the normal data if cold and warm are looked at together as recommended for Phosphorus. Strongest scientific model (at 0.9 quantile) gives TN value of 2.06 mg/L. Currently there is a paucity of data and I think we could argue for more information after the implementation of ammonia and nitrite criteria.

Caveats I am not an expert in quantile regression and may have missed some qualifying assumptions. All of the relationships we are discussing are tenuous at best, but may qualify as best current scientific opinion.

Thank you…. Questions???