Regression Analysis of Phosphorous Loading Data for the Maumee River, Water Years 2003-2005 Charlie Piette David Dolan Pete Richards Department of Natural.

Slides:



Advertisements
Similar presentations
Multiple Regression and Model Building
Advertisements

Kin 304 Regression Linear Regression Least Sum of Squares
1 1 Chapter 5: Multiple Regression 5.1 Fitting a Multiple Regression Model 5.2 Fitting a Multiple Regression Model with Interactions 5.3 Generating and.
Chapter 12 Simple Linear Regression
BA 275 Quantitative Business Methods
Simple Linear Regression. Start by exploring the data Construct a scatterplot  Does a linear relationship between variables exist?  Is the relationship.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Prediction, Correlation, and Lack of Fit in Regression (§11. 4, 11
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 13 Nonlinear and Multiple Regression.
Objectives (BPS chapter 24)
Chapter 13 Multiple Regression
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
Chapter 12 Multiple Regression
1 Chapter 3 Multiple Linear Regression Ray-Bing Chen Institute of Statistics National University of Kaohsiung.
© 2000 Prentice-Hall, Inc. Chap Multiple Regression Models.
Multiple Regression Models. The Multiple Regression Model The relationship between one dependent & two or more independent variables is a linear function.
BCOR 1020 Business Statistics Lecture 28 – May 1, 2008.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Lecture 24: Thurs., April 8th
Simple Linear Regression Analysis
Ch. 14: The Multiple Regression Model building
This Week Continue with linear regression Begin multiple regression –Le 8.2 –C & S 9:A-E Handout: Class examples and assignment 3.
Statistics 350 Lecture 17. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
Simple Linear Regression Analysis
Correlation & Regression
Active Learning Lecture Slides
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Inference for regression - Simple linear regression
Chapter 13: Inference in Regression
Simple Linear Regression Models
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
1 Dr. Jerrell T. Stracener EMIS 7370 STAT 5340 Probability and Statistics for Scientists and Engineers Department of Engineering Management, Information.
Water Quality Monitoring and Constituent Load Estimation in the Kings River near Berryville, Arkansas 2009 Brian E. Haggard Arkansas Water Resources Center.
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Applications of Regression to Water Quality Analysis Unite 5: Module 18, Lecture 1.
Chapter 14 Inference for Regression AP Statistics 14.1 – Inference about the Model 14.2 – Predictions and Conditions.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Lesson Multiple Regression Models. Objectives Obtain the correlation matrix Use technology to find a multiple regression equation Interpret the.
Exam 1 Review. Data referenced throughout review The Federal Trade Commission annually rates varieties of domestic cigarettes according to their tar,
Regression Analysis Relationship with one independent variable.
Regression Analysis Part C Confidence Intervals and Hypothesis Testing
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
Lecture 10 Chapter 23. Inference for regression. Objectives (PSLS Chapter 23) Inference for regression (NHST Regression Inference Award)[B level award]
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Model Building and Model Diagnostics Chapter 15.
Business Statistics for Managerial Decision Farideh Dehkordi-Vakil.
Lesson 14 - R Chapter 14 Review. Objectives Summarize the chapter Define the vocabulary used Complete all objectives Successfully answer any of the review.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
1 1 Slide The Simple Linear Regression Model n Simple Linear Regression Model y =  0 +  1 x +  n Simple Linear Regression Equation E( y ) =  0 + 
1 Probability and Statistics Confidence Intervals.
Brian Haggard Director Arkansas Water Resources Center Funding provided by ANRC through Beaver Water District.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
Video Conference 1 AS 2013/2012 Chapters 10 – Correlation and Regression 15 December am – 11 am Puan Hasmawati Binti Hassan
Bivariate Regression. Bivariate Regression analyzes the relationship between two variables. Bivariate Regression analyzes the relationship between two.
Stats Methods at IC Lecture 3: Regression.
Regression Analysis AGEC 784.
Basic Estimation Techniques
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Relationship with one independent variable
BIVARIATE REGRESSION AND CORRELATION
Basic Estimation Techniques
Statistical Methods For Engineers
Chapter 12 Inference on the Least-squares Regression Line; ANOVA
Simple Linear Regression
Relationship with one independent variable
You and your lab partner independently determine the concentration of Ca2+ in a water sample. The results are: You Lab partner 350 ppm
Chapter 14 Inference for Regression
Presentation transcript:

Regression Analysis of Phosphorous Loading Data for the Maumee River, Water Years Charlie Piette David Dolan Pete Richards Department of Natural and Applied Sciences University of Wisconsin Green Bay National Center for Water Quality Research, Heidelberg College

Phosphorus and the Great Lakes Water Quality Agreement Goal for reduction Initial targets Secondary targets

Maumee River Watershed 5

Maumee River Facts Size Contribution

Data Source USGS NCWQR Used data from WY

Purpose of Our Research ECOFORE 2006: Hypoxia Assessment in Lake Erie Estimate TP loads to Lake Erie using data from Heidelberg College and effluent data from permitted point sources Constructing a daily time series of phosphorus loading (Maumee River)

Problems in Constructing a Time Series for the Maumee Missing data All three years missing some data No major precipitation events were missed in water years 2003 and ……..

Water Year 2005 Data Overview Missing an important time period December 2004-January 2005, moving the lab Very significant period of precipitation 32.8 inches of snow in January ’05 Third wettest January on record Warm temps- 52˚F on New Year’s Day

Importance of WY 2005 Fifth largest peak flow in 73 year data record- 94,100 cfs Orders of magnitude larger than average flows for the same time period in WY ’03 and ’04 3,437cfs and 10,039 cfs respectively Need to model the missing data to complete the time series

Objectives Use statistical analysis to develop a model for predicting missing T.P. for the Maumee in WY 2005 Calculate an annual load for WY 2005 using measured and predicted data Compare estimated regression load to estimated load from another method Assess effectiveness of final regression model on other Lake Erie Tributaries

Reconstructing the Missing Concentration Data Multiple regression w/ SAS Producing an equation that can be used to model for the missing phosphorus concentrations

Basic Regression Equation Y=ß о + ß 1 X 1 + ß 2 X 2 + ……… ß p X p + E The terms…..

Basic Assumption of Regression Linear relationship between dependent and independent variables

Basic Assumptions: Continued Normal distribution of residuals

So, the data is suitable for regression analysis. What makes for a strong model? Hypothesis for model significance Hypothesis for parameter estimate significance P-values- <.05 R 2 value M.S.E.

Beale’s Equation

Beale’s Ratio Estimator Daily load for sampled days Mean daily load Flow-adjusted mean daily load Bias-corrected X 365 = annual load estimate DateFlowP_Concentration 10/1/ /2/ /3/ /4/ /5/ /6/ /7/ /8/ /9/ /10/ /11/

Beale Stratified Ratio Estimator Stratification- flow or time More accurate estimation “It’s an art!”

Beale Vs. Regression Both a means to the same end- annual load estimate Both relying on one main assumption- a linear relationship Big difference- Beale is not good for reconstructing a time series

Regression Analysis

Data Analysis Step 1 Transforming the data to log space

Regression Model 1 Log P-Conc = b 0 + b 1 (Log Flow) + error Most simple model Historical use

Regression Model 2 Log P-Conc = b 0 + b 1 (Log Flow) + b 2 (Season) + error Addition of second independent variable “Season” Dual Slope Analysis

Purpose of adding “Season”

Regression Model 3 Log P-Conc = b 0 + b 1 (Log Flow) + b 2 (Season) + b 3 (Season Effect) + error Addition of “Season Effect” Interaction variable

Purpose of adding “Season Effect” Interaction b/w two independent variables Slope adjustment Change in log TP concentration per unit flow during the winter season

Results of Regression Models for the Maumee, WY 2005

Selecting the Best Model for WY 2005 Model 1 Results InterceptLog Flow Overall ModelMean Square Estimate R² SignificanceError P-Value<.0001

Selecting the Best Model for WY 2005 Model 2 Results InterceptLog FlowSeason Overall ModelMean Square Estimate R²SignificanceError P-Value< <.0001

Selecting the Best Model for WY 2005 Model 3 Results InterceptLog FlowSeasonSeas. Effect Estimate R²Mod. SigMSE P-Vals.< <.0001

Results of Regression Model 3 for the Maumee, WY

Model 3: Viable Option? Looked like a good choice for WY 2005 Ran with WY data

WaterInterceptLog FlowSeasonSeason Effect Mod. YearEstimate R²Sig P-values< < P-values< P-values< <.0001

Estimating an Annual TP Load Using Regression Results

Estimating an Annual Load With Regression Used Model 3 Need to bring the log TP concentrations out of log-space (back-transforming) Back-transforming bias and estimated concentrations

Bias Correction To make up for the low bias…. Total Phosphorus Concentration (ppm) = Exp[LogPredicted P Concentration + (Mean Square Error *.5)] Estimating annual TP load from both measured and estimated data Couple conversion factors……Annual Estimated Load in metric tons/year

What did We Find???

Major Purpose of Our Research The main objective- developing a daily time series for accurately estimating an annual load for the Maumee in 2005

How did the Regression Estimates Compare to the Beale Estimate? 95% Confidence Intervals WaterRegression EstimateBeale Estimate95% Confidence Year(Metric Ton/Year) Interval

The Discrepancy

Problem with Regression Under-prediction Low-flow bias

Future Directions Improving the regression model Other independent variables More years

Thank You Any Questions?