MIXING MODELS AND END-MEMBER MIXING ANALYSIS: PRINCIPLES AND EXAMPLES Matt Miller and Nick Sisolak Slides Contributed by: Mark Williams and Fengjing Liu.

Slides:



Advertisements
Similar presentations
Hydrology Rainfall - Runoff Modeling (I)
Advertisements

Component Analysis (Review)
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
Dimension reduction (1)
A Short Introduction to Curve Fitting and Regression by Brad Morantz
Visual Recognition Tutorial
Lecture 7: Principal component analysis (PCA)
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
Eigenvalues and eigenvectors
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
Principal Component Analysis
Chapter 10 Simple Regression.
Class 5: Thurs., Sep. 23 Example of using regression to make predictions and understand the likely errors in the predictions: salaries of teachers and.
Hydrologic Mixing Models Ken Hill Andrew McFadden.
The Terms that You Have to Know! Basis, Linear independent, Orthogonal Column space, Row space, Rank Linear combination Linear transformation Inner product.
Chapter 11 Multiple Regression.
Mark Williams, CU-Boulder Using isotopes to identify source waters: mixing models.
Tables, Figures, and Equations
Correlation and Regression Analysis
Linear Regression/Correlation
Multivariate Data and Matrix Algebra Review BMTRY 726 Spring 2012.
Separate multivariate observations
Correlation & Regression
Descriptive Methods in Regression and Correlation
Skills for October Rounds
CHAPTER 26 Discriminant Analysis From: McCune, B. & J. B. Grace Analysis of Ecological Communities. MjM Software Design, Gleneden Beach, Oregon.
1 Chapter 3: Examining Relationships 3.1Scatterplots 3.2Correlation 3.3Least-Squares Regression.
Ch4 Describing Relationships Between Variables. Pressure.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
4.2 An Introduction to Matrices Algebra 2. Learning Targets I can create a matrix and name it using its dimensions I can perform scalar multiplication.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Ch4 Describing Relationships Between Variables. Section 4.1: Fitting a Line by Least Squares Often we want to fit a straight line to data. For example.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Multivariate Statistics Matrix Algebra I W. M. van der Veld University of Amsterdam.
Examining Relationships in Quantitative Research
Surface Water Hydrology: Infiltration – Green and Ampt Method
GG 313 Geological Data Analysis Lecture 13 Solution of Simultaneous Equations October 4, 2005.
Descriptive Statistics vs. Factor Analysis Descriptive statistics will inform on the prevalence of a phenomenon, among a given population, captured by.
MIXING MODELS AND END-MEMBER MIXING ANALYSIS: PRINCIPLES AND EXAMPLES Mark Williams and Fengjing Liu Department of Geography and Institute of Arctic and.
U.S. Department of the Interior U.S. Geological Survey End-Member Mixing Analysis Applied to the Karstic Madison Aquifer Using Water Chemistry in the Southern.
MIXING MODELS AND END-MEMBER MIXING ANALYSIS: PRINCIPLES AND EXAMPLES
Source waters and flow paths in an alpine catchment, Colorado, Front Range, United States Fengjing Liu, Mark W. Williams, and Nel Caine 2004.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
1 Sample Geometry and Random Sampling Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute of Networking.
Principal Component Analysis (PCA). Data Reduction summarization of data with many (p) variables by a smaller set of (k) derived (synthetic, composite)
Principal Components Analysis. Principal Components Analysis (PCA) A multivariate technique with the central aim of reducing the dimensionality of a multivariate.
END-MEMBER MIXING ANALYSIS: PRINCIPLES AND EXAMPLES Mark Williams and Fengjing Liu Department of Geography and Institute of Arctic and Alpine Research,
Correlation & Regression Analysis
Source waters, flowpaths, and solute flux in mountain catchments
ANOVA, Regression and Multiple Regression March
1 DETERMINATION OF SOURCES AND FLOWPATHS USING ISOTOPIC AND CHEMICAL TRACERS, GREEN LAKES VALLEY, ROCKY MOUNTAINS Fengjing Liu and Mark Williams Department.
Principal Component Analysis
Université d’Ottawa / University of Ottawa 2001 Bio 8100s Applied Multivariate Biostatistics L11.1 Lecture 11: Canonical correlation analysis (CANCOR)
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Regression Chapter 5 January 24 – Part II.
MathematicalMarketing Slide 5.1 OLS Chapter 5: Ordinary Least Square Regression We will be discussing  The Linear Regression Model  Estimation of the.
END-MEMBER MIXING ANALYSIS: PRINCIPLES AND EXAMPLES Fengjing Liu University of California, Merced.
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
The simple linear regression model and parameter estimation
Chemical Hydrograph Separation
Stats Club Marnie Brennan
CHAPTER 29: Multiple Regression*
Linear Regression/Correlation
Using isotopes to identify source waters: mixing models
Principal Component Analysis
Product moment correlation
Principal Component Analysis
Presentation transcript:

MIXING MODELS AND END-MEMBER MIXING ANALYSIS: PRINCIPLES AND EXAMPLES Matt Miller and Nick Sisolak Slides Contributed by: Mark Williams and Fengjing Liu

OUTLINE  OVERVIEW OF MIXING MODEL  OVERVIEW OF END-MEMBER MIXING ANALYSIS (EMMA) -- PRINCIPAL COMPONENT ANALYSIS (PCA) -- STEPS TO PERFORM EMMA  APPLICATIONS OF MIXING MODEL AND EMMA -- Panola Mountain Research Watershed (Burns et al., 2001) -- Green Lakes Valley (Liu et al., 2004)

PART 1: OVERVIEW OF MIXING MODEL  Definition of Hydrologic Flowpaths  2-Component Mixing Model  3-Component Mixing Model  Generalization of Mixing Model  Geometrical Definition of Mixing Model  Assumptions of Mixing Model

HYDROLOGIC FLOWPATHS

MIXING MODEL: 2 COMPONENTS One Conservative Tracer Mass Balance Equations for Water and Tracer

MIXING MODEL: 3 COMPONENTS (Using Specific Discharge) Two Conservative Tracers Mass Balance Equations for Water and Tracers Simultaneous Equations Solutions Q - Discharge C - Tracer Concentration Subscripts - # Components Superscripts - # Tracers

MIXING MODEL: 3 COMPONENTS (Using Discharge Fractions) Two Conservative Tracers Mass Balance Equations for Water and Tracers Simultaneous Equations Solutions f - Discharge Fraction C - Tracer Concentration Subscripts - # Components Superscripts - # Tracers

MIXING MODEL: Generalization Using Matrices One tracer for 2 components and two tracers for 3 components N tracers for N+1 components? -- Yes However, solutions would be too difficult for more than 3 components So, matrix operation is necessary Simultaneous Equations Where Solutions Note: C x -1 is the inverse matrix of C x This procedure can be generalized to N tracers for N+1 components

MIXING MODEL: Geometrical Perspective For a 2-tracer 3-component model, for instance, the mixing subspaces are defined by two tracers. If plotted, the 3 components should be vertices of a triangle and all streamflow samples should be bound by the triangle. If not well bound, either tracers are not conservative or components are not well characterized. f x can be sought geometrically, but more difficult than algebraically.

ASSUMPTIONS FOR MIXING MODEL  Tracers are conservative (no chemical reactions);  All components have significantly different concentrations for at least one tracer;  Tracer concentrations in all components are temporally constant or their variations are known;  Tracer concentrations in all components are spatially constant or treated as different components;  Unmeasured components have same tracer concentrations or don’t contribute significantly.

A QUESTION TO THINK ABOUT  What if we have the number of conservative tracers much more than the number of components we seek for, say, 6 tracers for 3 components?  For this case, it is called over-determined situation  The solution to this case is EMMA, which follows the same principle as mixing models.

PART 2: EMMA AND PCA  EMMA Notation  Over-Determined Situation  Orthogonal Projection  Notation of Mixing Spaces  Steps to Perform EMMA

DEFINITION OF END-MEMBER  For EMMA, we use end-members instead of components to describe water contributing to stream from various compartments and geographic areas  End-members are components that have more extreme solute concentrations than streamflow [Christophersen and Hooper, 1992]

EMMA NOTATION (1)  Hydrograph separations using multiple tracers simultaneously;  Use more tracers than necessary to test consistency of tracers;  Typically use solutes as tracers Modified from Hooper, 2001

EMMA NOTATION (2)  Measure p solutes; p= # of solutes  Assume that there are k linearly independent end- members (k < p) k = # of end-members  B, matrix of end-members, (k  p); each row b j (1  p)  X, matrix of streamflow samples, (n  p); each row x i (1  p)

PROBLEM STATEMENT  Find a vector f i of mixing proportions such that Stream chemistry can be defined as a function of end-member contribution  Note that this equation is the same as generalized one for mixing model; the re- symbolizing is for simplification and consistency with EMMA references

SOLUTION FOR OVER- DETERMINED EQUATIONS  Must choose objective function: minimize sum of squared error  Solution is normal equation [Christophersen et al., 1990; Hooper et al., 1990]:  Constraint: all proportions must sum to 1  Solutions may be > 1 or < 0; this issue will be elaborated later

ORTHOGONAL PROJECTIONS  Following the normal equation, the predicted streamflow chemistry is [Christophersen and Hooper, 1992]:  Geometrically, this is the orthogonal projection of x i into the subspace defined by B, the end- members

This slide is from Hooper, 2001

OUR GOALS ACHIEVED SO FAR? We measure chemistry of streamflow and end-members. Then, we can derive fractions of end-members contributing to streamflow using equations above. So, our goals achieved? Not quite, because we also want to test end-members as well as mixing model. We need to define the geometry of the solute “cloud” (S-space) and project end-members into U-space! How? Use PCA to determine number and orientation of axes in U-space. Modified from Hooper, 2001

EMMA PROCEDURES Identification of Conservative Tracers - Bivariate solute-solute plots to screen data; PCA Performance - Derive eigenvalues and eigenvectors; Orthogonal Projection - Use eigenvectors to project chemistry of streamflow and end-members; Screen End-Members - Calculate Euclidean distance of end- members between their original values and S-space projections; Hydrograph Separation - Use orthogonal projections and generalized equations for mixing model to get solutions! Validation of Mixing Model - Predict streamflow chemistry using results of hydrograph separation and original end-member concentrations.

STEP 1 - MIXING DIAGRAMS Look familiar? This is the same diagram used for geometrical definition of mixing model (components changed to end-members); Generate all plots for all pair-wise combinations of tracers; The simple rule to identify conservative tracers is to see if streamflow samples can be bound by a polygon formed by potential end-members or scatter around a line defined by two end-members; Be aware of outliers and curvature which may indicate chemical reactions!

STEP 2 - PCA PERFORMANCE For most cases, if not all, we should use correlation matrix rather than covariance matrix of conservative solutes in streamflow to derive eigenvalues and eigenvectors; Why? This treats each variable equally important and unitless; How? Standardize the original data set using a routine software or minus mean and then divided by standard deviation; To make sure if you are doing right, the mean should be zero and variance should be 1 after standardized!

APPLICATION OF EIGENVALUES Eigenvalues can be used to infer the number of end-members that should be used in EMMA. How? Sum up all eigenvalues; Calculate percentage of each eigenvalue in the total eigenvalue; The percentage should decrease from PCA component 1 to p (remember p is the number of solutes used in PCA); How many eigenvalues can be added up to 90% (somewhat subjective! No objective criteria for this!)? Let this number be m, m = # of PCA components retained (explains 90% of variance) (m +1) = # of end-members we use in EMMA.

STEP 3 - ORTHOGONAL PROJECTION X - Standardized data set of streamflow, (n  p); V - Eigenvectors from PCA, (m  p); Remember only the first m eigenvectors to be used here! Use the same equation above; Now X represents a vector (1  p) for each end-member; Remember X here should be standardized by subtracting streamflow mean and dividing by streamflow standard deviation! Project End-Members

STEP 4 - SCREEN END-MEMEBRS Plot a scatter plot for streamflow samples and end-members using the first and second PCA projections; Eligible end-members should be vertices of a polygon (a line if m = 1, a triangle if m = 2, and a quadrilateral if m = 3) and should bind streamflow samples in a convex sense; Calculate the Euclidean distance between original chemistry and projections for each solute using the equations below: Algebraically Geometrically j represent each solute and b j is the original solute value Those steps should lead to identification of eligible end-members!

STEP 4 - SCREEN END-MEMEBRS

STEP 5 - HYDROGRAPH SEPARATION Use the retained PCA projections from streamflow and end- members to derive flowpath solutions! So, mathematically, this is the same as a general mixing model rather than the over-determined situation. Solve for Q’s

STEP 6 - PREDICTION OF STREAMFLOW CHEMISTRY Multiply results of hydrograph separation (usually fractions) by original solute concentrations of end-members to reproduce streamflow chemistry for conservative solutes; Comparison of the prediction with the observation can lead to a test of mixing model.

PROBLEM ON OUTLIERS PCA is very sensitive to outliers; If any outliers are found in the mixing diagrams of PCA projections, check if there are physical reasons; Outliers have negative or > 1 fractions; See next slide how to resolve outliers using a geometrical approach for an end-member model.

RESOLVING OUTLIERS A, B, and C are 3 end- members; D is an outlier of streamflow sample; E is the projected point of D to line AB; a, b, d, x, and y represent distance of two points; We will use Pythagorean theorem to resolve it. The basic rule is to force f c = 0, f A and f B are calculated below [Liu et al., 2003]:

SUMMARY:EMMA  IDENTIFY MULTIPLE SOURCE WATERS AND FLOWPATHS  QUANTITATIVELY SELECTS NUMBER AND TYPE OF END-MEMBERS  QUANTITATIVELY EVALUATE RESULTS  IDENTIFY MISSING END-MEMBERS

Burns et al. (2001)

Objectives  Use EMMA to derive three-component model during 2 rain storms to answer: 1) What is the relative importance of each end-member to stream runoff? 2)How do runoff processes vary with storm size, rain intensity, and antecedent wetness conditions? 3)Are EMMA modeling results consistent with physical hydrologic measurements?

Site Description  Study Site: PMRW  10ha  3 assumed End-members: 1)Outcrop 2) Hillslope 3)Riparian Area

Field Methods  Chemical Analysis 1) Stream water 2) Runoff from outcrop 3) Subsurface stormflow from hillslope trench 4) Riparian ground water  Physical Meaurements 1) Stream runoff rate 2) Rainfall amount/intensity 3) Runoff rate from hillslope trench 4) Riparian water table levels

EMMA Modeling 1) Five solutes used as tracers 2) Data Standardized into correlation matrix 3) PCA 4) Concentrations of end-members projected into U-space 5) Examine extent to which end-members bound stream water observations in U-space. 6) Solute concentrations predicted by EMMA compared with measured concentrations during 2 storms

Storm Characteristics

Mixing Diagrams

EMMA Results  First 2 principal components explained 93% of variability (m = 2) 3 end-members

End-member contributions

End Member Contributions cont. Storm 1Storm 2

Test of Mixing Model  Using fraction of Flow from EMMA (fo, fh, and fr) with measured end-member concentrations calculate predicted stream flow concentrations.  Linear Regression of Predicted vs. Measured Concentrations… r^2 =

Predicted Outcrop Runoff vs Measured Rainfall Intensity *Good Fit *Initial lag due to travel time of rain water *Outcrop runoff continues due to draining of soil

Predicted Hillslope Runoff and Measured Trench Outflow *Hillslope trench located halfway up hillslope *Lower soil moisture before March Storm *Subsurface storm flow likely provided recharge to riparian ground water and dry hillslope soils.

Predicted Riparian Groundwater Runoff and Observed Riparian Water Table Levels *Outcrop runoff and hillslope subsurface flow mix in riparian aquifer during transport to the stream.