Probability Distributions Learn the common parametric probability distributions that are used to represent the randomness of physical phenomena Learn the physical mechanisms that underlie common probability distributions Apply physical knowledge about the mechanisms that lead to randomness in a process to select an appropriate probability distribution to fit Distribution selection is more than selecting the best fit empirical curve Reading Kottegoda and Rosso Chapter 4.
Kottegoda and Rosso page 201
Kottegoda and Rosso page 228
From http://www.weibull.com/
Carl Friedrich Gauß, immortalized
Normal Distribution Standardized Cumulative Use Normal tables or normcdf, norminv
Linear combinations of normally distributed variables Regenerative Central Limit Theorem for n→ The normal distribution is the natural distribution to characterize any process where the randomness is due to the additive effect of many independent inputs, without domination by one specific input.
Log Normal Distribution X is log normally distributed if Y=ln(X) is normally distributed Derived distribution Parameters y, y, pertain to the ln of the variable.
Lognormal and Normal distribution relationships used for simulation Moments Coefficient of Variation Estimation of parameters from moments
Log normal distribution and central limit theorem Parameters , , pertain to the ln of the variable, the Y’s. The log normal distribution is the natural distribution to characterize any process where the randomness is due to the multiplicative effect of many independent inputs, without domination by one specific input. (If one Xi can be 0 it makes the product 0, dominating the resulting distribution)
Nonparametric Density Estimation Learn the kernel method for nonparametric density estimation Learn the difference between fixed kernel window width and variable kernel window width estimates Learn some of the approaches to kernel window width estimation Reference Least squares cross validation Reading Silverman Chapter 1, Chapter 2 (2.1-2.6, 2.9), Chapter 3 (3.1-3.4)
Parametric vs Nonparametric Assume data from a known distribution (e.g. Gamma, Normal) and estimate parameters Nonparametric Assume that the data has a probability density function but not of a specific known form Let data speak for themselves Exploratory data analysis
Which method conveys the information best to you ? Cumulative Density Probability Density Equation Probability Plot
San Juan River Annual Streamflow 1906-1983 Histogram, bin width 150 Flow KAF Number
San Juan River Annual Streamflow 1906-1983 Density Flow KAF
Kernel Density Estimator
Kernel Density Estimate (KDE) Place “kernels” at each data point Sum up the kernels Width of kernel determines level of smoothing Determining how to choose the width of the kernel is an important topic! Narrow kernel Medium kernel Wide kernel
KDE window width sensitivity
Variable Kernel
San Juan River Annual Streamflow 1906-1983, Variable Kernel Density Estimate Flow KAF
Some Common Kernels
Density estimate with different kernels with the same bandwidth
Least Squares Cross Validation Density Mo(h) Flow KAF Window width h