Probability distribution functions Normal distribution Lognormal distribution Mean, median and mode Tails Extreme value distributions.

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

The Maximum Likelihood Method
Probability distribution functions Normal distribution Lognormal distribution Mean, median and mode Tails Extreme value distributions.
Hydrologic Statistics Reading: Chapter 11, Sections 12-1 and 12-2 of Applied Hydrology 04/04/2006.
Hydrologic Statistics
Discrete Probability Distributions
F (x) - PDF Expectation Operator x (fracture stress) Nomenclature/Preliminaries.
The Simple Linear Regression Model: Specification and Estimation
Time-Dependent Failure Models
458 Fitting models to data – II (The Basics of Maximum Likelihood Estimation) Fish 458, Lecture 9.
1 Engineering Computation Part 6. 2 Probability density function.
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
Continuous Random Variables Chap. 12. COMP 5340/6340 Continuous Random Variables2 Preamble Continuous probability distribution are not related to specific.
Statistics and Probability Theory Prof. Dr. Michael Havbro Faber
Continuous Random Variables and Probability Distributions
Probability Review (many slides from Octavia Camps)
VARIABILITY. PREVIEW PREVIEW Figure 4.1 the statistical mode for defining abnormal behavior. The distribution of behavior scores for the entire population.
Lecture II-2: Probability Review
Standard error of estimate & Confidence interval.
Flood Frequency Analysis
Common Probability Distributions in Finance. The Normal Distribution The normal distribution is a continuous, bell-shaped distribution that is completely.
Probability distribution functions
Traffic Modeling.
Chapter 3 Descriptive Measures
Bayesian inference review Objective –estimate unknown parameter  based on observations y. Result is given by probability distribution. Bayesian inference.
Exam I review Understanding the meaning of the terminology we use. Quick calculations that indicate understanding of the basis of methods. Many of the.
FREQUENCY ANALYSIS.
1 Statistical Distribution Fitting Dr. Jason Merrick.
7-1 Introduction The field of statistical inference consists of those methods used to make decisions or to draw conclusions about a population. These.
“ Building Strong “ Delivering Integrated, Sustainable, Water Resources Solutions Statistics 101 Robert C. Patev NAD Regional Technical Specialist (978)
Lab 3b: Distribution of the mean
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Chapter 5.6 From DeGroot & Schervish. Uniform Distribution.
1 Lecture 16: Point Estimation Concepts and Methods Devore, Ch
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
Chapter 12 Continuous Random Variables and their Probability Distributions.
The Simple Linear Regression Model: Specification and Estimation ECON 4550 Econometrics Memorial University of Newfoundland Adapted from Vera Tabakova’s.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
Probability distributions
5.5.3 Means and Variances for Linear Combinations
Machine Learning 5. Parametric Methods.
Sampling Theory and Some Important Sampling Distributions.
Describing Samples Based on Chapter 3 of Gotelli & Ellison (2004) and Chapter 4 of D. Heath (1995). An Introduction to Experimental Design and Statistics.
Chapter 5 Joint Probability Distributions and Random Samples  Jointly Distributed Random Variables.2 - Expected Values, Covariance, and Correlation.3.
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
ELEC 303, Koushanfar, Fall’09 ELEC 303 – Random Signals Lecture 8 – Continuous Random Variables: PDF and CDFs Farinaz Koushanfar ECE Dept., Rice University.
G. Cowan Lectures on Statistical Data Analysis Lecture 9 page 1 Statistical Data Analysis: Lecture 9 1Probability, Bayes’ theorem 2Random variables and.
R. Kass/W03 P416 Lecture 5 l Suppose we are trying to measure the true value of some quantity (x T ). u We make repeated measurements of this quantity.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
Statistics and probability Dr. Khaled Ismael Almghari Phone No:
STA302/1001 week 11 Regression Models - Introduction In regression models, two types of variables that are studied:  A dependent variable, Y, also called.
Introduction to Probability - III John Rundle Econophysics PHYS 250
4-1 Continuous Random Variables 4-2 Probability Distributions and Probability Density Functions Figure 4-1 Density function of a loading on a long,
Statistical Modelling
Probability Theory and Parameter Estimation I
Ch3: Model Building through Regression
The Maximum Likelihood Method
Flood Frequency Analysis
Summary descriptive statistics: means and standard deviations:
Some probability density functions (pdfs)
Statistics & Flood Frequency Chapter 3 – Part 1
The Maximum Likelihood Method
Physics 114: Exam 2 Review Material from Weeks 7-11
Basic Estimation Techniques
Hydrologic Statistics
Extreme Value Theory: Part I
Summary descriptive statistics: means and standard deviations:
HYDROLOGY Lecture 12 Probability
Presentation transcript:

Probability distribution functions Normal distribution Lognormal distribution Mean, median and mode Tails Extreme value distributions

Normal (Gaussian) distribution Probability density function (PDF) What does figure tell about the cumulative distribution function (CDF)?

More on the normal distribution

Estimating mean and standard deviation Given a sample from a normally distributed variable, the sample mean is the best linear unbiased estimator (BLUE) of the true mean. For the variance the equation gives the best unbiased estimator, but the square root is not an unbiased estimate of the standard deviation For example, for a sample of 5 from a standard normal distribution, the standard deviation will be estimated on average as 0.94 (with standard deviation of 0.34)

Lognormal distribution

Mean, mode and median

Light and heavy tails

Fitting distribution to data Usually fit CDF to minimize maximum distance (Kolmogorov-Smirnoff test) Generated 20 points from N(3,1 2 ). Normal fit N(3.48, ) Lognormal lnN(1.24,0.26 ) Almost same mean and standard deviation.

Extreme value distributions No matter what distribution you sample from, the mean of the sample tends to be normally distributed as sample size increases (what mean and standard deviation?) Similarly, distributions of the minimum (or maximum) of samples belong to other distributions. Even though there are infinite number of distributions, there are only three extreme value distribution. – Type I (Gumbel) derived from normal. – Type II (Frechet) e.g. maximum daily rainfall – Type III (Weibull) weakest link failure

Maximum of normal samples With normal distribution, maximum of sample is more narrowly distributed than original distribution. Max of 10 standard normal samples mean, 0.59 standard deviation Max of 100 standard normal samples mean, 0.43 standard deviation

Gumbel distribution. Mean, median, mode and variance

Weibull distribution Probability distribution Its log has Gumbel dist. Used to describe distribution of strength or fatigue life in brittle materials. If it describes time to failure, then k<1 indicates that failure rate decreases with time, k=1 indicates constant rate, k>1 indicates increasing rate. Can add 3 rd parameter by replacing x by x-c.

Exercises Find how many samples of normally distributed numbers you need in order to estimate the mean and standard deviation with an error that will be less than 10% of the true standard deviation most of the time. Both the lognormal and Weibull distributions are used to model strength. Find how closely you can approximate data generated from a standard lognormal distribution by fitting it with Weibull. Take the introduction and preamble of the US Declaration of Independence, and fit the distribution of word lengths using the K-S criterion. What distribution fits best? Compare the graphs of the CDFs. Compare to a more contemporary text.