Probability distribution functions Normal distribution Lognormal distribution Mean, median and mode Tails Extreme value distributions.

Slides:



Advertisements
Similar presentations
Chapter 4. Elements of Statistics # brief introduction to some concepts of statistics # descriptive statistics inductive statistics(statistical inference)
Advertisements

Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
NORMAL OR GAUSSIAN DISTRIBUTION Chapter 5. General Normal Distribution Two parameter distribution with a pdf given by:
Frequency Analysis Reading: Applied Hydrology Sections 12-2 to 12-6.
Estimation of Means and Proportions
Hydrologic Statistics Reading: Chapter 11, Sections 12-1 and 12-2 of Applied Hydrology 04/04/2006.
How do we generate the statistics of a function of a random variable? – Why is the method called “Monte Carlo?” How do we use the uniform random number.
Hydrologic Statistics
MEGN 537 – Probabilistic Biomechanics Ch.4 – Common Probability Distributions Anthony J Petrella, PhD.
Probability distribution functions Normal distribution Lognormal distribution Mean, median and mode Tails Extreme value distributions.
F (x) - PDF Expectation Operator x (fracture stress) Nomenclature/Preliminaries.
Descriptive Statistics Statistical Notation Measures of Central Tendency Measures of Variability Estimating Population Values.
Calculus Review Texas A&M University Dept. of Statistics.
Statistics and Probability Theory Prof. Dr. Michael Havbro Faber
Continuous Random Variables and Probability Distributions
Confidence Intervals Confidence Interval for a Mean
VARIABILITY. PREVIEW PREVIEW Figure 4.1 the statistical mode for defining abnormal behavior. The distribution of behavior scores for the entire population.
8-1 Introduction In the previous chapter we illustrated how a parameter can be estimated from sample data. However, it is important to understand how.
Chapter 5 – 1 Chapter 5: Measures of Variability The Importance of Measuring Variability The Range IQR (Inter-Quartile Range) Variance Standard Deviation.
Lecture II-2: Probability Review
Statistical Inferences Jake Blanchard Spring 2010 Uncertainty Analysis for Engineers1.
Flood Frequency Analysis
Use of Quantile Functions in Data Analysis. In general, Quantile Functions (sometimes referred to as Inverse Density Functions or Percent Point Functions)
Common Probability Distributions in Finance. The Normal Distribution The normal distribution is a continuous, bell-shaped distribution that is completely.
Probability distribution functions
Numerical Descriptive Techniques
WFM 5201: Data Management and Statistical Analysis © Dr. Akm Saiful IslamDr. Akm Saiful Islam WFM 5201: Data Management and Statistical Analysis Akm Saiful.
Chapter 3 Descriptive Measures
Exam I review Understanding the meaning of the terminology we use. Quick calculations that indicate understanding of the basis of methods. Many of the.
Statistics Frequency and Distribution. We interrupt this lecture for the following… Significant digits You should not report numbers with more significant.
Measures of Variability Objective: Students should know what a variance and standard deviation are and for what type of data they typically used.
Advanced Algebra II Normal Distribution. In probability theory, the normal (or Gaussian) distribution is a continuous probability distribution that has.
Normal Distributions Z Transformations Central Limit Theorem Standard Normal Distribution Z Distribution Table Confidence Intervals Levels of Significance.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Chapter 6: Random Errors in Chemical Analysis CHE 321: Quantitative Chemical Analysis Dr. Jerome Williams, Ph.D. Saint Leo University.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
What does Statistics Mean? Descriptive statistics –Number of people –Trends in employment –Data Inferential statistics –Make an inference about a population.
Probability = Relative Frequency. Typical Distribution for a Discrete Variable.
5-1 ANSYS, Inc. Proprietary © 2009 ANSYS, Inc. All rights reserved. May 28, 2009 Inventory # Chapter 5 Six Sigma.
3 common measures of dispersion or variability Range Range Variance Variance Standard Deviation Standard Deviation.
Chapter 12 Continuous Random Variables and their Probability Distributions.
Marina Maksimova Session 7 Comments for “Classical Tests”
Random Variables (1) A random variable (also known as a stochastic variable), x, is a quantity such as strength, size, or weight, that depends upon a.
Statistics Sampling Intervals for a Single Sample Contents, figures, and exercises come from the textbook: Applied Statistics and Probability for Engineers,
Stracener_EMIS 7305/5305_Spr08_ Reliability Data Analysis and Model Selection Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering.
Probability distributions
Sundermeyer MAR 550 Spring Laboratory in Oceanography: Data and Methods MAR550, Spring 2013 Miles A. Sundermeyer Computing Basic Statistics.
Binomial probability estimation Playing chess against a friend you won 3 out of 5 matches and lost 2. Assuming that wins and losses follow the binomial.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
Chapter 20 Statistical Considerations Lecture Slides The McGraw-Hill Companies © 2012.
Lesson 2 - R Review of Chapter 2 Describing Location in a Distribution.
Ex St 801 Statistical Methods Inference about a Single Population Mean (CI)
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
Term Project 2 Reliability estimations by S-S model Team 2 _ 양지훈 오혜정.
MEGN 537 – Probabilistic Biomechanics Ch.4 – Common Probability Distributions Anthony J Petrella, PhD.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Central Bank of Egypt Basic statistics. Central Bank of Egypt 2 Index I.Measures of Central Tendency II.Measures of variability of distribution III.Covariance.
Confidence Intervals and Sample Size
Notes 13.2 Measures of Center & Spread
Flood Frequency Analysis
Summary descriptive statistics: means and standard deviations:
Statistics & Flood Frequency Chapter 3 – Part 1
Laboratory in Oceanography: Data and Methods
MEGN 537 – Probabilistic Biomechanics Ch
Hydrologic Statistics
Summary descriptive statistics: means and standard deviations:
Measures in Variability
Texas A&M University Dept. of Statistics
Continuous Statistical Distributions: A Practical Guide for Detection, Description and Sense Making Unit 3.
HYDROLOGY Lecture 12 Probability
Presentation transcript:

Probability distribution functions Normal distribution Lognormal distribution Mean, median and mode Tails Extreme value distributions

Normal (Gaussian) distribution Normal density function What does the figure tell us about the values of the CDF?

More on the normal distribution P = normcdf(X,MU,SIGMA) returns the cdf of the normal distribution with mean MU and standard deviation SIGMA, evaluated at the values in X. The size of P is the common size of X, MU and SIGMA. normcdf(1)= normcdf(6)= e-010 If X is normally distributed, Y=aX+b is also normally distributed. What would be the mean and standard deviation of Y? Notation

Estimating mean and standard deviation Given a sample from a normally distributed variable, the sample mean is the best linear unbiased estimator of the true mean. For the variance the equation gives the best unbiased estimator, but the square root is not an unbiased estimate of the standard deviation x=randn(5,10000); s=std(x); mean(s) s2=s.^2; mean(s2)

Lognormal distribution If ln(X) has normal distribution X has lognormal distribution. That is, if X is normally distributed exp(X) is lognormally distributed. Notation: Probability distribution function (PDF) Mean and variance

Mean, mode and median Mode (highest point) Median (50% of samples)

Light and heavy tails Normal distribution has light tail. Six sigma is equivalent to (nine nines) safety. Lognormal is heavy tailed m=exp(0.5) m = v=exp(1)*(exp(1)-1) v = sig=sqrt(v) sig = sig6=m+6*sig sig6 = logncdf(sig6,0,1) =0.9963

Fitting distribution to data Typically fit to CDF.

Empirical CDF [F,X] = ecdf(Y) calculates the Kaplan-Meier estimate of the cumulative distribution function (cdf), also known as the empirical cdf. Y is a vector of data values. F is a vector of values of the empirical cdf evaluated at X. [F,X,FLO,FUP] = ecdf(Y) also returns lower and upper confidence bounds for the cdf. These bounds are calculated using Greenwood's formula, and are not simultaneous confidence bounds. ecdf(...) without output arguments produces a plot of the empirical cdf. Use the data cursor to read precise values from the plot.

Example x=lognrnd(0,1,1,20); ecdf(x) hold on x=lognrnd(0,1,1,10000); ecdf(x)

Extreme value distributions No matter what distribution you sample from, the mean of the sample tends to be normally distributed as sample size increases (what mean and standard deviation?) Similarly, distributions of the minimum (or maximum) of samples belong to other distributions. Even though there are infinite number of distributions, there are only three extreme value distribution. – Type I (Gumbel) derived from normal. – Type II (Frechet) e.g. maximum daily rainfall – Type III (Weibull) weakest link failure

Example x=5-0.3*randn(10,1000); minx=min(x); hist(minx); ecdf(minx)

Gumbel distribution PDF and CDF Mean, median, mode and variance

Weibull distribution Probability distribution Used to describe distribution Of strength or fatigue life in brittle materials (weakest link connection) If it describes time to failure, then k<1 indicates that failure rate decreases with time, k=1 indicates constant rate, k>1 indicates increasing rate. Useful for other phenomena like wind speed distribution. Can add 3 rd parameter by replacing x by x-c.