Validity and application of some continuous distributions Dr. Md. Monsur Rahman Professor Department of Statistics University of Rajshahi Rajshshi – 6205.

Slides:



Advertisements
Similar presentations
Chapter 7 Title and Outline 1 7 Sampling Distributions and Point Estimation of Parameters 7-1 Point Estimation 7-2 Sampling Distributions and the Central.
Advertisements

Probability and Statistics Mat 271E
Chapter 7 Introduction to Sampling Distributions
DISTRIBUTION FITTING.
Time-Dependent Failure Models
1 Pertemuan 06 Sebaran Normal dan Sampling Matakuliah: >K0614/ >FISIKA Tahun: >2006.
Chapter 6 The Normal Distribution and Other Continuous Distributions
Ch. 6 The Normal Distribution
CHAPTER 6 Statistical Analysis of Experimental Data
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 6-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Chapter 6 The Normal Distribution & Other Continuous Distributions
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 6-1 Chapter 6 The Normal Distribution & Other Continuous Distributions Statistics for.
Chapter 5 Continuous Random Variables and Probability Distributions
Introduction Before… Next…
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
SIMPLE LINEAR REGRESSION
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions Basic Business.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Continuous Random Variables and Probability Distributions.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution Business Statistics: A First Course 5 th.
Chapter 6 The Normal Distribution & Other Continuous Distributions
Chapter 4 Continuous Random Variables and Probability Distributions
SIMPLE LINEAR REGRESSION
1 Statistical Analysis - Graphical Techniques Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
Chap 6-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 6 The Normal Distribution Business Statistics: A First Course 6 th.
Estimation of Statistical Parameters
Chapter 3 Basic Concepts in Statistics and Probability
Continuous Probability Distributions  Continuous Random Variable  A random variable whose space (set of possible values) is an entire interval of numbers.
Traffic Modeling.
Prof. Dr. S. K. Bhattacharjee Department of Statistics University of Rajshahi.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 PROBABILITIES FOR CONTINUOUS RANDOM VARIABLES THE NORMAL DISTRIBUTION CHAPTER 8_B.
Problem: Assume that among diabetics the fasting blood level of glucose is approximately normally distributed with a mean of 105mg per 100ml and an SD.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
An Empirical Likelihood Ratio Based Goodness-of-Fit Test for Two-parameter Weibull Distributions Presented by: Ms. Ratchadaporn Meksena Student ID:
Version 2012 Updated on Copyright © All rights reserved Dong-Sun Lee, Prof., Ph.D. Chemistry, Seoul Women’s University Chapter 6 Random Errors in.
1 Statistical Distribution Fitting Dr. Jason Merrick.
Bayesian Analysis and Applications of A Cure Rate Model.
Learning Theory Reza Shadmehr logistic regression, iterative re-weighted least squares.
1 Lecture 13: Other Distributions: Weibull, Lognormal, Beta; Probability Plots Devore, Ch. 4.5 – 4.6.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Chapter 9 Input Modeling Banks, Carson, Nelson & Nicol Discrete-Event System Simulation.
Pro gradu –thesis Tuija Hevonkorpi.  Basic of survival analysis  Weibull model  Frailty models  Accelerated failure time model  Case study.
1 Statistical Analysis – Descriptive Statistics Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
LECTURE 3: ANALYSIS OF EXPERIMENTAL DATA
© 2002 Prentice-Hall, Inc.Chap 6-1 Basic Business Statistics (8 th Edition) Chapter 6 The Normal Distribution and Other Continuous Distributions.
Lecture 6 Normal Distribution By Aziza Munir. Summary of last lecture Uniform discrete distribution Binomial Distribution Mean and Variance of binomial.
Basic Business Statistics
Sampling and estimation Petter Mostad
Stracener_EMIS 7305/5305_Spr08_ Reliability Data Analysis and Model Selection Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 6-1 The Normal Distribution.
Continuous Probability Distribution By: Dr. Wan Azlinda Binti Wan Mohamed.
© 2002 Prentice-Hall, Inc.Chap 5-1 Statistics for Managers Using Microsoft Excel 3 rd Edition Chapter 5 The Normal Distribution and Sampling Distributions.
Chapter 20 Statistical Considerations Lecture Slides The McGraw-Hill Companies © 2012.
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 6-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions Basic Business.
Chap 6-1 Chapter 6 The Normal Distribution Statistics for Managers.
1 Statistical Analysis - Graphical Techniques Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
THE NORMAL DISTRIBUTION
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution Business Statistics, A First Course 4 th.
Chapter 6 The Normal Distribution and Other Continuous Distributions
MECH 373 Instrumentation and Measurements
STAT 206: Chapter 6 Normal Distribution.
Mathematical Foundations of BME Reza Shadmehr
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 5 Continuous Random Variables and Probability Distributions
The Normal Distribution
Presentation transcript:

Validity and application of some continuous distributions Dr. Md. Monsur Rahman Professor Department of Statistics University of Rajshahi Rajshshi –

Normal distribution The first discoverer of the normal probability function was Abraham De Moivre( ), who, in 1733, derived the distribution as the limiting form of the binomial distribution. But the same formula was derived by Karl Freidrich Gauss( ) in connection with his work in evaluating errors of observation in astronomy. This is why the normal probability is often referred to as Gaussian distribution. 2

X: Normal Variate Density: Standard Normal Variate : 3

Normal distribution 4

Properties of Normal distribution Normal probability curve is symmetrical about the ordinate at Mean, median and mode of the distribution are equal and each of these is The curve has its points of inflection at By a point of infection, we mean a point at which the concavity changes All odd order moments of the distribution about the mean vanish The values of and are 0 and 3 respectively 5

includes about 68.27% of the population includes about 95.45% of the population includes about 99.73% of the population Application: Many biological characteristics conform to a Normal distribution - for example, heights of adult men and women, blood pressures in a healthy population, RBS levels in blood etc. 6

Validity of Normal Distribution for a set of data Many statistical methods can only be used if the observations follow a Normal Distribution. There are several ways of investing whether observations follow a Normal distribution. With a large sample we can inspect a histogram to see whether it looks like a Normal distribution curve. This does not work well with a small sample, and a more reliable method is the normal plot which is described below. 7

8

X: Normal Variate Density: Standard Normal Variate : 9

CDF OF X : F(X) CDF OF Z : P quantile of X : P quantile of Z :, is the solution of 10

Dataset Find empirical CDF values Arrange the data in ascending order as Empirical CDF values are as follows Using normal table obtain the values corresponding to 11

If the given set of observations follow normal distribution, the plot (x, z) should roughly be a straight line and the line passes through the point and has slope. Graphical estimates of and may be obtained. If the data are not come from Normal distribution we will get a curve of some sort. 12

Table 1 : RBS levels(mmol/L) measured in the blood of 20 medical students. Data of Bland(1995), pp. 66 Bland,M.(1995): An Introductions to Medical Statistics, second edition, ELBS with Oxford University Press. 13

14

MLE 15

16

Goodness of Fit Test We use here Kolmogorov-Smirnov (KS) test for the given data KS statistic=max |CDF_FIT- CDF_EMP| For the RBS level data we calculate KS statistic KS(cal)= % tabulated value=0.294 Conclusion: Normal distribution fit is good for the given data 17

Estimated population having RBS within the normal range (3.9 – 7.8mmol/L) is about 51% Estimated population having RBS below the normal range is about 49% Estimated population having RBS above the normal range is 0% Results 18

Empirical CDF values of are as follows: Obtain the values corresponding to Similarly values are obtained corresponding to Two sample case 19

If the first set of data come from normal distribution with mean and variance, then the plot will roughly be linear and passes through the point with slope. If the second set of data come from normal distribution with mean and variance, then the plot will roughly be linear and passes through the point with slope. 20

Both the lines parallel indicating different means but equal variances Both the lines coincide indicating equal means and equal variances Both the lines pass through the same point on the X-axis indicating same means but different variances 21

Table 2 : Burning times (rounded to the nearest tenth of a minute) of two kinds of emergency flares. Data due to Freund and Walpole(1987), pp. 530 Brand A: 14.9,11.3,13.2,16.6,17.0,14.1,15.4, 13.0,16.9 Brand B: 15.2,19.8,14.7,18.3,16.2,21.2,18.9, 12.2,15.3,19.4 Freund, J.E. and Walpole, R.E.(1987): Mathematical Statistics, Fourth edition, Prentice-Hall Inc. 22

Above plot indicates that both the samples come from normal population with unequal means and variances 23

Log-normal distribution In probability theory, a log-normal distribution is aprobability theory probability distributionprobability distribution of a random variable whoserandom variable logarithmlogarithm is normally distributed. If X is a randomnormally distributed variable with a normal distribution, then Y = exp(X) hasexp a log-normal distribution; likewise, if Y is log-normally distributed, then X = log(Y) is normally distributed. It is occasionally referred to as the Galton distribution. 24

Density: Mean = Variance= Median= Mode= 25

Log-normal density function x f(x) 26

Application Certain physiological measurements, such as blood pressure of adult humans (after separation on male/female subpopulations), vitamin D level in blood etc. follow lognormal distribution. Subsequently, reference ranges for measurements inreference ranges healthy individuals are more accurately estimated by assuming a log-normal distribution than by assuming a symmetric distribution about the mean. 27

Table 3 : Vitamin D levels(ng/ml) measured in the blood of 26 healthy men. Data due to Bland(1995), pp Bland,M.(1995): An Introductions to Medical Statistics, Second edition, ELBS with Oxford University Press. 28

29

ng/ml MLE 30

31

Goodness-of-fit test KS statistic=max |CDF_FIT- CDF_EMP| For the vitamin D level data we calculate KS statistic KS(cal)= % tabulated value=0.274 Conclusion: Lognormal distribution fit is good for the given vitamin D data 32

Estimated population having vitamin D level within the normal range (30 – 74 ng/ml) is about 56% Estimated population having vitamin D level below the normal range is about 40% Estimated population having vitamin D level above the normal range is about 4% Results 33

Weibull Distribution Weibull distribution is used to analyze the lifetime data T: Lifetime variable Density function : Scale parameter(.632 quantile) : Shape parameter( 1 or =1) CDF : Reliability (or Survival) function: 34

Increasing hazard rate : for Decreasing hazard rate: for Constant hazard rate : for p quantile, which is the solution of Accordingly, Hazard Function : 35

Density function: : Scale parameter(.632 quantile) CDF : Reliability (or Survival) function: Weibull distribution reduces to exponential distribution when Exponential distribution 36

p quantile, which is the solution of Accordingly, Hazard Function : 37

The red curve is the exponential density The red line is the exp. hazard function 38

From the Weibull CDF we get where Ordered lifetimes are: values are obtained through the empirical CDF values as given below Validity of Weibull distribution for a set of data 39

If the data follow Weibull distribution with scale parameter and shape parameter, the plot of (X,Y) will roughly be linear with slope and passes through the point. Accordingly, the graphical estimates of and may be obtained. 40

Table 4: Specimens lives (in hours) of a electrical insulation at temperature appear below. Data due to Nelson(1990), pp. 154 Nelson,W.(1990): Accelerated Testing: Statistical Models, Test Plans, and Data Analyses, John Wiley and Sons. 2520, 2856, 3192, 3192,

42

MLE of and Log-likelihood function of and based on observed data MLE of and by maximizing the log-Likelihood with respect to and using numerical method. Graphical estimates may be used as starting values required for the numerical method The MLEs of and are denoted by and respectively. 43

For the insulation fluid data given in table 4 the following results (based on MLEs) are obtained: hours Estimated median life= hours ML estimate of R(t) Time (hour): Reliability :

Weibull versus Exponential Model Suppose we want to test whether we accept exponential or Weibull model for a given set of data The above test is equivalent to test whether the shape parameter of Weibull distribution is unity or not i.e. vs 45

Test Procedure(LR test) Under the log-likelihood function is which yields, MLE of. Maximum of is given by 46

Similarly, under the maximum of the log-likelihood is given by where and are the MLE s of and under. LR test implies follows chi-square distribution with 1 df. If, accept (use) exponential Model 47

If, accept (use) Weibull model For the insulation fluid data given in table 4 Conclusion: Weibull model may be accepted at 5% level of significance 48

Accelerated Life Testing (ALT) for Weibull Distribution Stress: Temperature, Voltage, Load, etc. Under operating (used) stress level, it takes a lot of time to get sufficient number of failures Lifetimes obtained under high stress levels Aim: (i) To estimate the lifetime distribution under used stress level, say, (ii) To estimate reliability for a specified time under (iii) To estimate quantiles under

Sampling scheme(under constant stress testing) Divide n components into k groups with number of components respectively, where components exposed under stress levels, j-th lifetime corresponding to Obtain the equation for the lifetime corresponding to i-th group 50

If the data corresponding to the i- th group follow, the plot will roughly be linear with slope and passes through the point If the plots are linear and parallel, then lifetimes under different stress levels are Weibull with common slope and different scale which implies that depends on the stress levels The equation for the lifetime corresponding to i-th group 51

If the k plots are linear and parallel, the lifetimes under different stress levels are Weibull with different slopes and different scales which implies that both and both depend on the stress levels. In this case modeling is difficult. For the first case the relationship between the life and stress will be identified Plot log(.632 quantile) against the stress levels If the plot yields a straight line then the life-stress relationship will be 52

Estimation of Likelihood function under stress level where, Total log-likelihood using ML method 53

Using numerical method MLEs of may be obtained MLE of at, say,, is obtained through the relationship Hence ML estimate of Weibull density under used stress level is obtained. Accordingly, estimate of reliability for a specified time, median life and other desired percentiles may also be obtained 54

Table 5: Specimens lives (in hours) of a electrical insulation at three temperatures appear below, data of Nelson(1990), pp Nelson,W.(1990): Accelerated Testing: Statistical Models, Test Plans, and Data Analyses, John Wiley and Sons. 55

56

Above three plots of the data given in table5 are roughly linear and parallel, so the lifetimes under three stress levels are Weibull with common slope and different scale parameters which implies that the scale parameters depend on the stress levels Arrhenious life-stress relationship (temperature stress) where W is the temperature in degree kelvin Temperature in degree kelvin= temperature in degree centigrade plus

58

Results based on MLEs for the data given in table 5 with respect to the Arrhenious-Weibull model At used stress(180 deg. Centigrade) the following results are obtained Estimated median lifetime= hours and 59

At used stress(180 deg. Centigrade) the following results are obtained Estimated median lifetime= hours Results based on MLEs for the data given in table 5 with respect to the Arrhenious-Exponential model 60

Weibull versus Exponential Model for ALT vs (For Exponential model) (For Weibull model) Conclusion: Accept Weibull model at 5% level of significance 61