Data Analysis Statistical Measures Industrial Engineering

Slides:



Advertisements
Similar presentations
Important Random Variables Binomial: S X = { 0,1,2,..., n} Geometric: S X = { 0,1,2,... } Poisson: S X = { 0,1,2,... }
Advertisements

5 - 1 © 1997 Prentice-Hall, Inc. Importance of Normal Distribution n Describes many random processes or continuous phenomena n Can be used to approximate.
Sampling Distributions
Chapter 7 Introduction to Sampling Distributions
Sampling Distributions
1 Sampling Distributions Chapter Introduction  In real life calculating parameters of populations is prohibitive because populations are very.
Chapter 6 Continuous Random Variables and Probability Distributions
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Introduction to Statistics Chapter 7 Sampling Distributions.
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
Normal and Sampling Distributions A normal distribution is uniquely determined by its mean, , and variance,  2 The random variable Z = (X-  /  is.
Continuous Probability Distribution  A continuous random variables (RV) has infinitely many possible outcomes  Probability is conveyed for a range of.
R. Kass/S07 P416 Lec 3 1 Lecture 3 The Gaussian Probability Distribution Function Plot of Gaussian pdf x p(x)p(x) Introduction l The Gaussian probability.
 Deviation is a measure of difference for interval and ratio variables between the observed value and the mean.  The sign of deviation (positive or.
Chapter 6 Sampling and Sampling Distributions
SAMPLING DISTRIBUTION
Continuous Probability Distributions  Continuous Random Variable  A random variable whose space (set of possible values) is an entire interval of numbers.
Random Sampling, Point Estimation and Maximum Likelihood.
Chap 6-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 6 Introduction to Sampling.
Functions of Random Variables. Methods for determining the distribution of functions of Random Variables 1.Distribution function method 2.Moment generating.
Lecture 15: Statistics and Their Distributions, Central Limit Theorem
Determination of Sample Size: A Review of Statistical Theory
What does Statistics Mean? Descriptive statistics –Number of people –Trends in employment –Data Inferential statistics –Make an inference about a population.
§ 5.3 Normal Distributions: Finding Values. Probability and Normal Distributions If a random variable, x, is normally distributed, you can find the probability.
Random Variables (1) A random variable (also known as a stochastic variable), x, is a quantity such as strength, size, or weight, that depends upon a.
The final exam solutions. Part I, #1, Central limit theorem Let X1,X2, …, Xn be a sequence of i.i.d. random variables each having mean μ and variance.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
South Dakota School of Mines & Technology Introduction to Probability & Statistics Industrial Engineering.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
South Dakota School of Mines & Technology Introduction to Probability & Statistics Industrial Engineering.
1 Sampling distributions The probability distribution of a statistic is called a sampling distribution. : the sampling distribution of the mean.
Lecture 5 Introduction to Sampling Distributions.
1 Probability: Introduction Definitions,Definitions, Laws of ProbabilityLaws of Probability Random VariablesRandom Variables DistributionsDistributions.
Sampling Distributions Sampling Distributions. Sampling Distribution Introduction In real life calculating parameters of populations is prohibitive because.
Statistics Sampling Distributions and Point Estimation of Parameters Contents, figures, and exercises come from the textbook: Applied Statistics and Probability.
SAMPLING DISTRIBUTION. 2 Introduction In real life calculating parameters of populations is usually impossible because populations are very large. Rather.
Central Limit Theorem Let X 1, X 2, …, X n be n independent, identically distributed random variables with mean  and standard deviation . For large n:
Chap 5-1 Chapter 5 Discrete Random Variables and Probability Distributions Statistics for Business and Economics 6 th Edition.
South Dakota School of Mines & Technology Estimation Industrial Engineering.
Estimation Interval Estimates Industrial Engineering Estimation Interval Estimates Industrial Engineering.
R. Kass/W04 P416 Lec 3 1 Lecture 3 The Gaussian Probability Distribution Function Plot of Gaussian pdf x p(x)p(x) Introduction l The Gaussian probability.
Sampling Distributions
MECH 373 Instrumentation and Measurements
Confidence Intervals and Sample Size
Sampling Distribution Estimation Hypothesis Testing
Engineering Probability and Statistics - SE-205 -Chap 4
Measures of Central Tendency
The Gaussian Probability Distribution Function
Estimation Estimates for Proportions Industrial Engineering
South Dakota School of Mines & Technology Introduction to Probability & Statistics Industrial Engineering.
Data Analysis Statistical Measures Industrial Engineering
Estimation Interval Estimates Industrial Engineering
Notes Over 7.7 Finding Measures of Central Tendency
Random Sampling Population Random sample: Statistics Point estimate
Estimation Interval Estimates (s unknown) Industrial Engineering
Chapter 7 Sampling Distributions.
Probability Review for Financial Engineers
Using the Tables for the standard normal distribution
Introduction to Probability & Statistics The Central Limit Theorem
Estimation Interval Estimates Industrial Engineering
Estimation Method of Moments Industrial Engineering
Chapter 7 Sampling Distributions.
Introduction to Probability & Statistics Expectations
Estimating Proportions
Introduction to Probability & Statistics Joint Expectations
Exam 2 - Review Chapters
Chapter 7 Sampling Distributions.
Normal Probability Distributions
Estimation Method of Moments Industrial Engineering
Chapter 7 Sampling Distributions.
Presentation transcript:

Data Analysis Statistical Measures Industrial Engineering

Aside: Mean, Variance Mean: Variance:     xp x discrete ( ) ,   2    ( ) x p

Example Consider the discrete uniform die example: x 1 2 3 4 5 6 p(x) 1/6 1/6 1/6 1/6 1/6 1/6  = E[X] = 1(1/6) + 2(1/6) + 3(1/6) + 4(1/6) + 5(1/6) + 6(1/6) = 3.5

Example Consider the discrete uniform die example: x 1 2 3 4 5 6 p(x) 1/6 1/6 1/6 1/6 1/6 1/6 2 = E[(X-)2] = (1-3.5)2(1/6) + (2-3.5)2(1/6) + (3-3.5)2(1/6) + (4-3.5)2(1/6) + (5-3.5)2(1/6) + (6-3.5)2(1/6) = 2.92

å Binomial Mean  p ÷ ø ö ç è æ ) 1 ( )! !   xp x ( ) = 1p(1) + 2p(2) + 3p(3) + . . . + np(n) x n p - = ÷ ø ö ç è æ å ) 1 ( )! !

å Binomial Mean  p ÷ ø ö ç è æ ) 1 ( )! !   xp x ( ) = 1p(1) + 2p(2) + 3p(3) + . . . + np(n) x n p - = ÷ ø ö ç è æ å ) 1 ( )! ! Miracle 1 occurs = np

Binomial Measures Mean: Variance:     xp ( x ) = np     ( ) x 2    ( ) x p = np(1-p)

Binomial Distribution 0.0 0.1 0.2 0.3 0.4 0.5 1 2 3 4 5 x P(x) 0.0 0.1 0.2 0.3 0.4 0.5 1 2 3 4 5 x P(x) n=5, p=.3 n=8, p=.5 n=20, p=.5 n=4, p=.8 0.0 0.1 0.2 0.3 0.4 0.5 2 4 x P(x) 0.0 0.1 0.2 0.3 0.4 0.5 1 2 3 4 5 6 7 8 P(x) x

Measures of Centrality Mean Median Mode

Measures of Centrality Mean    xp x discrete ( ) ,      xf x dx continuous ( ) , Sample Mean å = n i x X 1

Measures of Centrality Exercise: Compute the sample mean for the student Gpa data 2.4, 2.7, 2.8, 2.9, 3.0, 3.0, 3.1, 3.3, 3.5, 3.9

Measures of Centrality Exercise: Compute the sample mean for the student Gpa data 2.4, 2.7, 2.8, 2.9, 3.0, 3.0, 3.1, 3.3, 3.5, 3.9 å = n i x X 1 10 9 . 3 5 1 2 8 7 4 + = = 3.06

Measures of Centrality Failure Data X 1 . 19 =

Measures of Centrality Median Compute the median for the student Gpa data 2.4, 2.7, 2.8, 2.9, 3.0, 3.0, 3.1, 3.3, 3.5, 3.9 . 3 2 = + X

Measures of Centrality Mode Class mark of most frequently occurring interval For Failure data, mode = class mark first interval ( . 5 = X

Measures of Centrality Measure Student Gpa Failure Data Mean 3.00 19.10 Median 3.04 14.40 Mode --- 5.00

Measures of Dispersion Range Sample Variance

Measures of Dispersion Range Compute the range for the student Gpa data 2.4, 2.7, 2.8, 2.9, 3.0, 3.0, 3.1, 3.3, 3.5, 3.9

Measures of Dispersion Range Compute the range for the student Gpa data 2.4, 2.7, 2.8, 2.9, 3.0, 3.0, 3.1, 3.3, 3.5, 3.9 Min = 2.4 Max = 3.9 Range = 3.9 - 2.4 = 1.5

Measures of Dispersion Variance   2    ( ) x p   2      ( ) x f dx Sample variance x 1 2 - = å n s i

Measures of Dispersion Exercise: Compute the sample variance for the student Gpa data 2.4, 2.7, 2.8, 2.9, 3.0, 3.0, 3.1, 3.3, 3.5, 3.9 x 1 2 - = å n s i

Measures of Dispersion Sample Variance x 1 2 - = å n s i ( ) 185 . 1 10 06 3 95 2 = -

Measures of Dispersion Exercise: Compute the variance for failure time data s2 = 302.76

An Aside For Failure Time data, we now have three measures for the data s2 = 302.76 X 1 . 19 =

An Aside For Failure Time data, we now have three measures for the data Exponential ?? s2 = 302.76 X 1 . 19 =

An Aside X 1 . 19 = Recall that for the exponential distribution m = 1/l s2 = 1/l2 If E[ X ] = m and E [s2 ] = s2, then 1/l = 19.1 or 1/l2 = 302.76 X 1 . 19 = s2 = 302.76 l 0524 . ˆ = l 0575 . ˆ =

Introduction to Probability & Statistics The Central Limit Theorem

The Sample Mean    x 1 2 3 4 5 6 p(x) 1/6 1/6 1/6 1/6 1/6 1/6   Suppose, for our die example, we wish to compute the mean from the throw of 2 dice: x 1 2 3 4 5 6 p(x) 1/6 1/6 1/6 1/6 1/6 1/6    xp x ( ) . 3 5 Estimate  by computing the average of two throws:   X   1 2

Joint Distributions x 1 2 3 4 5 6 p(x) 1/6 1/6 1/6 1/6 1/6 1/6 X1 X2 X

Joint Distributions x 1 2 3 4 5 6 p(x) 1/6 1/6 1/6 1/6 1/6 1/6 X1 X X2

Joint Distributions x 1 2 3 4 5 6 p(x) 1/6 1/6 1/6 1/6 1/6 1/6 X1 X X2

Joint Distributions x 1 2 3 4 5 6 p(x) 1/6 1/6 1/6 1/6 1/6 1/6 X1 X X2

Distribution of X x 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6 p(x) 1/36 2/36 3/36 4/36 5/36 6/36 5/36 4/36 3/36 2/36 1/36 Distribution of X Distribution of X 0.00 0.05 0.10 0.15 0.20 1 2 3 4 5 6 0.00 0.05 0.10 0.15 0.20 1 2 3 4 5 6 7 8 9 10 11

Distribution of X n = 2 n = 10 n = 15 0.0 0.1 0.2 0.3 0.4 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5 6.0 0.00 0.05 0.10 0.15 0.20 1 2 3 4 5 6 7 8 9 10 11 0.0 0.1 0.2 0.3 0.4 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5 6.0 n = 15

Distribution of X X lim Normal  n = 2 n = 10 n = 15 n   0.0 0.1 0.2 0.3 0.4 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5 6.0 0.00 0.05 0.10 0.15 0.20 1 2 3 4 5 6 7 8 9 10 11 0.0 0.1 0.2 0.3 0.4 1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5 6.0 n = 15 X lim n Normal   

Expected Value of X   E X n [ ] .           1 n E X [ ] . 2     1 2 n E X [ ] .

Expected Value of X     E X n [ ] .           1 n E X [ 2     1 2 n E X [ ] .     1 2 n  .

Expected Value of X       E X n [ ] .           1 n E 2     1 2 n E X [ ] .     1 2 n  .    1 n 

Variance of X  ( ) . x X n                  n X . 2 1 ( ) . x X n                  2 1 n X .

Variance of X    ( ) . x X n                  n X 2 1 ( ) . x X n                  2 1 n X .           1 2 n X  ( ) .

Variance of X    ( ) . x X n                  n X 2 1 ( ) . x X n                  2 1 n X .           1 2 n X  ( ) .        1 2 n ( )    2 n

Distribution of x Recall that x is a function of random variables, so it also is a random variable with its own distribution. By the central limit theorem, we know that where,

Example Suppose that breakeven analysis indicates we must have average daily revenues of $500. A random sample of 10 days yields an average of only $450 dollars. What is the probability we will not breakeven this year?

Example P not breakeven x { } = < m 500 450 = - > P x { } m 500 Suppose that breakeven analysis indicates we must have average daily revenues of $500. A random sample of 10 days yields an average of only $450 dollars. What is the probability we will not breakeven this year? P not breakeven x { } = < m 500 450 = - > P x { } m 500 450

Example P not breakeven { } = - > x m 500 450 Recall that Using the standard normal transformation

Example P not breakeven { }

Example   In order to solve this problem, we need to know the true but unknown standard deviation . Let us assume we have enough past data that a reasonable estimate is s = 25. P Z          50 25 10      P Z 1 58 . Pr{not breakeven} = = 0.943