Download presentation
Presentation is loading. Please wait.
1
Known Probability Distributions
Engineers frequently work with data that can be modeled as one of several known probability distributions. Being able to model the data allows us to: model real systems design predict results Key discrete probability distributions include: binomial / multinomial negative binomial hypergeometric Poisson EGR
2
Discrete Uniform Distribution
Simplest of all discrete distributions All possible values of the random variable have the same probability, i.e., f(x; k) = 1/ k, x = x1 , x2 , x3 , … , xk Expectations of the discrete uniform distribution draw the uniform distribution EGR
3
Binomial & Multinomial Distributions
Bernoulli Trials Inspect tires coming off the production line. Classify each as defective or not defective. Define “success” as defective. If historical data shows that 95% of all tires are defect-free, then P(“success”) = 0.05. Signals picked up at a communications site are either incoming speech signals or “noise.” Define “success” as the presence of speech. P(“success”) = P(“speech”) Administer a test drug to a group of patients with a specific condition. P(“success”) = ___________ Bernoulli Process n repeated trials the outcome may be classified as “success” or “failure” the probability of success (p) is constant from trial to trial repeated trials are independent. EGR
4
Binomial Distribution
Example: Historical data indicates that 10% of all bits transmitted through a digital transmission channel are received in error. Let X = the number of bits in error in the next 4 bits transmitted. Assume that the transmission trials are independent. What is the probability that Exactly 2 of the bits are in error? At most 2 of the 4 bits are in error? more than 2 of the 4 bits are in error? The number of successes, X, in n Bernoulli trials is called a binomial random variable. remember, YOU define what a “success” is … e.g., votes, defects, errors EGR
5
Binomial Distribution
The probability distribution is called the binomial distribution. b(x; n, p) = , x = 0, 1, 2, …, n where p = _________________ q = _________________ For our example, b(x; n, p) = _________________ p = probability of success (error in transmission) q = probability of failure = 1-p b(x; n, p) = (4 choose x)(0.1)x(0.9)4-x , x = 0,1,2,3,4 EGR
6
For Our Example … What is the probability that exactly 2 of the bits are in error? At most 2 of the 4 bits are in error? b(x; n, p) = (4 choose x)(0.1)x(0.9)4-x , x = 0,1,2,3,4 P(X = 2) = (4 choose 2) (0.1)2(0.9)2 = P(X < 2) = P(0) + P(1) + P(2) = (4 choose 0) (0.1)0(0.9)4 + (4 choose 1) (0.1)1(0.9)3 + (4 choose 2) (0.1)2(0.9)2 = EGR
7
Your turn … What is the probability that more than 2 of the 4 bits are in error? b(x; n, p) = (4 choose x)(0.1)x(0.9)4-x , x = 0,1,2,3,4 P(X > 2) = P(3) + P(4) = (4 choose 3) (0.1)3(0.9)1 + (4 choose 4) (0.1)4(0.9)0 = P(X < 2) = P(0) + P(1) + P(2) = (4 choose 0) (0.1)0(0.9)4 + (4 choose 1) (0.1)1(0.9)3 + (4 choose 2) (0.1)2(0.9)2 = EGR
8
Expectations of the Binomial Distribution
The mean and variance of the binomial distribution are given by μ = np σ2 = npq Suppose, in our example, we check the next 20 bits. What are the expected number of bits in error? What is the standard deviation? μ = ___________ σ2 = __________ , σ = __________ μ = np = 20(0.1) = 2 σ2 = npq = 20(0.1)(0.9)= 1.8, σ = 1.34 EGR
9
Another example A worn machine tool produces 1% defective parts. If we assume that parts produced are independent, what is the mean number of defective parts that would be expected if we inspect 25 parts? What is the expected variance of the 25 parts? μ = np = 25(0.01) = 0.25 σ2 = npq = 25(0.01)(0.99)= ** NOTE: ≠ 0.25 EGR
10
Helpful Hints … Sometimes it helps to draw a picture.
Suppose we inspect the next 5 parts … P(at least 3) P(2 ≤ X ≤ 4) P(less than 4) Appendix Table A.1 (pp ) lists Binomial Probability Sums, ∑rx=0b(x; n, p) EGR
11
Your turn … Use Table A.1 to determine
1. b(x; 15, 0.4) , P(X ≤ 8) = ______________ 2. b(x; 15, 0.4) , P(X < 8) = ______________ 3. b(x; 12, 0.2) , P(2 ≤ X ≤ 5) = ___________ 4. b(x; 4, 0.1) , P(X > 2) = ______________ 0.9050 0.7869 – = 1 – = EGR
12
Multinomial Experiments
What if there are more than 2 possible outcomes? (e.g., acceptable, scrap, rework) That is, suppose we have: n independent trials k outcomes that are mutually exclusive (e.g., ♠, ♣, ♥, ♦) exhaustive (i.e., ∑all k pi = 1) Then f(x1, x2, …, xk; p1, p2, …, pk, n) = EGR
13
Example Look at problem 5.22, pg. 152
f( __, __, __; ___, ___, ___, __) =_________________ = __________________________________ x1 = _______ p1 = x2 = p2 = n = _____ x3 = p3 = f( 5,2,1; 0.5, 0.25, 0.25, 8) = (8 choose 5,2,1)(0.5)5(0.25)2(0.25)1 = 8!/(5!2!1!)* )(0.5)5(0.25)2(0.25)1 = 21/256 or EGR
14
Hypergeometric Distribution
Example*: Automobiles arrive in a dealership in lots of 10. Five out of each 10 are inspected. For one lot, it is know that 2 out of 10 do not meet prescribed safety standards. What is probability that at least 1 out of the 5 tested from that lot will be found not meeting safety standards? *from Complete Business Statistics, 4th ed (McGraw-Hill) Note the difference between binomial (assumes sampling “with replacement”) and hypergeometric (sampling “without replacement”) Also, binomial assumes independence, while hypergeometric does not. EGR
15
This example follows a hypergeometric distribution:
A random sample of size n is selected without replacement from N items. k of the N items may be classified as “successes” and N-k are “failures.” The probability associated with getting x successes in the sample (given k successes in the lot.) Where, k = number of “successes” = 2 n = number in sample = 5 N = the lot size = 10 x = number found = 1 or 2 EGR
16
Hypergeometric Distribution
In our example, = _____________________________ P(X > 1) = = 0.778 EGR
17
Expectations of the Hypergeometric Distribution
The mean and variance of the hypergeometric distribution are given by What are the expected number of cars that fail inspection in our example? What is the standard deviation? μ = ___________ σ2 = __________ , σ = __________ μ = nk/N = 5*2/10 = 1 σ2 = (5/9)(5*2/10)(1-2/10) = 0.444 σ = 0.667 EGR
18
Your turn … A worn machine tool produced defective parts for a period of time before the problem was discovered. Normal sampling of each lot of 20 parts involves testing 6 parts and rejecting the lot if 2 or more are defective. If a lot from the worn tool contains 3 defective parts: What is the expected number of defective parts in a sample of six from the lot? What is the expected variance? What is the probability that the lot will be rejected? N = 20 n = 6 k = 3 μ = nk/N = 6*3/20 =18/20=0.9 σ2 = (14/19)(6*3/20)(1-3/20) = P(X>2) = 1 – [P(0)+P(1)] = =0.2018 = P(2)+P(3) = [3 choose 2]*[17 choose 4] / [20 choose 6] + [3 choose 3]*[17 choose 3] / [20 choose 6] = = EGR
19
Binomial Approximation
Note, if N >> n, then we can approximate this with the binomial distribution. For example: Automobiles arrive in a dealership in lots of out of each 100 are inspected. 2 /10 (p=0.2) are indeed below safety standards. What is probability that at least 1 out of 5 will be found not meeting safety standards? Recall: P(X ≥ 1) = 1 – P(X < 1) = 1 – P(X = 0) Hypergeometric distribution Binomial distribution h(0;100,5,20) = (20 choose 0)(80 choose 5)/(100 choose 5) = 1-P(0) = = From Table A1, n=5, p=0.2 b(0;5,0.2) = 1-P(0) = = NOTE: If N = 200, then the hypergeometric distribution yields P(X > 1) = 0.676 Comparing to example 5.14, we can see that the binomial approximation gets very close as N gets very large relative to n. (Compare to example 5.15, pg. 155) EGR
20
Negative Binomial Distribution
Example: Historical data indicates that 30% of all bits transmitted through a digital transmission channel are received in error. An engineer is running an experiment to try to classify these errors, and will start by gathering data on the first 10 errors encountered. What is the probability that the 10th error will occur on the 25th trial? EGR
21
This example follows a negative binomial distribution:
Repeated independent trials. Probability of success = p and probability of failure = q = 1-p. Random variable, X, is the number of the trial on which the kth success occurs. The probability associated with the kth success occurring on trial x is given by, Where, k = “success number” = 10 x = trial number on which k occurs = 25 p = probability of success (error) = 0.3 q = 1 – p = 0.7 EGR
22
Negative Binomial Distribution
In our example, = _____________________________ b*(15;10,0.1) = (24 choose 9)(.3)10(.7)15 = 0.037 EGR
23
Geometric Distribution
Example: In our example, what is the probability that the 1st bit received in error will occur on the 5th trial? This is an example of the geometric distribution, which is a special case of the negative binomial in which k = 1. The probability associated with the 1st success occurring on trial x is given by = __________________________________ (0.3)(0.7)4 = 0.072 EGR
24
Your turn … A worn machine tool produces 1% defective parts. If we assume that parts produced are independent: What is the probability that the 2nd defective part will be the 6th one produced? What is the probability that the 1st defective part will be seen before 3 are produced? How many parts can we expect to produce before we see the 1st defective part? (Hint: see Theorem 5.4, pg. 161) b*(6:2,0.01) = (5 choose 1)(.01)2(.99)4 = P(X<3) = P(1)+P(2) = (0.01)*(0.99)1-1 + (0.01)*(0.99)2-1 = μ = 1/p = 1/0.01 = 100 EGR
25
Poisson Process The number of occurrences in a given interval or region with the following properties: “memoryless” P(occurrence) during a very short interval or small region is proportional to the size of the interval and doesn’t depend on number occurring outside the region or interval. P(X>1) in a very short interval is negligible memoryless number in one interval is independent of the number in a different interval EGR
26
Poisson Process Examples: Number of bits transmitted per minute.
Number of calls to customer service in an hour. Number of bacteria in a given sample. Number of hurricanes per year in a given region. memoryless number in one interval is independent of the number in a different interval EGR
27
Poisson Process Example
An average of 2.7 service calls per minute are received at a particular maintenance center. The calls correspond to a Poisson process. To determine personnel and equipment needs to maintain a desired level of service, the plant manager needs to be able to determine the probabilities associated with numbers of service calls. What is the probability that fewer than 2 calls will be received in any given minute? EGR
28
Poisson Distribution The probability associated with the number of occurrences in a given period of time is given by, Where, λ = average number of outcomes per unit time or region = 2.7 t = time interval or region = 1 minute EGR
29
Our Example The probability that fewer than 2 calls will be received in any given minute is … P(X < 2) = P(X = 0) + P(X = 1) = __________________________ The mean and variance are both λt, so μ = _____________________ Note: Table A.2, pp , gives Σt p(x;μ) P(x=0) = e /0! + e /1! = μ = 2.7 EGR
30
Poisson Distribution If more than 6 calls are received in a 3-minute period, an extra service technician will be needed to maintain the desired level of service. What is the probability of that happening? μ = λt = _____________________ P(X > 6) = 1 – P(X < 6) = _____________________ μ = 2.7*3 = 8.1 ≈ 8 see page 668 , with μ = 8 and r = 6, P(X < 6) = P = = EGR
31
Poisson Distribution EGR
32
Poisson Distribution The effect of λ on the Poisson distribution
EGR
33
Continuous Probability Distributions
Many continuous probability distributions, including: Uniform Normal Gamma Exponential Chi-Squared Lognormal Weibull Uniform Normal – Gamma Exponential Chi-Squared Lognormal Weibull - EGR
34
Uniform Distribution Simplest – characterized by the interval endpoints, A and B. A ≤ x ≤ B = 0 elsewhere Mean and variance: and draw distribution EGR
35
Example A circuit board failure causes a shutdown of a computing system until a new board is delivered. The delivery time X is uniformly distributed between 1 and 5 days. What is the probability that it will take 2 or more days for the circuit board to be delivered? interval = [1,5] f(x) = 1/(B-A) = 1/(5-1) = ¼, 1 < x < 5 (0 elsewhere) First: show the distribution and demonstrate the “intuitive” answer Then – P(X>2) = ∫25 (1/4)dx = 0.75 EGR
36
Normal Distribution The “bell-shaped curve”
Also called the Gaussian distribution The most widely used distribution in statistical analysis forms the basis for most of the parametric tests we’ll perform later in this course. describes or approximates most phenomena in nature, industry, or research Random variables (X) following this distribution are called normal random variables. the parameters of the normal distribution are μ and σ (sometimes μ and σ2.) note: nonparametric tests are distribution-free, assume no underlying distribution (see ch 16) EGR
37
Normal Distribution The density function of the normal random variable X, with mean μ and variance σ2, is all x. (μ = 5, σ = 1.5) properties of the curve: peak is both the mean and the mode and occurs at x = μ curve is symmetrical about a vertical axis through the mean total area under the curve and above the horizontal axis = 1. points of inflection are at x = μ + σ EGR
38
Standard Normal RV … Note: the probability of X taking on any value between x1 and x2 is given by: To ease calculations, we define a normal random variable where Z is normally distributed with μ = 0 and σ2 = 1 EGR
39
Standard Normal Distribution
Table A.3: “Areas Under the Normal Curve” EGR
40
Examples P(Z ≤ 1) = P(Z ≥ -1) = P(-0.45 ≤ Z ≤ 0.36) =
draw the area on the picture … 1. P(Z < 1) = 2. P(Z ≥ -1) = 3. P(-0.45 ≤ Z ≤ 0.36) = P(Z < 0.36) – P(Z< -0.45) = – = 0.316 EGR
41
Your turn … Use Table A.3 to determine (draw the picture!)
1. P(Z ≤ 0.8) = 2. P(Z ≥ 1.96) = 3. P(-0.25 ≤ Z ≤ 0.15) = 4. P(Z ≤ -2.0 or Z ≥ 2.0) = 1. P(Z ≤ 0.8) = 2. P(Z ≥ 1.96) = 1 – = (=P(Z < -1.96)) Note symmetry!! 3. P(-0.25 ≤ Z ≤ 0.15) = – = 4. P(Z ≤ -2.0 or Z ≥ 2.0) = 2 * = EGR
42
The Normal Distribution “In Reverse”
Example: Given a normal distribution with μ = 40 and σ = 6, find the value of X for which 45% of the area under the normal curve is to the left of X. If P(Z < k) = 0.45, k = ___________ Z = _______ X = _________ Will X be greater than 40? k = Z = = (X – 40)/6 Z = (-0.125*6)+40 = 39.25 EGR
43
Normal Approximation to the Binomial
If n is large and p is not close to 0 or 1, or if n is smaller but p is close to 0.5, then the binomial distribution can be approximated by the normal distribution using the transformation: NOTE: add or subtract 0.5 from X to be sure the value of interest is included (draw a picture to know which) Look at example 6.15, pg. 191 EGR
44
Look at example 6.15, pg. 191 p = n = 100 μ = ____________ σ = ______________ if x = 30, then z = _____________________ and, P(X < 30) = P (Z < _________) = _________ μ= np = 100*0.4 = 40 σ = sqrt(npq) = sqrt(100*0.4*0.6) = 4.899 draw the picture! z = ((30-0.5) – 40)/4.899 = -2.14 P(Z < -2.14) = EGR
45
Your Turn Refer to the previous example, DRAW THE PICTURE!!
What is the probability that more than 50 survive? What is the probability that exactly 45 survive? x > 50, z = (( ) – 40)/4.899 = 2.14 P(Z > 2.14) = P(Z < -2.14) = (by symmetry) 2. x = 45, z1 = ( )/4.899 = 1.12 z2 = (44.5 – 40)/4.899 = P(X = 45) = P(z < 1.12) – P(z < 0.98) = – = NOTE: b(45;100,0.4) = (100 choose 45)* = EGR
46
Gamma & Exponential Distributions
Recall the Poisson Process Number of occurrences in a given interval or region “Memoryless” process Sometimes we’re interested in the time or area until a certain number of events occur. For example An average of 2.7 service calls per minute are received at a particular maintenance center. The calls correspond to a Poisson process. What is the probability that up to a minute will elapse before 2 calls arrive? How long before the next call? EGR
47
Gamma Distribution The density function of the random variable X with gamma distribution having parameters α (number of occurrences) and β (time or region). x > 0. μ = αβ σ2 = αβ2 Describes the time until a specified # of Poisson events occurs. EGR
48
Exponential Distribution
Special case of the gamma distribution with α = 1. x > 0. Describes the time until or time between Poisson events. μ = β σ2 = β2 EGR
49
Example An average of 2.7 service calls per minute are received at a particular maintenance center. The calls correspond to a Poisson process. What is the probability that up to a minute will elapse before 2 calls arrive? β = ________ α = ________ P(X ≤ 1) = _________________________________ β = 1/λ = 1/(2.7) = α = 2 P(X < 1) = 0∫1 (1/ β2) x e-x/ β dx = ∫1 x e -2.7x dx = [-2.7xe-2.7x – e-2.7x]01 = 1 – e-2.7 ( ) = EGR
50
Example (cont.) What is the expected time before the next call arrives? β = ________ α = ________ μ = _________________________________ Exponential distribution μ = β = 1/2.7 = min. EGR
51
Your turn … Look at problem 6.40, page 205. EGR 252 - 6 α = 2 β = 3
α = 2 β = 3 P(x>9) = = 1/9 9∫∞ x e-x/ 3 dx = [-x/3 * e-x/3 - e-x/3]9∞ = 4 e-3 = EGR
52
Chi-Squared Distribution
Special case of the gamma distribution with α = ν/2 and β = 2. x > 0. where ν is a positive integer. single parameter,ν is called the degrees of freedom. μ = ν σ2 = 2ν Note: this will become important when we start talking about statistical inference. Stay tuned! EGR 252 Ch. 6 EGR 52 52
53
Lognormal Distribution
When the random variable Y = ln(X) is normally distributed with mean μ and standard deviation σ, then X has a lognormal distribution with the density function, Uses – reliability and maintainability; environmental engineers – concentration of pollutants, particle size in emissions; long term rate of return on stock investments EGR 252 Ch. 6 EGR 53 53
54
Example Look at problem 6.72, pg. 207 …
Since ln(X) has normal distribution with μ = 5 and σ = 2, the probability that X > 50,000 is, P(X > 50,000) = __________________________ P(X > 50,000) = 1 – P(X < 50,000) = 1 – P (Z < (ln(50,000) – 5)/ 2) = 1 – P (Z < 2.91) = = EGR EGR 252 Ch. 6 54 54
55
Wiebull Distribution Used for many of the same applications as the gamma and exponential distributions, but does not require memoryless property of the exponential Uses – EGR 252 Ch. 6 EGR 55 55
56
Example Designers of wind turbines for power generation are interested in accurately describing variations in wind speed, which in a certain location can be described using the Weibull distribution with α = 0.02 and β = 2. A designer is interested in determining the probability that the wind speed in that location is between 3 and 7 mph. P(3 < X < 7) = ___________________________ P(3 < X < 7) = F(7) – F(3) = [1 – e – (0.02)7^2] – [1 – e – (0.02)3^2] = = 0.46 EGR EGR 252 Ch. 6 56 56
57
Populations and Samples
Population: “a group of individual persons, objects, or items from which samples are taken for statistical measurement” Sample: “a finite part of a statistical population whose properties are studied to gain information about the whole” (Merriam-Webster Online Dictionary, October 5, 2004) EGR
58
Examples Population Samples
Students pursuing undergraduate engineering degrees Cars capable of speeds in excess of 160 mph. Potato chips produced at the Frito-Lay plant in Kathleen Freshwater lakes and rivers Samples Samples: engineering students selected at random from all engineering programs in the US. 50 cars selected at random from among those certified as having achieved 160 mph or more during 2003. 10 chips selected at random every 5 minutes as the conveyor passes the inspector. 4 samples taken from randomly selected locations in randomly selected and representative freshwater lakes and rivers OTHERS? EGR
59
Basic Statistics (review)
1. Sample Mean: Example: At the end of a team project, team members were asked to give themselves and each other a grade on their contribution to the group. The results for two team members were as follows: = ___________________ XQ = 87.5 XS = 85 Q S 92 85 95 88 75 78 EGR
60
Basic Statistics (review)
1. Sample Variance: For our example: SQ2 = ___________________ SS2 = ___________________ Q S 92 85 95 88 75 78 S2Q = S2S = EGR
61
Your Turn Work in groups of 4 or 5. Find the mean, variance, and standard deviation for your group of the (approximate) number of hours spent working on homework each week. EGR
62
Sampling Distributions
If we conduct the same experiment several times with the same sample size, the probability distribution of the resulting statistic is called a sampling distribution Sampling distribution of the mean: if n observations are taken from a normal population with mean μ and variance σ2, then: EGR
63
Central Limit Theorem Given: Then,
X : the mean of a random sample of size n taken from a population with mean μ and finite variance σ2, Then, the limiting form of the distribution of is _________________________ The standard normal distribution n(z;0,1) EGR
64
Central Limit Theorem If the population is known to be normal, the sampling distribution of X will follow a normal distribution. Even when the distribution of the population is not normal, the sampling distribution of X is normal when n is large. NOTE: when n is not large, we cannot assume the distribution of X is normal. EGR
65
Example: The time to respond to a request for information from a customer help line is uniformly distributed between 0 and 2 minutes. In one month 48 requests are randomly sampled and the response time is recorded. What is the probability that the average response time is between 0.9 and 1.1 minutes? μ =______________ σ2 = ________________ μX =__________ σX2 = ________________ Z1 = _____________ Z2 = _______________ P(0.9 < X < 1.1) = _____________________________ f(x) = ½, 0<x<2 (uniform dist) μ = (b-a)/2 = 1 σ2 = (b-a)2/12 = 1/3 z = (x-μ) (σ/sqrt(n)) μx = 1 σx2 = (1/3)/48 = 1/144 Z1 = (.9-1)/(1/12) = -1.2 Z2 = (1.1-1)/(1/12) = 1.2 P(Z2) – P(Z1) = =0.7698 EGR
66
Sampling Distribution of the Difference Between two Averages
Given: Two samples of size n1 and n2 are taken from two populations with means μ1 and μ2 and variances σ12 and σ22 Then, See example 8.8, pg 213 and example 8.9, pg 214 EGR
67
Sampling Distribution of S2
Given: S2 is the variance of of a random sample of size n taken from a population with mean μ and finite variance σ2, Then, has a χ2 distribution with ν = n - 1 EGR
68
χ2 Distribution χα2 represents the χ2 value above which we find an area of α, that is, for which P(χ2 > χα2 ) = α. EGR
69
Example Look at example 8.10, pg. 256: μ = 3 σ = 1 n = 5
s2 = ________________ χ2 = __________________ If the χ2 value fits within an interval that covers 95% of the χ2 values with 4 degrees of freedom, then the estimate for σ is reasonable. (See Table A.5, pp ) from book (& excel) – s2 = 0.815 χ2 = (n-1)s2 / σ2 = (4)(0.815)/1 = 3.26 looking for X2 values that cover 95%, meaning α values between and 0.975 from table A.5 => Χ =11.143 Χ = 0.484 EGR
70
Your turn … If a sample of size 7 is taken from a normal population (i.e., n = 7), what value of χ2 corresponds to P(χ2 < χα2) = 0.95? (Hint: first determine α.) NOTE the figure associated with table A.5!! (These values cover areas > the X2 value …) ν= 7-1 = 6; α = 0.05 X2 = EGR
71
t- Distribution Recall, by CLT: is n(z; 0,1)
Assumption: _____________________ (Generally, if an engineer is concerned with a familiar process or system, this is reasonable, but …) assumption: we know σ EGR
72
What if we don’t know σ? New statistic: Where, and
follows a t-distribution with ν = n – 1 degrees of freedom. EGR
73
Characteristics of the t-Distribution
Look at fig. 8.13, pg. 259 Note: Shape: _________________________ Effect of ν: __________________________ See table A.4, pp shape – symmetrical about 0 effect of ν – variance, as seen in the width of the curve, depends on sample size note – as ν increases, curve looks more like a normal distribution (hence, the CLT) Table A.4 – critical values of t for several values of α and df. Note that the table yields the right tail of the distribution. EGR
74
Using the t-Distribution
Testing assumptions about the value of μ Example: problem 8.52, pg. 265 What value of t corresponds to P(t < tα) = 0.95? x-bar = 0.475 s2 = sum(x – x-bar)2/(n-1) = s = μ= 0.5 t = ( )/(0.1832/sqrt(8)) = -0.39 Looking in table a.4, pg. 672 for an α value associated with (n-1) = 7 degrees of freedom and a t-value of 0.39 (by symmetry), we see it is somewhere between 0.4 and 0.3 call it 0.35. P(xbar < 0.5) = P(T<-.39) ≈ 0.35 … inconclusive EGR
75
Comparing Variances of 2 Samples
Given two samples of size n1 and n2, with sample means X1 and X2, and variances, s12 and s22 … Are the differences we see in the means due to the means or due to the variances (that is, are the differences due to real differences between the samples or variability within each samples)? See figure 8.16, pg. 262 EGR
76
F-Distribution Given: Then,
S12 and S22, the variances of independent random samples of size n1 and n2 taken from normal populations with variances σ12 and σ22, respectively, Then, has an F-distribution with ν1 = n1 - 1 and ν2 = n2 – 1 degrees of freedom. (See table A.6, pp ) EGR
77
Example Problem 8.55, pg. 266 S12 = ___________________
F = _____________ f0.05 (4, 5) = _________ NOTE: Note: if the two population variances are equal, then F=S12 / S22 so we are testing the hypothesis that F=S12 / S22 = 1 S12 =(5*SUMSQ(A1:A5)-SUM(A1:A5)^2)/(5*4) = 15750 S22 =(6*SUMSQ(B1:B6)-SUM(B1:B6)^2)/(5*6) = 10920 F = 15750/10920 =1.44 f0.05 (4, 5) = 5.19 EGR
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.