Learning Objectives for Chapter 8 After careful study of this chapter, you should be able to do the following: 1.Construct confidence intervals on the.

Presentation transcript:

Learning Objectives for Chapter 8
After careful study of this chapter, you should be able to do the following:
1. Construct confidence intervals on the mean of a normal distribution, using either the normal distribution or the t distribution method.
2. Construct confidence intervals on the variance and standard deviation of a normal distribution.
3. Construct confidence intervals on a population proportion.
4. Construct an approximate confidence interval on a parameter.
5. Construct prediction intervals for a future observation.
6. Construct a tolerance interval for a normal population.

Statistical Intervals for a Single Sample: Introduction. In the previous chapter we learned how a parameter can be estimated from sample data. It is also important to understand how good the resulting estimate is. An interval estimate for a population parameter is called a confidence interval. The tolerance interval is another important type of interval estimate. The confidence interval is a random interval. The appropriate interpretation of a confidence interval (for example, on μ) is: the observed interval [l, u] brackets the true value of μ with confidence 100(1 − α)%.

Confidence interval: a range of values, derived from sample statistics, that is likely to contain the value of an unknown population parameter. Because of their random nature, it is unlikely that two samples from a given population will yield identical confidence intervals. But if you repeated your sample many times, a certain percentage of the resulting confidence intervals would contain the unknown population parameter. The percentage of these confidence intervals that contain the parameter is the confidence level of the interval. For example, suppose you want to know the average amount of time it takes for an automobile assembly line to complete a vehicle. You take a sample of completed cars, record the time they spent on the assembly line, and use the 1-sample t procedure to obtain a 95% confidence interval for the mean amount of time all cars spend on the assembly line. Because 95% of the confidence intervals constructed from all possible samples will contain the population parameter, you can be 95% confident that the mean amount of time all cars spend on the assembly line falls between your interval's endpoints, which are called confidence limits.

Confidence Interval and its Properties. A confidence interval estimate for μ is an interval of the form l ≤ μ ≤ u, where the endpoints l and u are computed from the sample data. There is a probability of 1 − α of selecting a sample for which the CI will contain the true value of μ. The endpoints or bounds l and u are called the lower- and upper-confidence limits, and 1 − α is called the confidence coefficient. Sec 8-1 Confidence Interval on the Mean of a Normal, σ² Known

Confidence interval: creating confidence intervals is analogous to throwing nets over a target with an unknown, yet fixed, location. Consider a graphic that depicts confidence intervals generated from 20 samples from the same population: a black line represents the fixed value of the unknown population parameter, the blue confidence intervals contain the value of the population parameter, and the red confidence interval does not. A 95% confidence level means that, in the long run, about 19 out of every 20 samples (95%) from the same population will produce confidence intervals that contain the population parameter. Any particular set of 20 samples may produce more or fewer misses.
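The long-run reading of the confidence level can be checked with a small simulation. The sketch below is illustrative only: the population values (mu = 50, sigma = 4), the sample size, and the function name `coverage_rate` are assumptions, not values from the slides. It draws many samples from a normal population with known sigma and counts how often the 95% z-interval covers the true mean.

```python
import math
import random
from statistics import NormalDist, mean

def coverage_rate(mu=50.0, sigma=4.0, n=25, conf=0.95, trials=4000, seed=1):
    """Fraction of simulated z-intervals that contain the true mean mu."""
    rng = random.Random(seed)
    z = NormalDist().inv_cdf(1 - (1 - conf) / 2)  # two-sided critical value
    half_width = z * sigma / math.sqrt(n)         # same for every sample (sigma known)
    hits = 0
    for _ in range(trials):
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        xbar = mean(sample)
        if xbar - half_width <= mu <= xbar + half_width:
            hits += 1
    return hits / trials

rate = coverage_rate()
print(f"observed coverage: {rate:.3f}")  # should land near 0.95, not exactly on it
```

The interval moves from sample to sample while mu stays fixed, which is why the 95% refers to the procedure rather than to any single computed interval.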

Confidence Interval on the Mean of a Normal Distribution, Variance Known. Figure 8-1 Repeated construction of a confidence interval for μ.

Guidelines for Constructing Confidence Intervals

Prediction Interval: Provides prediction on future observations.

Hw 2 groups Probability Tables Description, Objective and uses …5 ++ slides

8-2.1 Development of the Confidence Interval and its Basic Properties. Confidence Interval on the Mean of a Normal Distribution, Variance Known. The endpoints or bounds l and u are called the lower- and upper-confidence limits, respectively. Since $Z = (\bar{X} - \mu)/(\sigma/\sqrt{n})$ follows a standard normal distribution, we can write: $P(-z_{\alpha/2} \le Z \le z_{\alpha/2}) = 1 - \alpha$.
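Rearranging the standard normal statement for μ gives the usual two-sided interval x̄ ± z_{α/2} σ/√n. A minimal sketch follows, assuming that standard form. The function name `z_interval` and the sample mean of 64.46 are illustrative assumptions; n = 10 and σ = 1 are the values the slides recall for Example 8-1.

```python
import math
from statistics import NormalDist

def z_interval(xbar, sigma, n, conf=0.95):
    """Two-sided CI on the mean of a normal population with known sigma."""
    z = NormalDist().inv_cdf(1 - (1 - conf) / 2)  # z_{alpha/2}
    half_width = z * sigma / math.sqrt(n)
    return xbar - half_width, xbar + half_width

# xbar is an assumed value for illustration; it is not given in the transcript.
lo, hi = z_interval(xbar=64.46, sigma=1.0, n=10)
print(f"{lo:.2f} <= mu <= {hi:.2f}")
```

The half-width depends only on z, σ, and n, so every sample of the same size yields an interval of the same length when σ is known.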

One sided confidence Interval Confidence Interval on the Mean of a Normal Distribution, Variance Known

Example 8-1 Confidence Interval on the Mean of a Normal Distribution, Variance Known

Confidence Level and Precision of Error. The length of a confidence interval is a measure of the precision of estimation. Confidence Interval on the Mean of a Normal Distribution, Variance Known. Figure 8-2 Error in estimating μ with x̄.

8-2.2 Choice of Sample Size Sample Confidence Interval on the Mean of a Normal Distribution, Variance Known
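The slide's equation is not reproduced in this transcript. The sketch below assumes the standard result that bounding the error |x̄ − μ| by E with confidence 100(1 − α)% requires n = (z_{α/2} σ / E)², rounded up. The function name and the inputs (σ = 1, E = 0.5) are illustrative assumptions.

```python
import math
from statistics import NormalDist

def sample_size_for_mean(sigma, max_error, conf=0.95):
    """Smallest n so that the CI half-width z*sigma/sqrt(n) is at most max_error."""
    z = NormalDist().inv_cdf(1 - (1 - conf) / 2)
    return math.ceil((z * sigma / max_error) ** 2)

n_required = sample_size_for_mean(sigma=1.0, max_error=0.5)
n_tighter = sample_size_for_mean(sigma=1.0, max_error=0.25)
print(n_required, n_tighter)
```

Halving the allowed error roughly quadruples the required sample size, because n grows with 1/E².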

Example 8-2 Confidence Interval on the Mean of a Normal Distribution, Variance Known EXAMPLE 8-2 Metallic Material Transition

8-1.3 One-Sided Confidence Bounds. A 100(1 − α)% upper-confidence bound for μ is $\mu \le \bar{x} + z_{\alpha}\sigma/\sqrt{n}$ (8-3), and a 100(1 − α)% lower-confidence bound for μ is $\bar{x} - z_{\alpha}\sigma/\sqrt{n} \le \mu$ (8-4).

Example 8-3 One-Sided Confidence Bound. The same data for impact testing from Example 8-1 are used to construct a lower, one-sided 95% confidence interval for the mean impact energy. Recall that z_α = 1.64, n = 10, σ = 1, and. A 100(1 − α)% lower-confidence bound for μ is

8-2.5 A Large-Sample Confidence Interval for μ. Confidence Interval on the Mean of a Normal Distribution, Variance Known. See Example 8-3.

8-2.5 A Large-Sample Confidence Interval for μ: Definition. Confidence Interval on the Mean of a Normal Distribution, Variance Known

Example 8-3 Confidence Interval on the Mean of a Normal Distribution, Variance Known, Example 8-5 Mercury Contamination

Example 8-5 Mercury Contamination (continued). The summary statistics for the data are reported for the variable Concentration under the columns N, Mean, Median, StDev, Minimum, Maximum, Q1, and Q3. Because n > 40, the assumption of normality is not necessary to use Equation 8-5. The required values are, and z = The approximate 95% CI on μ is Interpretation: This interval is fairly wide because there is variability in the mercury concentration measurements. A larger sample size would have produced a shorter interval.

Text and Material Exam 1 Probability Tables Home Work Policy Attendance policy

t-distribution. In probability and statistics, Student's t-distribution (or simply the t-distribution) is any member of a family of continuous probability distributions that arises when estimating the mean of a normally distributed population in situations where the sample size is small and the population standard deviation is unknown. The t-distribution plays a role in a number of widely used statistical analyses, including Student's t-test for assessing the statistical significance of the difference between two sample means, the construction of confidence intervals for the difference between two population means, and linear regression analysis. Student's t-distribution also arises in the Bayesian analysis of data from a normal family. The t-distribution is symmetric and bell-shaped, like the normal distribution, but has heavier tails, meaning that it is more prone to producing values that fall far from its mean. This makes it useful for understanding the statistical behavior of certain types of ratios of random quantities, in which variation in the denominator is amplified and may produce outlying values when the denominator of the ratio falls close to zero. Student's t-distribution is a special case of the generalised hyperbolic distribution. Figures: probability density function; cumulative distribution function.

Confidence Interval on the mean of a normal distribution, Variance Unknown

8-2.2 The Confidence Interval on Mean, Variance Unknown (Table III). If x̄ and s are the mean and standard deviation of a random sample from a normal distribution with unknown variance σ², a 100(1 − α)% confidence interval on μ is given by $\bar{x} - t_{\alpha/2,n-1}\,s/\sqrt{n} \le \mu \le \bar{x} + t_{\alpha/2,n-1}\,s/\sqrt{n}$ (8-7), where $t_{\alpha/2,n-1}$ is the upper 100α/2 percentage point of the t distribution with n − 1 degrees of freedom. One-sided confidence bounds on the mean are found by replacing $t_{\alpha/2,n-1}$ in Equation 8-7 with $t_{\alpha,n-1}$. Sec 8-2 Confidence Interval on the Mean of a Normal, σ² Unknown
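A short sketch of the t-based interval, x̄ ± t_{α/2,n−1} s/√n, is given below. Python's standard library has no t quantile function, so the critical value is passed in as read from a t table. The function name `t_interval` and the summary statistics (x̄ = 13.71, s = 3.55) are assumptions for illustration; n = 22 matches the alloy adhesion example, and 2.080 is the tabulated t value for 0.025 and 21 degrees of freedom.

```python
import math

def t_interval(xbar, s, n, t_crit):
    """Two-sided CI on the mean when sigma is unknown.

    t_crit is t_{alpha/2, n-1}, looked up in a t table by the caller.
    """
    half_width = t_crit * s / math.sqrt(n)
    return xbar - half_width, xbar + half_width

# xbar and s are assumed values; t_crit = t_{0.025, 21} from a t table.
lo, hi = t_interval(xbar=13.71, s=3.55, n=22, t_crit=2.080)
print(f"{lo:.2f} <= mu <= {hi:.2f}")
```

Because s replaces σ, the interval length now varies from sample to sample, and the t critical value is larger than the corresponding z value for small n.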

Example

Example 8-6 Alloy Adhesion. Construct a 95% CI on μ to the following data. The sample mean is and sample standard deviation is s = Since n = 22, we have n − 1 = 21 degrees of freedom for t, so t_{0.025,21} = The resulting CI is Interpretation: The CI is fairly wide because there is a lot of variability in the measurements. A larger sample size would have led to a shorter interval.

Exam : First week on March No cell phones Signature Cheat Sheet First three Chapters + Sat 1 (Introduction)

Chi-square (χ²) distribution. The chi-square test is one of the most widely used statistical tests. Purposes of the chi-square test:
A. Test how well a sample fits a theoretical distribution (goodness of fit).
B. Assess the difference between observed and expected frequencies (or between two or more sets of observed data).
C. Its value is calculated from the given observed and expected frequencies.
D. Test the independence of categorical variables. For example, a manufacturer wants to know if the occurrence of four types of defects (missing pin, broken clamp, loose fastener, and leaky seal) is related to shift (day, evening, overnight).
The chi-squared distribution is a special case of the gamma distribution and is one of the most widely used probability distributions in inferential statistics. For example, it is used in goodness-of-fit tests of an observed distribution against a theoretical one (such as determining whether sample data fit a Poisson distribution), in the classification of qualitative data, and in confidence interval estimation for the population standard deviation of a normal distribution from a sample standard deviation. The shape of the chi-square distribution depends on the number of degrees of freedom. The distribution is positively skewed, but the skewness decreases with more degrees of freedom. As a rule of thumb, when the degrees of freedom are 30 or more, the distribution can be roughly approximated by a normal distribution.

Goodness of Fit Test (observed and expected test) Failure times,Bulbs

Poisson Distribution: number of defects in a finite space (A/C, mechanical systems, ...); lambda represents both the mean and the variance; binomial and standard; no upper bound on the number of calls.

Chi Square Formula. The chi-square statistic is denoted by χ², and the formula is given as $\chi^2 = \sum (O - E)^2 / E$. Here, O = observed frequency, E = expected frequency, ∑ = summation, and χ² = chi-square value.
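The statistic sums, over all categories, the squared gap between observed and expected frequency divided by the expected frequency. A minimal sketch follows; the function name and the counts are hypothetical and chosen only to make the arithmetic easy to follow.

```python
def chi_square_statistic(observed, expected):
    """Sum of (O - E)^2 / E over all categories."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

# Hypothetical counts: 60 defects over three shifts, 20 expected per shift.
stat = chi_square_statistic(observed=[10, 20, 30], expected=[20, 20, 20])
print(stat)  # (100/20) + 0 + (100/20) = 10.0
```

The statistic is then compared with a chi-square critical value for the appropriate degrees of freedom, which is looked up separately.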

Confidence Interval on the Variance and Standard Deviation of a Normal Distribution. Figure 8-8 Probability density functions of several χ² distributions.

Let X₁, X₂, …, Xₙ be a random sample from a normal distribution with mean μ and variance σ², and let S² be the sample variance. Then the random variable $X^2 = (n-1)S^2/\sigma^2$ (8-8) has a chi-square (χ²) distribution with n − 1 degrees of freedom. Sec 8-3 Confidence Interval on σ² & σ of a Normal Distribution

Confidence Interval on the Variance and Standard Deviation. If s² is the sample variance from a random sample of n observations from a normal distribution with unknown variance σ², then a 100(1 − α)% confidence interval on σ² is $\frac{(n-1)s^2}{\chi^2_{\alpha/2,n-1}} \le \sigma^2 \le \frac{(n-1)s^2}{\chi^2_{1-\alpha/2,n-1}}$ (8-9), where $\chi^2_{\alpha/2,n-1}$ and $\chi^2_{1-\alpha/2,n-1}$ are the upper and lower 100α/2 percentage points of the chi-square distribution with n − 1 degrees of freedom, respectively. A confidence interval for σ has lower and upper limits that are the square roots of the corresponding limits in Equation 8-9.
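The two-sided interval on σ² divides (n − 1)s² by the upper and lower chi-square percentage points. The sketch below takes those table values as arguments, since the standard library does not provide chi-square quantiles. The function name and the sample variance of 0.0153 are assumptions for illustration; 32.85 and 8.91 are the tabulated values for 19 degrees of freedom at tail areas 0.025 and 0.975.

```python
import math

def variance_interval(s2, n, chi2_upper, chi2_lower):
    """Two-sided CI on sigma^2 for a normal population.

    chi2_upper = chi^2_{alpha/2, n-1}, chi2_lower = chi^2_{1-alpha/2, n-1},
    both read from a chi-square table by the caller.
    """
    df = n - 1
    return df * s2 / chi2_upper, df * s2 / chi2_lower

# s2 is an assumed value; the chi-square points are for 19 degrees of freedom.
var_lo, var_hi = variance_interval(s2=0.0153, n=20, chi2_upper=32.85, chi2_lower=8.91)
sd_lo, sd_hi = math.sqrt(var_lo), math.sqrt(var_hi)  # limits for sigma
print(f"{var_lo:.4f} <= sigma^2 <= {var_hi:.4f}")
print(f"{sd_lo:.3f} <= sigma <= {sd_hi:.3f}")
```

The interval is not symmetric about s² because the chi-square distribution is skewed: the larger table value produces the lower limit.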

Confidence Interval on the Variance and Standard Deviation of a Normal Distribution One-Sided Confidence Bounds

Example 8-7 Detergent Filling. An automatic filling machine is used to fill bottles with liquid detergent. A random sample of 20 bottles results in a sample variance of fill volume of s² = Assume that the fill volume is approximately normal. Compute a 95% upper confidence bound. A 95% upper confidence bound is found from Equation 8-10 as follows: A confidence interval on the standard deviation σ can be obtained by taking the square root on both sides, resulting in
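A one-sided upper bound uses the full α in a single tail: σ² ≤ (n − 1)s² / χ²_{1−α,n−1}. The sketch below assumes that standard form for the bound. The sample variance of 0.0153 is an assumed value, since the transcript elides it; n = 20 comes from the example, and 10.117 is the tabulated chi-square value for 19 degrees of freedom with 0.95 of the area to its right.

```python
import math

def variance_upper_bound(s2, n, chi2_lower_tail):
    """100(1 - alpha)% upper confidence bound on sigma^2.

    chi2_lower_tail = chi^2_{1-alpha, n-1}, read from a chi-square table.
    """
    return (n - 1) * s2 / chi2_lower_tail

# s2 is assumed for illustration; 10.117 is chi^2_{0.95, 19}.
var_ub = variance_upper_bound(s2=0.0153, n=20, chi2_lower_tail=10.117)
sd_ub = math.sqrt(var_ub)  # bound on sigma: square root of the bound on sigma^2
print(f"sigma^2 <= {var_ub:.4f}, sigma <= {sd_ub:.2f}")
```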

8-6 Tolerance and Prediction Intervals. 8-6.1 Prediction Interval for Future Observation. A 100(1 − α)% prediction interval (PI) on a single future observation from a normal distribution is given by $\bar{x} - t_{\alpha/2,n-1}\,s\sqrt{1 + 1/n} \le X_{n+1} \le \bar{x} + t_{\alpha/2,n-1}\,s\sqrt{1 + 1/n}$ (8-15). The prediction interval for X_{n+1} will always be longer than the confidence interval for μ. Sec 8-6 Tolerance & Prediction Intervals
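The prediction interval has the same center as the t-based confidence interval but multiplies s by √(1 + 1/n) instead of √(1/n), which accounts for the variability of the future observation itself. A hedged sketch follows. The function name and the summary statistics (x̄ = 13.71, s = 3.55) are assumptions; n = 22 matches the alloy adhesion example, and 2.080 is the tabulated t value for 0.025 and 21 degrees of freedom.

```python
import math

def prediction_interval(xbar, s, n, t_crit):
    """PI on one future observation from a normal population.

    t_crit is t_{alpha/2, n-1}, supplied from a t table.
    """
    half_width = t_crit * s * math.sqrt(1 + 1 / n)
    return xbar - half_width, xbar + half_width

# Assumed summary statistics, for illustration only.
pi_lo, pi_hi = prediction_interval(xbar=13.71, s=3.55, n=22, t_crit=2.080)
ci_half = 2.080 * 3.55 / math.sqrt(22)  # CI half-width for comparison
print(f"PI: {pi_lo:.2f} to {pi_hi:.2f}; CI half-width: {ci_half:.2f}")
```

As n grows, the confidence interval shrinks toward zero length, while the prediction interval approaches x̄ ± z s and never collapses.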

Repeated for comparison: Example 8-11 Alloy Adhesion. The load at failure for n = 22 specimens was observed, and found that and s = The 95% confidence interval on μ was ≤ μ ≤ Plan to test a 23rd specimen. A 95% prediction interval on the load at failure for this specimen is Interpretation: The prediction interval is considerably longer than the CI. This is because the CI is an estimate of a parameter, but the PI is an interval estimate of a single future observation. See next slide.

Example 8-6 Alloy Adhesion. Construct a 95% CI on μ to the following data. The sample mean is and sample standard deviation is s = Since n = 22, we have n − 1 = 21 degrees of freedom for t, so t_{0.025,21} = The resulting CI is Interpretation: The CI is fairly wide because there is a lot of variability in the measurements. A larger sample size would have led to a shorter interval.

Tolerance Interval for a Normal Distribution. A tolerance interval for capturing at least γ% of the values in a normal distribution with confidence level 100(1 − α)% is $(\bar{x} - ks,\ \bar{x} + ks)$, where k is a tolerance interval factor found in Appendix Table XII. Values are given for γ = 90%, 95%, and 99% and for 90%, 95%, and 99% confidence.
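Once k is read from a tolerance-factor table, the interval is simply x̄ ± ks. The sketch below uses assumed inputs: x̄ = 13.71 and s = 3.55 are illustrative, and k = 2.264 is taken as the table factor for n = 22, 90% coverage, and 95% confidence. The function name is hypothetical.

```python
def tolerance_interval(xbar, s, k):
    """Two-sided normal tolerance interval xbar +/- k*s.

    k depends on n, the coverage gamma, and the confidence level,
    and is looked up in a tolerance-factor table by the caller.
    """
    return xbar - k * s, xbar + k * s

# Assumed inputs for illustration.
tol_lo, tol_hi = tolerance_interval(xbar=13.71, s=3.55, k=2.264)
print(f"{tol_lo:.2f} to {tol_hi:.2f}")
```

With these inputs the lower limit rounds to 5.67, which agrees with the lower limit quoted in Example 8-12.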

Example 8-12 Alloy Adhesion. The load at failure for n = 22 specimens was observed, and found that and s = Find a tolerance interval for the load at failure that includes 90% of the values in the population with 95% confidence. From Appendix Table XII, the tolerance factor k for n = 22, γ = 0.90, and 95% confidence is k = The desired tolerance interval is Interpretation: We can be 95% confident that at least 90% of the values of load at failure for this particular alloy lie between 5.67 and

Important Terms & Concepts of Chapter 8 (Chapter 8 Summary)
- Chi-squared distribution
- Confidence coefficient
- Confidence interval
- Confidence interval for a: population proportion; mean of a normal distribution; variance of a normal distribution
- Confidence level
- Error in estimation
- Large sample confidence interval
- 1-sided confidence bounds
- Precision of parameter estimation
- Prediction interval
- Tolerance interval
- 2-sided confidence interval
- t distribution

Guidelines for Constructing Confidence Intervals

8-5 Guidelines for Constructing Confidence Intervals. Table 8-1 provides a simple road map for appropriate calculation of a confidence interval.

8-4 A Large-Sample Confidence Interval for a Population Proportion. Normal Approximation for Binomial Proportion: if n is large, the distribution of $Z = (\hat{P} - p)/\sqrt{p(1-p)/n}$ is approximately standard normal. The quantity $\sqrt{p(1-p)/n}$ is called the standard error of the point estimator $\hat{P}$. Sec 8-4 Large-Sample Confidence Interval for a Population Proportion

Approximate Confidence Interval on a Binomial Proportion. If p̂ is the proportion of observations in a random sample of size n that belong to a class of interest, an approximate 100(1 − α)% confidence interval on the proportion p of the population is $\hat{p} - z_{\alpha/2}\sqrt{\hat{p}(1-\hat{p})/n} \le p \le \hat{p} + z_{\alpha/2}\sqrt{\hat{p}(1-\hat{p})/n}$ (8-11), where $z_{\alpha/2}$ is the upper α/2 percentage point of the standard normal distribution.

Example 8-8 Crankshaft Bearings. In a random sample of 85 automobile engine crankshaft bearings, 10 have a surface finish that is rougher than the specifications allow. Construct a 95% two-sided confidence interval for p. A point estimate of the proportion of bearings in the population that exceeds the roughness specification is p̂ = x/n = 10/85 ≈ 0.12. A 95% two-sided confidence interval for p is computed from Equation 8-11 as Interpretation: This is a wide CI. Although the sample size does not appear to be small (n = 85), the value of p̂ is fairly small, which leads to a large standard error for p̂, contributing to the wide CI.
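The crankshaft bearing counts (10 rough bearings out of 85) are enough to evaluate the large-sample interval p̂ ± z_{α/2} √(p̂(1 − p̂)/n). The sketch below is one way to do the arithmetic; the function name is hypothetical, and the unrounded p̂ = 10/85 is used, so the limits may differ slightly in the last digit from a hand calculation that rounds p̂ to 0.12.

```python
import math
from statistics import NormalDist

def proportion_interval(successes, n, conf=0.95):
    """Large-sample (normal approximation) CI on a binomial proportion."""
    p_hat = successes / n
    z = NormalDist().inv_cdf(1 - (1 - conf) / 2)
    half_width = z * math.sqrt(p_hat * (1 - p_hat) / n)
    return p_hat, p_hat - half_width, p_hat + half_width

p_hat, lo, hi = proportion_interval(successes=10, n=85)
print(f"p_hat = {p_hat:.4f}, {lo:.4f} <= p <= {hi:.4f}")
```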

Choice of Sample Size. Sample size for a specified error on a binomial proportion: if we set $E = z_{\alpha/2}\sqrt{p(1-p)/n}$ and solve for n, the appropriate sample size is $n = (z_{\alpha/2}/E)^2\, p(1-p)$ (8-12). The sample size from Equation 8-12 will always be a maximum for p = 0.5 [that is, p(1 − p) ≤ 0.25 with equality for p = 0.5], and this can be used to obtain an upper bound on n: $n = (z_{\alpha/2}/E)^2\,(0.25)$ (8-13).

Example 8-9 Crankshaft Bearings. Consider the situation in Example 8-8. How large a sample is required if we want to be 95% confident that the error in using p̂ to estimate p is less than 0.05? Using p̂ = 10/85 ≈ 0.12 as an initial estimate of p, we find from Equation 8-12 that the required sample size is If we wanted to be at least 95% confident that our estimate of the true proportion p was within 0.05 regardless of the value of p, we would use Equation 8-13 to find the sample size Interpretation: If we have information concerning the value of p, either from a preliminary sample or from past experience, we could use a smaller sample while maintaining both the desired precision of estimation and the level of confidence.
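Both sample-size calculations can be sketched with one function, assuming the standard form n = (z_{α/2}/E)² p(1 − p), rounded up. The function name is hypothetical. The initial estimate 0.12 is 10/85 from Example 8-8, rounded to two decimals; the default of 0.5 gives the conservative bound that holds for any p.

```python
import math
from statistics import NormalDist

def sample_size_for_proportion(max_error, p_guess=0.5, conf=0.95):
    """Smallest n whose CI half-width is at most max_error, given a guess for p.

    With the default p_guess = 0.5 this is the worst-case (largest) n.
    """
    z = NormalDist().inv_cdf(1 - (1 - conf) / 2)
    return math.ceil((z / max_error) ** 2 * p_guess * (1 - p_guess))

n_with_estimate = sample_size_for_proportion(max_error=0.05, p_guess=0.12)
n_worst_case = sample_size_for_proportion(max_error=0.05)
print(n_with_estimate, n_worst_case)
```

The gap between the two results shows how much a preliminary estimate of p can reduce the sampling effort.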

Approximate One-Sided Confidence Bounds on a Binomial Proportion. The approximate 100(1 − α)% lower and upper confidence bounds are $\hat{p} - z_{\alpha}\sqrt{\hat{p}(1-\hat{p})/n} \le p$ and $p \le \hat{p} + z_{\alpha}\sqrt{\hat{p}(1-\hat{p})/n}$ (8-14), respectively.

Example 8-10 The Agresti-Coull CI on a Proportion. Reconsider the crankshaft bearing data introduced in Example 8-8. In that example we reported that. The 95% CI was. Construct the new Agresti-Coull CI. Interpretation: The two CIs would agree more closely if the sample size were larger.
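The transcript does not show the formula used on this slide. The sketch below assumes the score-type form that is often presented under the Agresti-Coull name, with limits [p̂ + z²/(2n) ± z √(p̂(1 − p̂)/n + z²/(4n²))] / (1 + z²/n). Other texts define the Agresti-Coull interval slightly differently, so treat this as one plausible reading. The function name is hypothetical; the counts 10 and 85 come from Example 8-8.

```python
import math
from statistics import NormalDist

def score_type_interval(successes, n, conf=0.95):
    """Score-type CI on a proportion (assumed form; see the lead-in text)."""
    p_hat = successes / n
    z = NormalDist().inv_cdf(1 - (1 - conf) / 2)
    center = p_hat + z ** 2 / (2 * n)
    spread = z * math.sqrt(p_hat * (1 - p_hat) / n + z ** 2 / (4 * n ** 2))
    denom = 1 + z ** 2 / n
    return (center - spread) / denom, (center + spread) / denom

ac_lo, ac_hi = score_type_interval(successes=10, n=85)
print(f"{ac_lo:.4f} <= p <= {ac_hi:.4f}")
```

Compared with the simple large-sample interval, both limits shift toward 0.5, and the correction terms in z²/n fade as n grows, which is why the two intervals agree more closely for larger samples.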

7: Normal Probability Distributions. Outline: Normal Distributions; Determining Normal Probabilities; Finding Values That Correspond to Normal Probabilities; Assessing Departures from Normality

§7.1: Normal Distributions. This pdf is the most popular distribution for continuous random variables. It was first described by de Moivre in 1733 and elaborated in 1812 by Laplace. It describes some natural phenomena and, more importantly, describes the sampling characteristics of totals and means.

Normal Probability Density Function. Recall: continuous random variables are described with probability density function (pdf) curves. Normal pdfs are recognized by their typical bell shape. Figure: Age distribution of a pediatric population with overlying Normal pdf.

Area Under the Curve. pdfs should be viewed almost like a histogram. Top figure: the darker bars of the histogram correspond to ages ≤ 9 (~40% of the distribution). Bottom figure: the shaded area under the curve (AUC) corresponds to ages ≤ 9 (~40% of the area).

Parameters μ and σ. Normal pdfs have two parameters: μ, the expected value (mean, "mu"), which controls location, and σ, the standard deviation ("sigma"), which controls spread.

Mean and Standard Deviation of Normal Density. Figure labels: μ, σ.

Standard Deviation σ. The points of inflection lie one σ below and one σ above μ. Practice sketching Normal curves: feel the inflection points (where the curve switches from bending downward to bending upward) and label the horizontal axis with σ landmarks.

Two types of means and standard deviations. The mean and standard deviation from the pdf (denoted μ and σ) are parameters. The mean and standard deviation from a sample ("xbar" and s) are statistics. Statistics and parameters are related, but are not the same thing!

68-95-99.7 Rule for Normal Distributions. 68% of the AUC lies within ±1σ of μ; 95% of the AUC lies within ±2σ of μ; 99.7% of the AUC lies within ±3σ of μ.

Example: 68-95-99.7 Rule. Wechsler adult intelligence scores are Normally distributed with μ = 100 and σ = 15; X ~ N(100, 15). 68% of scores fall within μ ± σ = 100 ± 15 = 85 to 115; 95% of scores fall within μ ± 2σ = 100 ± (2)(15) = 70 to 130; 99.7% of scores fall within μ ± 3σ = 100 ± (3)(15) = 55 to 145.
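The rule's percentages are rounded values of exact Normal areas, which can be checked directly. The sketch below uses Python's `statistics.NormalDist` with the Wechsler parameters from the example (μ = 100, σ = 15); the variable names are arbitrary.

```python
from statistics import NormalDist

iq = NormalDist(mu=100, sigma=15)

shares = {}
for k in (1, 2, 3):
    lo, hi = iq.mean - k * iq.stdev, iq.mean + k * iq.stdev
    shares[k] = iq.cdf(hi) - iq.cdf(lo)  # area within k standard deviations
    print(f"{lo:.0f} to {hi:.0f}: {shares[k]:.4f}")
```

The exact area within ±2σ is a little over 95%, which is why 1.96 rather than 2 appears as the critical value for 95% intervals.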

Symmetry in the Tails. Because the Normal curve is symmetrical and the total AUC is exactly 1, we can easily determine the AUC in the tails. Figure label: 95%.

Example: Male Height. Male height is Normal with μ = 70.0˝ and σ = 2.8˝. 68% of heights fall within μ ± σ = 70.0 ± 2.8 = 67.2˝ to 72.8˝; 32% fall in the tails (below 67.2˝ and above 72.8˝); by symmetry, 16% fall below 67.2˝ and 16% above 72.8˝.

Reexpression of Non-Normal Random Variables. Many variables are not Normal but can be reexpressed with a mathematical transformation to be Normal. Examples of mathematical transforms used for this purpose: logarithmic, exponential, square roots. Review logarithmic transformations…

§7.2: Determining Normal Probabilities. When values do not fall directly on σ landmarks: 1. State the problem 2. Standardize the value(s) (z score) 3. Sketch, label, and shade the curve 4. Use Table B

Step 1: State the Problem. What percentage of gestations are less than 40 weeks? Let X ≡ gestational length. We know from prior research: X ~ N(39, 2) weeks. Pr(X ≤ 40) = ?

Step 2: Standardize. Standard Normal variable ≡ "Z" ≡ a Normal random variable with μ = 0 and σ = 1, Z ~ N(0,1). Use Table B to look up cumulative probabilities for Z.

Example: A Z variable of 1.96 has cumulative probability 0.9750.

Step 2 (cont.) z-score = number of σ-units above (positive z) or below (negative z) the distribution mean μ. Turn the value into a z score: z = (x − μ)/σ. For the gestation problem, z = (40 − 39)/2 = 0.5.

Steps 3 & 4: Sketch & Table B. 3. Sketch. 4. Use Table B to look up Pr(Z ≤ 0.5) = 0.6915.
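The four steps can be mirrored in a few lines, with the cumulative distribution function standing in for the printed table. This is a sketch using the gestation values from the slides (μ = 39, σ = 2, x = 40); the variable names are arbitrary.

```python
from statistics import NormalDist

mu, sigma = 39, 2  # gestational length in weeks, X ~ N(39, 2)
x = 40

z = (x - mu) / sigma                       # Step 2: standardize
p_table = NormalDist().cdf(z)              # Step 4: cumulative probability for Z
p_direct = NormalDist(mu, sigma).cdf(x)    # same answer without standardizing

print(f"z = {z}, Pr(X <= {x}) = {p_table:.4f}")
```

Standardizing is only needed when working from a table of Z; software can evaluate the N(39, 2) distribution directly, and both routes give the same probability.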

Probabilities Between Points. a represents a lower boundary and b represents an upper boundary: Pr(a ≤ Z ≤ b) = Pr(Z ≤ b) − Pr(Z ≤ a)

Between Two Points. Pr(−2 ≤ Z ≤ 0.5) = Pr(Z ≤ 0.5) − Pr(Z ≤ −2) = .6915 − .0228 = .6687. See p. 144 in text.
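The subtraction of cumulative probabilities can be written as a small helper. The function name `prob_between` is hypothetical; the bounds −2 and 0.5 are the ones used on the slide.

```python
from statistics import NormalDist

def prob_between(a, b, dist=NormalDist()):
    """Pr(a <= Z <= b) as a difference of two cumulative probabilities."""
    return dist.cdf(b) - dist.cdf(a)

p = prob_between(-2, 0.5)
print(f"{p:.4f}")
```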

§7.3 Values Corresponding to Normal Probabilities. 1. State the problem 2. Find the Z-score corresponding to the percentile (Table B) 3. Sketch 4. Unstandardize: x = μ + z_p σ

z percentiles. z_p ≡ the Normal z variable with cumulative probability p. Use Table B to look up the value of z_p: look inside the table for the closest cumulative probability entry, then trace the z score to its row and column.

Notation: let z_p represent the z score with cumulative probability p, e.g., z.975 = 1.96. For example, what is the 97.5th percentile on the Standard Normal curve? z.975 = 1.96

Step 1: State Problem. Question: What gestational length is smaller than 97.5% of gestations? Let X represent gestational length. We know from prior research that X ~ N(39, 2). A value that is smaller than .975 of gestations has a cumulative probability of .025.

Step 2 (z percentile). Less than 97.5% (right tail) = greater than 2.5% (left tail). z lookup: z.025 = −1.96

Unstandardize and sketch. x = μ + z σ = 39 + (−1.96)(2) = 35.08 ≈ 35. The 2.5th percentile is 35 weeks.
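Going from a probability back to a value reverses the earlier steps: look up z_p, then unstandardize. A brief sketch with the gestation parameters from the slides follows; `inv_cdf` plays the role of reading Table B from the inside out, and the variable names are arbitrary.

```python
from statistics import NormalDist

mu, sigma = 39, 2  # X ~ N(39, 2), gestational length in weeks

z_025 = NormalDist().inv_cdf(0.025)           # z percentile lookup
x_025 = mu + z_025 * sigma                    # unstandardize: x = mu + z*sigma
x_check = NormalDist(mu, sigma).inv_cdf(0.025)  # same value computed directly

print(f"z.025 = {z_025:.2f}, 2.5th percentile = {x_025:.2f} weeks")
```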

Example: A Z variable of 1.96 has cumulative probability of …… z lookup: z.025 = What is the 97.5 th percentile on the Standard Normal curve? z.975 =

7: Normal Probability Distributions106 Between Two Points Q 2:A Z variable of 1.96 has cumulative probability of …… Q3: Find P(Z less than 1.96) Q1: z lookup: z.025 = q2 What is the 97.5 th percentile on the Standard Normal curve? z.975 = Find P(Z greater than 1.96)