Presentation is loading. Please wait.

Presentation is loading. Please wait.

The Role of Probability Chapter 5. Objectives Understand probability as it pertains to statistical inference Understand the concepts “at random” and “equally.

Similar presentations


Presentation on theme: "The Role of Probability Chapter 5. Objectives Understand probability as it pertains to statistical inference Understand the concepts “at random” and “equally."— Presentation transcript:

1 The Role of Probability Chapter 5

2 Objectives Understand probability as it pertains to statistical inference Understand the concepts “at random” and “equally likely” Understand the attributes and applications of the binomial distribution Understand the attributes and applications of the normal distribution

3 Objectives Understand and apply the results of the Central Limit Theorem

4 Sampling: Population Size=N, Sample Size=n Simple random sample –Enumerate all members of population N (sampling frame), select n individuals at random (each has same probability of being selected) Systematic sample –Start with sampling frame; determine sampling interval (N/n); select first person at random from first (N/n) and every (N/n) thereafter

5 Sampling: Population Size=N, Sample Size=n Stratified sample –Organize population into mutually exclusive strata; select individuals at random within each stratum Convenience sample –Non-probability sample (not for inference) Quota sample –Select a pre-determined number of individuals into sample from groups of interest

6 Basics Probability reflects the likelihood that outcome will occur 0 < Probability < 1

7 Example 5.1. Basic Probability Age5678910Total Boys4323795014104204182560 Girls4085134124364615002730 Total8408929138468819185290 P(Select any child) = 1/5290 = 0.0002

8 Example 5.1. Basic Probability P(Select a boy) = 2560/5290 = 0.484 P(Select boy age 10) = 418/5290 = 0.079 P(Select child at least 8 years of age) = (846+881+918)/5290 = 2645/5290 = 0.500

9 Conditional Probability Probability of outcome in a specific sub- population Example 5.1, P(Select 9 year old from among girls) = P(Select 9 year old|girl) = 461/2730 = 0.169 P(Select boy|6 years of age) = 379/892=0.425

10 Example 5.2. Conditional Probability Prostate Cancer No Prostate Cancer Total Low PSA36164 Moderate PSA132841 High PSA12315 Total2892120

11 Example 5.2. Conditional Probability P(Prostate Cancer|Low PSA) = 3/64 = 0.047 P(Prostate Cancer|Moderate PSA) = 13/41 = 0.317 P(Prostate Cancer|High PSA) = 12/15 = 0.80

12 Sensitivity and Specificity Sensitivity = true positive fraction = P(test +|disease) Specificity = true negative fraction = P(test -|disease free) False negative fraction=P(test -|disease) False positive fraction=P(test +|disease free)

13 Example 5.4. Sensitivity and Specificity Affected Fetus Unaffected Fetus Total Positive Screen 9351360 Negative Screen 144494450 Total1048004810

14 Sensitivity and Specificity Sensitivity = P(test +|disease) =9/10=0.90 Specificity = P(test -|disease free) = 4449/4800 = 0.927 False negative fraction= P(test -|disease) = 1/10 = 0.10 False positive fraction=P(test +|disease free) = 351/4800 = 0.073

15 Independence Two events, A and B, are independent if P(A|B) = P(A) or if P(B|A) = P(B) Example 5.2. Is screening test independent of prostate cancer diagnosis? –P(Prostate Cancer) = 28/120 = 0.023 –P(Prostate Cancer|Low PSA) = 0.047 –P(Prostate Cancer|Moderate PSA) = 0.317 –P(Prostate Cancer|High PSA) = 0.80

16 Bayes Theorem

17 Example 5.8. Bayes Theorem P(disease) = 0.002 Sensitivity = 0.85 = P(test +|disease) P(test +)=0.08 and P(test -) = 0.92 What is P(disease|test +)?

18 Binomial Distribution Model for discrete outcome Process or experiment has 2 possible outcomes: success and failure Replications of process are independent P(success) is constant for each replication

19 Binomial Distribution Notation: n=number of times process is replicated, p=P(success), x=number of successes of interest 0< x<n

20 Example 5.9. Binomial Distribution Medication for allergies is effective in reducing symptoms in 80% of patients. If medication is given to 10 patients, what is the probability it is effective in 7? = 120(0.2097)(0.008) = 0.2013

21 Normal Distribution Model for continuous outcome Mean=median=mode

22 Normal Distribution Notation:  =mean and  =standard deviation 

23 Example 5.11. Normal Distribution Body mass index (BMI) for men age 60 is normally distributed with a mean of 29 and standard deviation of 6? What is the probability that a male has BMI less than 35?

24 Example 5.11. Normal Distribution 11 17 23 29 35 41 47 P(X<35)=?

25 Standard Normal Distribution Z Normal distribution with  =0 and  =1 -3 -2 -1 0 1 2 3

26 Example 5.11. Normal Distribution 11 17 23 29 35 41 47 P(X<35)= P(Z<1) = ?

27 Example 5.11. Normal Distribution P(X<35) = P(Z<1). Using Table 1, P(Z<1.00) = 0.8413 Table 1. Probabilities of Z Table entries represent P(Z < Zi) Zi.00.01.02.03.04 … 0.0 0.5000 0.5040 0.5080 0.5120 0.5160 … 0.1 0.5398 0.5438 0.5478 0.5517 0.5557 …. 1.0 0.8413 0.8438 0.8461 0.8485 0.8508 …

28 Example 5.11. Normal Distribution 11 17 23 29 35 41 47 P(X<30)=? What is the probability that a male has BMI less than 30?

29 Example 5.11. Normal Distribution P(X<30)= P(Z<0.17) = 0.5675

30 Percentiles of the Normal Distribution A percentile is a value that holds a specified percentage of the distribution below it. The median is the 50 th percentile, Q 1 is the 25 th percentile and Q 3 is the 75 th percentile.

31 Percentiles of the Normal Distribution Percentiles are determined by: x =  + Z  where z is the desired percentile from the standard normal distribution (See Table 1A)

32 Example 5.12. Percentiles of the Normal Distribution BMI in men follows a normal distribution with  =29,  =6. BMI in women follows a normal distribution with  =28,  =7. The 90 th percentile of BMI for men: X = 29 + 1.282 (6) = 36.69. The 90 th percentile of BMI for women: X = 28 + 1.282 (7) = 36.97.

33 Central Limit Theorem Suppose we have a population with known mean  and standard deviation . If we take simple random samples of size n with replacement, then for large n, the sampling distribution of the sample means is approximately normal with mean and standard deviation

34 Application Non-normal population Take samples of size n – as long as n is sufficiently large (usually n > 30 suffices) The distribution of the sample mean is approximately normal, therefore can use Z to compute probabilities

35 Example 5.18. Central Limit Theorem HDL cholesterol has a mean of 54 and standard deviation of 17 in patients over 50. A physician has 40 patients over age 50 and wants to know the probability that their mean cholesterol is above 60.

36 Example 5.18. Central Limit Theorem


Download ppt "The Role of Probability Chapter 5. Objectives Understand probability as it pertains to statistical inference Understand the concepts “at random” and “equally."

Similar presentations


Ads by Google