Binomial Distribution and Applications. Binomial Probability Distribution A binomial random variable X is defined to the number of “successes” in n independent.

Slides:



Advertisements
Similar presentations
Lesson Objective Be able to calculate probabilities for Binomial situations Begin to recognise the conditions necessary for a Random variable to have a.
Advertisements

Normal Approximation of the Binomial Distribution.
CHAPTER 13: Binomial Distributions
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 6: Random Variables Section 6.3 Binomial and Geometric Random Variables.
Nonparametric Inference
Probability Distributions Finite Random Variables.
7-2 Estimating a Population Proportion
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Slide 1 Statistics Workshop Tutorial 7 Discrete Random Variables Binomial Distributions.
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Sample Size Determination Ziad Taib March 7, 2014.
Objectives (BPS chapter 13) Binomial distributions  The binomial setting and binomial distributions  Binomial distributions in statistical sampling 
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 5.2.
Copyright © Cengage Learning. All rights reserved. 3.4 The Binomial Probability Distribution.
Nonparametric Inference
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
Inference for a Single Population Proportion (p).
1 Lecture 5 Binomial Random Variables Many experiments are like tossing a coin a fixed number of times and recording the up-face. * The two possible outcomes.
Evidence Based Medicine
Binomial Distributions Calculating the Probability of Success.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Review and Preview This chapter combines the methods of descriptive statistics presented in.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 1 – Slide 1 of 34 Chapter 11 Section 1 Random Variables.
Section 6.2: How Can We Find Probabilities When Each Observation Has Two Possible Outcomes? 1.
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Binomial Experiments Section 4-3 & Section 4-4 M A R I O F. T R I O L A Copyright.
Bernoulli Trials Two Possible Outcomes –Success, with probability p –Failure, with probability q = 1  p Trials are independent.
BINOMIALDISTRIBUTION AND ITS APPLICATION. Binomial Distribution  The binomial probability density function –f(x) = n C x p x q n-x for x=0,1,2,3…,n for.
Introduction to Behavioral Statistics Probability, The Binomial Distribution and the Normal Curve.
CHAPTER Discrete Models  G eneral distributions  C lassical: Binomial, Poisson, etc Continuous Models  G eneral distributions 
Binomial Experiment A binomial experiment (also known as a Bernoulli trial) is a statistical experiment that has the following properties:
Binomial Probability Distribution
Chapter 4. Discrete Random Variables A random variable is a way of recording a quantitative variable of a random experiment. A variable which can take.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7-1 Review and Preview.
Probability Lecture 2. Probability Why did we spend last class talking about probability? How do we use this?
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 6 Section 2 – Slide 1 of 31 Chapter 6 Section 2 The Binomial Probability Distribution.
Remember Playing perfect black jack – the probability of winning a hand is.498 What is the probability that you will win 8 of the next 10 games of blackjack?
Chapter 3 Discrete Random Variables and Probability Distributions  Random Variables.2 - Probability Distributions for Discrete Random Variables.3.
P. 403 – 404 #71 – 73, 75 – 78, 80, 82, 84 #72B: Binary? Yes – Success is a person is left-handed. I: Independent? Yes, since students are selected randomly,
Section 5-3 Binomial Probability Distributions. Binomial Probability Distribution A binomial probability distribution results from a procedure that meets.
Comparing the Means of Two Dependent Populations.
1 Binomial Random Variables Lecture 5  Many experiments are like tossing a coin a fixed number of times and recording the up-face.  The two possible.
The Binomial Probability Distribution. ● A binomial experiment has the following structure  The first test is performed … the result is either a success.
Copyright © Cengage Learning. All rights reserved. 3 Discrete Random Variables and Probability Distributions.
Copyright © Cengage Learning. All rights reserved. 3 Discrete Random Variables and Probability Distributions.
Section 10.2: Tests of Significance Hypothesis Testing Null and Alternative Hypothesis P-value Statistically Significant.
Chapter 6: Random Variables
Binomial Distributions Chapter 5.3 – Probability Distributions and Predictions Mathematics of Data Management (Nelson) MDM 4U Authors: Gary Greer (with.
6.3 Binomial and Geometric Random Variables
Section 6.3 Geometric Random Variables. Binomial and Geometric Random Variables Geometric Settings In a binomial setting, the number of trials n is fixed.
13 Nonparametric Methods Introduction So far the underlying probability distribution functions (pdf) are assumed to be known, such as SND, t-distribution,
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 5 Probability Distributions 5-1 Overview 5-2 Random Variables 5-3 Binomial Probability Distributions.
Probability Distributions ( 확률분포 ) Chapter 5. 2 모든 가능한 ( 확률 ) 변수의 값에 대해 확률을 할당하는 체계 X 가 1, 2, …, 6 의 값을 가진다면 이 6 개 변수 값에 확률을 할당하는 함수 Definition.
Binomial Distribution
Inference for a Single Population Proportion (p)
Discrete Random Variables
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Binomial and Geometric Random Variables
CHAPTER 14: Binomial Distributions*
Sampling Distributions
Sampling Distributions
Chapter 5 Sampling Distributions
Elementary Statistics
Chapter 11 Goodness-of-Fit and Contingency Tables
Discrete Random Variables and Probability Distributions
Section 10.2: Tests of Significance
Binomial Distribution
Random Variables Binomial Distributions
Chapter 5 Sampling Distributions
Review of the Binomial Distribution
Binomial Distribution
Presentation transcript:

Binomial Distribution and Applications

Binomial Probability Distribution A binomial random variable X is defined to the number of “successes” in n independent trials where the P(“success”) = p is constant. Notation: X ~ BIN(n,p) In the definition above notice the following conditions need to be satisfied for a binomial experiment: 1. There is a fixed number of n trials carried out. 2. The outcome of a given trial is either a “success” or “failure”. 3. The probability of success (p) remains constant from trial to trial. 4. The trials are independent, the outcome of a trial is not affected by the outcome of any other trial.

Binomial Distribution Binomial Distribution If X ~ BIN(n, p), then If X ~ BIN(n, p), then where where

Binomial Distribution Binomial Distribution If X ~ BIN(n, p), then If X ~ BIN(n, p), then E.g. when n = 3 and p =.50 there are 8 possible equally likely outcomes (e.g. flipping a coin) E.g. when n = 3 and p =.50 there are 8 possible equally likely outcomes (e.g. flipping a coin) SSS SSF SFS FSS SFF FSF FFS FFF SSS SSF SFS FSS SFF FSF FFS FFF X=3 X=2 X=2 X=2 X=1 X=1 X=1 X=0 X=3 X=2 X=2 X=2 X=1 X=1 X=1 X=0 P(X=3)=1/8, P(X=2)=3/8, P(X=1)=3/8, P(X=0)=1/8 P(X=3)=1/8, P(X=2)=3/8, P(X=1)=3/8, P(X=0)=1/8 Now let’s use binomial probability formula instead… Now let’s use binomial probability formula instead…

Binomial Distribution Binomial Distribution If X ~ BIN(n, p), then If X ~ BIN(n, p), then E.g. when n = 3, p =.50 find P(X = 2) E.g. when n = 3, p =.50 find P(X = 2) SSF SFS FSS

Example: Treatment of Kidney Cancer Suppose we have n = 40 patients who will be receiving an experimental therapy which is believed to be better than current treatments which historically have had a 5-year survival rate of 20%, i.e. the probability of 5-year survival is p =.20. Suppose we have n = 40 patients who will be receiving an experimental therapy which is believed to be better than current treatments which historically have had a 5-year survival rate of 20%, i.e. the probability of 5-year survival is p =.20. Thus the number of patients out of 40 in our study surviving at least 5 years has a binomial distribution, i.e. X ~ BIN(40,.20). Thus the number of patients out of 40 in our study surviving at least 5 years has a binomial distribution, i.e. X ~ BIN(40,.20).

Results and “The Question” Suppose that using the new treatment we find that 16 out of the 40 patients survive at least 5 years past diagnosis. Suppose that using the new treatment we find that 16 out of the 40 patients survive at least 5 years past diagnosis. Q: Does this result suggest that the new therapy has a better 5-year survival rate than the current, i.e. is the probability that a patient survives at least 5 years greater than.20 or a 20% chance when treated using the new therapy? Q: Does this result suggest that the new therapy has a better 5-year survival rate than the current, i.e. is the probability that a patient survives at least 5 years greater than.20 or a 20% chance when treated using the new therapy?

What do we consider in answering the question of interest? We essentially ask ourselves the following: If we assume that new therapy is no better than the current what is the probability we would see these results by chance variation alone? If we assume that new therapy is no better than the current what is the probability we would see these results by chance variation alone? More specifically what is the probability of seeing 16 or more successes out of 40 if the success rate of the new therapy is.20 or 20% as well? More specifically what is the probability of seeing 16 or more successes out of 40 if the success rate of the new therapy is.20 or 20% as well?

Connection to Binomial This is a binomial experiment situation… There are n = 40 patients and we are counting the number of patients that survive 5 or more years. The individual patient outcomes are independent and IF WE ASSUME the new method is NOT better then the probability of success is p =.20 or 20% for all patients. This is a binomial experiment situation… There are n = 40 patients and we are counting the number of patients that survive 5 or more years. The individual patient outcomes are independent and IF WE ASSUME the new method is NOT better then the probability of success is p =.20 or 20% for all patients. So X = # of “successes” in the clinical trial is binomial with n = 40 and p =.20, i.e. X ~ BIN(40,.20) So X = # of “successes” in the clinical trial is binomial with n = 40 and p =.20, i.e. X ~ BIN(40,.20)

Example: Treatment of Kidney Cancer X ~ BIN(40,.20), find the probability that exactly 16 patients survive at least 5 years. X ~ BIN(40,.20), find the probability that exactly 16 patients survive at least 5 years. This requires some calculator gymnastics and some scratchwork! This requires some calculator gymnastics and some scratchwork! Also, keep in mind we need to find the probability of having 16 or more patients surviving at least 5 yrs. Also, keep in mind we need to find the probability of having 16 or more patients surviving at least 5 yrs.

Example: Treatment of Kidney Cancer So we actually need to find: So we actually need to find: P(X > 16) = P(X = 16) + P(X = 17) + … + P(X = 40) +…+ = YIPES! = YIPES!

Example: Treatment of Kidney Cancer X ~ BIN(40,.20), find the probability that 16 or more patients survive at least 5 years. X ~ BIN(40,.20), find the probability that 16 or more patients survive at least 5 years. USE COMPUTER! USE COMPUTER! Binomial Probability calculator in JMP Binomial Probability calculator in JMP probabilities are computed automatically for greater than or equal to and less than or equal to x. Enter n = sample size x = observed # of “successes” p = probability of “success”

Example: Treatment of Kidney Cancer X ~ BIN(40,.20), find the probability that 16 or more patients survive at least 5 years. X ~ BIN(40,.20), find the probability that 16 or more patients survive at least 5 years. USE COMPUTER! USE COMPUTER! Binomial Probability calculator in JMP Binomial Probability calculator in JMP The chance that we would see 16 or more patients out of 40 surviving at least 5 years if the new method has the same chance of success as the current methods (20%) is VERY SMALL,.0029!!!! P(X > 16) =

Conclusion Because it is high unlikely (p =.0029) that we would see this many successes in a group 40 patients if the new method had the same probability of success as the current method we have to make a choice, either … Because it is high unlikely (p =.0029) that we would see this many successes in a group 40 patients if the new method had the same probability of success as the current method we have to make a choice, either … A) we have obtained a very rare result by dumb luck. OR B) our assumption about the success rate of the new method is wrong and in actuality the new method has a better than 20% 5-year survival rate making the observed result more plausible.

Sign Test The sign test can be used in place of the paired t- test when we have evidence that the paired differences are NOT normally distributed. The sign test can be used in place of the paired t- test when we have evidence that the paired differences are NOT normally distributed. It can be used when the response is ordinal. It can be used when the response is ordinal. Best used when the response is difficult to quantify and only improvement can be measured, i.e. subject got better, got worse, or no change. Best used when the response is difficult to quantify and only improvement can be measured, i.e. subject got better, got worse, or no change. Magnitude of the paired difference is lost when using this test. Magnitude of the paired difference is lost when using this test.

Example: Sign Test A study evaluated hepatic arterial infusion of floxuridine and cisplatin for the treatment of liver metastases of colorectral cancer. A study evaluated hepatic arterial infusion of floxuridine and cisplatin for the treatment of liver metastases of colorectral cancer. Performance scores for 29 patients were recorded before and after infusion. Performance scores for 29 patients were recorded before and after infusion. Is there evidence that patients had a better performance score after infusion? Is there evidence that patients had a better performance score after infusion?

Example: Sign Test Patient Before (B) Infusion After (A) Infusion Difference (A – B) Patient Before (B) Infusion After (A) Infusion Difference (A – B)

Sign Test The sign test looks at the number of (+) and (-) differences amongst the nonzero paired differences. The sign test looks at the number of (+) and (-) differences amongst the nonzero paired differences. A preponderance of +’s or –’s can indicate that some type of change has occurred. A preponderance of +’s or –’s can indicate that some type of change has occurred. If in reality there is no change as a result of infusion we expect +’s and –’s to be equally likely to occur, i.e. P(+) = P(-) =.50 and the number of each observed follows a binomial distribution. If in reality there is no change as a result of infusion we expect +’s and –’s to be equally likely to occur, i.e. P(+) = P(-) =.50 and the number of each observed follows a binomial distribution.

Example: Sign Test Given these results do we have evidence that performance scores of patients generally improves following infusion? Given these results do we have evidence that performance scores of patients generally improves following infusion? Need to look at how likely the observed results are to be produced by chance variation alone. Need to look at how likely the observed results are to be produced by chance variation alone.

Example: Sign Test Patient Before (B) Infusion After (A) Infusion Difference (A – B) Patient Before (B) Infusion After (A) Infusion Difference (A – B) nonzeros differences, 11 +’s 7 –’s - -

Example: Sign Test If there is truly no change in performance as a result of infusion the number of +’s has a binomial distribution with n = 18 and p = P(+) =.50. If there is truly no change in performance as a result of infusion the number of +’s has a binomial distribution with n = 18 and p = P(+) =.50. We have observed 11 +’s amongst the 18 non-zero performance differences. We have observed 11 +’s amongst the 18 non-zero performance differences. How likely are we to see 11 or more +’s out 18? How likely are we to see 11 or more +’s out 18? P(X > 11) =.2403 for a binomial n = 18, p =.50 P(X > 11) =.2403 for a binomial n = 18, p =.50 There is 24.03% chance we would see this many improvements by dumb luck alone, therefore we are not convinced that infusion leads to improvement (Remember less than.05 or a 5% chance is what we are looking for “statistical significance”) There is 24.03% chance we would see this many improvements by dumb luck alone, therefore we are not convinced that infusion leads to improvement (Remember less than.05 or a 5% chance is what we are looking for “statistical significance”)

Example 2: Sign Test Resting Energy Expenditure (REE) for Patient with Cystic Fibrosis A researcher believes that patients with cystic fibrosis (CF) expend greater energy during resting than those without CF. To obtain a fair comparison she matches 13 patients with CF to 13 patients without CF on the basis of age, sex, height, and weight. She then measured there REE for each pair of subjects and compared the results. A researcher believes that patients with cystic fibrosis (CF) expend greater energy during resting than those without CF. To obtain a fair comparison she matches 13 patients with CF to 13 patients without CF on the basis of age, sex, height, and weight. She then measured there REE for each pair of subjects and compared the results.

Example 2: Sign Test There are 11 +’s & 2 –’s out of n = 13 paired differences.

Example 2: Sign Test The probability of seeing this many +’s is small. We conclude that when comparing individuals with cystic fibrosis to healthy individuals of the same gender and size that in general those with CF have larger resting energy expenditure (REE) (p =.0112).