Hypergeometric Distribution

Slides:



Advertisements
Similar presentations
Discrete Random Variables and Probability Distributions
Advertisements

Special random variables Chapter 5 Some discrete or continuous probability distributions.
MOMENT GENERATING FUNCTION AND STATISTICAL DISTRIBUTIONS
Discrete Uniform Distribution
ฟังก์ชั่นการแจกแจงความน่าจะเป็น แบบไม่ต่อเนื่อง Discrete Probability Distributions.
Probability Distribution
Discrete Random Variables and Probability Distributions
Probability Distributions
1 Engineering Computation Part 5. 2 Some Concepts Previous to Probability RANDOM EXPERIMENT A random experiment or trial can be thought of as any activity.
A random variable that has the following pmf is said to be a binomial random variable with parameters n, p The Binomial random variable.
Class notes for ISE 201 San Jose State University
Stat 321- Day 13. Last Time – Binomial vs. Negative Binomial Binomial random variable P(X=x)=C(n,x)p x (1-p) n-x  X = number of successes in n independent.
Chapter 21 Random Variables Discrete: Bernoulli, Binomial, Geometric, Poisson Continuous: Uniform, Exponential, Gamma, Normal Expectation & Variance, Joint.
Lesson 6 – 2b Hyper-Geometric Probability Distribution.
Section 15.8 The Binomial Distribution. A binomial distribution is a discrete distribution defined by two parameters: The number of trials, n The probability.
Approximation and Nested Problem. Four players are playing a poker game out of a deck of 52 cards. Each player has 13 cards. Let X be the number of Kings.
The Negative Binomial Distribution An experiment is called a negative binomial experiment if it satisfies the following conditions: 1.The experiment of.
Random Variables Section 3.1 A Random Variable: is a function on the outcomes of an experiment; i.e. a function on outcomes in S. For discrete random variables,
4.5 Comparing Discrete Probability Distributions.
Random Variables. A random variable X is a real valued function defined on the sample space, X : S  R. The set { s  S : X ( s )  [ a, b ] is an event}.
CHAPTER Discrete Models  G eneral distributions  C lassical: Binomial, Poisson, etc Continuous Models  G eneral distributions 
Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 5 Discrete Random Variables.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 5 Discrete Random Variables.
1 Topic 3 - Discrete distributions Basics of discrete distributions Mean and variance of a discrete distribution Binomial distribution Poisson distribution.
Math b (Discrete) Random Variables, Binomial Distribution.
STA347 - week 31 Random Variables Example: We roll a fair die 6 times. Suppose we are interested in the number of 5’s in the 6 rolls. Let X = number of.
Methodology Solving problems with known distributions 1.
Ch. 15H continued. * -applied to experiments with replacement ONLY(therefore…..independent events only) * -Note: For DEPENDENT events we use the “hypergeometric.
Exam 2: Rules Section 2.1 Bring a cheat sheet. One page 2 sides. Bring a calculator. Bring your book to use the tables in the back.
Topic 3 - Discrete distributions Basics of discrete distributions - pages Mean and variance of a discrete distribution - pages ,
Chapter 3 Discrete Random Variables and Probability Distributions  Random Variables.2 - Probability Distributions for Discrete Random Variables.3.
Random Variables Example:
DISCRETE PROBABILITY MODELS
LECTURE 18 TUESDAY, 27 OCTOBER STA 291 Fall
Chapter 3 Discrete Random Variables and Probability Distributions  Random Variables.2 - Probability Distributions for Discrete Random Variables.3.
Chapter 5 Joint Probability Distributions and Random Samples  Jointly Distributed Random Variables.2 - Expected Values, Covariance, and Correlation.3.
1 Chapter 8 Random Variables and Probability Distributions IRandom Sampling A.Population 1.Population element 2.Sampling with and without replacement.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 5 Discrete Random Variables.
Engineering Probability and Statistics - SE-205 -Chap 3 By S. O. Duffuaa.
Chap 5-1 Chapter 5 Discrete Random Variables and Probability Distributions Statistics for Business and Economics 6 th Edition.
Unit 3: Probability.  You will need to be able to describe how you will perform a simulation  Create a correspondence between random numbers and outcomes.
3.1 Discrete Random Variables Present the analysis of several random experiments Discuss several discrete random variables that frequently arise in applications.
Chapter 3 Discrete Random Variables and Probability Distributions
MAT 446 Supplementary Note for Ch 3
Ch3.5 Hypergeometric Distribution
Math 4030 – 4a More Discrete Distributions
Supplemental Lecture Notes
Discrete Random Variables
Engineering Probability and Statistics - SE-205 -Chap 3
Chapter 5 Joint Probability Distributions and Random Samples
Discrete random variable X Examples: shoe size, dosage (mg), # cells,…
Discrete Random Variables
Chapter 3 Discrete Random Variables and Probability Distributions
Chapter 4 Continuous Random Variables and Probability Distributions
The Bernoulli distribution
Chapter 3 Discrete Random Variables and Probability Distributions
Chapter 3 Discrete Random Variables and Probability Distributions
ASV Chapters 1 - Sample Spaces and Probabilities
Discrete random variable X Examples: shoe size, dosage (mg), # cells,…
Chapter 3 Discrete Random Variables and Probability Distributions
Some Discrete Probability Distributions
ASV Chapters 1 - Sample Spaces and Probabilities
Random Variables Binomial Distributions
Chapter 3 Discrete Random Variables and Probability Distributions
Bernoulli Trials Two Possible Outcomes Trials are independent.
Known Probability Distributions
Each Distribution for Random Variables Has:
Random Variables Binomial and Hypergeometric Probabilities
Known Probability Distributions
Presentation transcript:

Hypergeometric Distribution Random Sample n = 5 (w/ or w/o replacement) Infinite Population of decks of 52 cards (Assume each is fair) Random Variable X = # Spades in sample (x = 0, 1, 2, 3, 4, 5) Calculate P(X = 2) Binomial distribution  = P(Spades) = 13/52 Outcomes of “X = 2” (0, 0, 0, 1, 1) (0, 0, 1, 0, 1) (0, 0, 1, 1, 0) (0, 1, 0, 0, 1) (0, 1, 0, 1, 0) (0, 1, 1, 0, 0) (1, 0, 0, 0, 1) (1, 0, 0, 1, 0) (1, 0, 1, 0, 0) (1, 1, 0, 0, 0) Binary r.v. 10 combinations

Hypergeometric Distribution Random Sample n = 5 (w/ or w/o replacement) Random Sample n = 5 (w/ replacement) Infinite Population of decks of 52 cards (Assume each is fair) Finite Population N = 52 (Assume fair) Random Variable X = # Spades in sample (x = 0, 1, 2, 3, 4, 5) Calculate P(X = 2) Binomial distribution  = P(Spades) = 13/52 Outcomes of “X = 2” (0, 0, 0, 1, 1) (0, 0, 1, 0, 1) (0, 0, 1, 1, 0) (0, 1, 0, 0, 1) (0, 1, 0, 1, 0) (0, 1, 1, 0, 0) (1, 0, 0, 0, 1) (1, 0, 0, 1, 0) (1, 0, 1, 0, 0) (1, 1, 0, 0, 0) Binary r.v. 10 combinations

Hypergeometric Distribution Random Sample n = 5 (w/o replacement) Random Sample n = 5 (w/ replacement) Finite Population N = 52 (Assume fair) Random Variable X = # Spades in sample (x = 0, 1, 2, 3, 4, 5) Calculate P(X = 2) Binomial distribution  = P(Spades) = 13/52 Outcomes of “X = 2” (0, 0, 0, 1, 1) (0, 0, 1, 0, 1) (0, 0, 1, 1, 0) (0, 1, 0, 0, 1) (0, 1, 0, 1, 0) (0, 1, 1, 0, 0) (1, 0, 0, 0, 1) (1, 0, 0, 1, 0) (1, 0, 1, 0, 0) (1, 1, 0, 0, 0) Binary r.v. 10 combinations

Hypergeometric Distribution Random Sample n = 5 (w/o replacement) Finite Population N = 52 (Assume fair) Random Variable X = # Spades in sample (x = 0, 1, 2, 3, 4, 5) Calculate P(X = 2) Binomial distribution  = P(Spades) = 13/52 Outcomes of “X = 2” (0, 0, 0, 1, 1) (0, 0, 1, 0, 1) (0, 0, 1, 1, 0) (0, 1, 0, 0, 1) (0, 1, 0, 1, 0) (0, 1, 1, 0, 0) (1, 0, 0, 0, 1) (1, 0, 0, 1, 0) (1, 0, 1, 0, 0) (1, 1, 0, 0, 0) Binary r.v. ? ? ? ? ? ? = 0.27428

Hypergeometric Distribution Random Sample n = 5 (w/o replacement) Finite Population N = 52 (Assume fair) Random Variable X = # Spades in sample (x = 0, 1, 2, 3, 4, 5) Calculate P(X = 2) # combinations of 2 Spades from 13 Spades # combinations of 3 Non-Spades from 39 Non-Spades  Sample Space All combinations of 5 cards from 52 = 0.27428 # R command: dhyper(2, 13, 39, 5) x, s, N – s, n

Hypergeometric Distribution Random Sample (without replacement) of size n  N/10 Finite Population of size N s = # Successes Discrete random variable X = # Successes in sample (x = 0, 1, 2, 3, …,, n)  N – s = # Failures Then for any x = 0, 1, 2,…, the pmf p(x) is given by the following… # combinations of x Successes out of s Successes # combinations of n – x Failures out of N – s Failures See textbook for  and  2. # combinations of n out of N

Hypergeometric Distribution POPULATION N = 100 Random Sample (w/o replacement), n = 12 Discrete random variable X = # defectives in sample (x = 0, 1, 2, 3, 4) s = 4 defectives

Hypergeometric Distribution POPULATION N = 100 Random Sample (w/o replacement), n = 12 Discrete random variable X = # defectives in sample (x = 0, 1, 2, 3, 4) dhyper(0:4, 4, 96, 12) 0 0.594684 1 0.335822 2 0.064431 3 0.004937 4 0.000126 s = 4 defectives

∞ N < ∞ N < ∞, n  N/10 X = # Successes (x) in n trials (x = 0, 1, 2, …, n) Classical Discrete Model Population Size Sampling: replacement? Bernoulli trials?2 pmf p(x) P(X = x) Binomial X ~ Bin(n, ) dbinom(x, n, ) ∞ with or without yes N < ∞ with Poisson1 X ~ Pois() dpois(x, ) Hypergeometric X ~ Hyp(x, s, N, n) dhyper(x, s, N – s, n) N < ∞, n  N/10 without no 1 for rare events ONLY, i.e., small, n large 2 independent outcomes, with constant  = P(Success) in population

Negative Binomial Distribution Infinite Population of “Successes” and “Failures” Random Sample (w/ or w/o replacement) P(Success) =  P(Failure) = 1 –  Sample size NOT specified! Discrete random variable X = # trials for s Successes (x = s, s+1, s+2, s+3, …) Then for any x = s, s+1, s+2,…, the pmf p(x) is… See textbook for  and  2. dnbinom(x–s, s, ) (In R, X counts the # Failures before s Successes)

Negative Binomial Distribution Random Variable X = # trials for s = 6 Spades (x = 6, 7, 8, 9, 10,…)  = P(Spades) = 13/52 Random Sample (w/ replacement)

Negative Binomial Distribution Random Variable X = # trials for s = 1 Spades (x = 1, 2, 3, 4, 5,…) Random Variable X = # trials for s = 6 Spades (x = 6, 7, 8, 9, 10,…)  = P(Spades) = 13/52 Random Sample (w/ replacement) SPECIAL CASE OF NEGATIVE BINOMIAL: s = 1

Negative Binomial Distribution Geometric Distribution Random Variable X = # trials for s = 1 Spades (x = 1, 2, 3, 4, 5,…)  = P(Spades) = 13/52 Random Sample (w/ replacement) SPECIAL CASE OF NEGATIVE BINOMIAL: s = 1 (x – 1) Failures 1 Success! dgeom(x–1, ) (In R, X counts the # Failures before 1 Success)

Negative Binomial Distribution Geometric Distribution Random Variable X = # trials for s = 1 Spades (x = 1, 2, 3, 4, 5,…)  = P(Spades) = 13/52 Random Sample (w/ replacement) SPECIAL CASE OF NEGATIVE BINOMIAL: s = 1

Negative Binomial Distribution Geometric Distribution Random Variable X = # trials for s = 1 Spades (x = 1, 2, 3, 4, 5,…)  = P(Spades) = 13/52 Random Sample (w/ replacement) x 1 2 3 4 5 6 7 p(x) .250 .188 .141 .105 .079 .059 .044 Exercise Graph cdf F(x)

Multinomial Distribution Geometric Distribution Random Variables X1 = # Spades X2 = # Clubs X3 = # Hearts X4 = # Diamonds For i = 1, 2, 3, 4 xi = 0, 1, 2,…,10 with x1 + x2 + x3 + x4 = 10. Random Variable X = # trials for s = 1 Spades (x = 1, 2, 3, 4, 5,…) 1 = P(Spades) = 13/52 2 = P(Clubs) = 13/52 3 = P(Hearts) = 13/52 4 = P(Diamonds) = 13/52 Random Sample n = 10 (w/ replacement) Random Sample (w/ replacement)  = P(Spades) = 13/52

Discrete random variable RECALL… BINARY POPULATION of “Successes” vs. “Failures” P(Success) =  P(Failure) = 1 –  Discrete random variable X = # “Successes” in sample (n – X = # “Failures” in sample) (0, 1, 2, 3, …, n) RANDOM SAMPLE of n “Bernoulli trials” Then X is said to follow a Binomial distribution, written X ~ Bin(n, ), with “probability mass function” p(x) = … x = 0, 1, 2, …, n

Discrete random variable RECALL… BINARY POPULATION of “Successes” vs. “Failures” P(Success) =  P(Failure) = 1 –  OR… Discrete random variable X = # “Successes” in sample (n – X = # “Failures” in sample) (0, 1, 2, 3, …, n) RANDOM SAMPLE of n “Bernoulli trials” Then X is said to follow a Binomial distribution, written X ~ Bin(n, ), with “probability mass function” p(x) = … x = 0, 1, 2, …, n

Discrete random variables Discrete random variable RECALL… BINARY POPULATION of “Successes” vs. “Failures” P(Success) = 1 P(Failure) = 2 1 + 2 = 1 OR… Discrete random variables X1 = # “Successes” in sample X2 = # “Failures” in sample = n – X1 (0, 1, 2, 3, …, n) Discrete random variable X = # “Successes” in sample (n – X = # “Failures” in sample) (0, 1, 2, 3, …, n) RANDOM SAMPLE of n “Bernoulli trials” Then X is said to follow a Binomial distribution, written X ~ Bin(n, ), with “probability mass function” p(x) = … x = 0, 1, 2, …, n

Discrete random variables RECALL… BINARY POPULATION of “Successes” vs. “Failures” P(Success) = 1 P(Failure) = 2 1 + 2 = 1 OR… Discrete random variables X1 = # “Successes” in sample X2 = # “Failures” in sample = n – X1 (0, 1, 2, 3, …, n) RANDOM SAMPLE of n “Bernoulli trials” Then X1 and X2 “jointly” follow a Binomial distribution, written X1 ~ Bin(n, 1), X2 ~ Bin(n, 2), with “probability mass function”… for any x1 = 0, 1, 2, …, n, x2 = 0, 1, 2,…, n, with . x1 + x2 = n

Discrete random variables POPULATION of k categories P(Category 1) = 1 P(Category 2) = 2 P(Category k) = k BINARY POPULATION of “Successes” vs. “Failures” P(Success) = 1 P(Failure) = 2 1 + 2 = 1 1 + 2 + … + k = 1 Discrete random variables X1 = # “Successes” in sample X2 = # “Failures” in sample = n – X1 (0, 1, 2, 3, …, n) Discrete random variables X1 = # Category 1 in sample X2 = # Category 2 in sample Xk = # Category k in sample RANDOM SAMPLE of n “Bernoulli trials” Then the components of X = (X1, X2,…, Xk) “jointly” follow a Multinomial distribution, written X ~ Multi(n, 1, 2, …, k), with “probability mass function” p(x1,…, xk) = for any xi = 0, 1, 2, …, n, with .

Multinomial Distribution Random Variables X1 = # Spades X2 = # Clubs X3 = # Hearts X4 = # Diamonds xi = 0, 1, 2,…,10 with x1 + x2 + x3 + x4 = 10. Random Variable X = # trials for s = 6 Spades (x = 6, 7, 8, 9, 10,…) 1 = P(Spades) = 13/52 2 = P(Clubs) = 13/52 3 = P(Hearts) = 13/52 4 = P(Diamonds) = 13/52 Random Sample (w/ replacement) Random Sample n = 10 (w/ replacement)  = P(Spades) = 13/52 dmultinom(c(1,2,3,4), 10, c(.25,.25,.25,.25))