Chapter 3 Discrete Random Variables and Probability Distributions  3.1 - Random Variables.2 - Probability Distributions for Discrete Random Variables.3.

Slides:



Advertisements
Similar presentations
Statistics review of basic probability and statistics.
Advertisements

Random Variables A random variable is a variable (usually we use x), that has a single numerical value, determined by chance, for each outcome of a procedure.
CHAPTER Discrete Models  G eneral distributions  C lassical: Binomial, Poisson, etc Continuous Models  G eneral distributions  C.
1 Set #3: Discrete Probability Functions Define: Random Variable – numerical measure of the outcome of a probability experiment Value determined by chance.
Chapter 2 Discrete Random Variables
Review of Basic Probability and Statistics
Chapter 4 Discrete Random Variables and Probability Distributions
Chapter 5 Basic Probability Distributions
Review.
Probability Distributions
Probability and Statistics Review
Probability Distributions Random Variables: Finite and Continuous A review MAT174, Spring 2004.
Probability Distributions Random Variables: Finite and Continuous Distribution Functions Expected value April 3 – 10, 2003.
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
Chapter 21 Random Variables Discrete: Bernoulli, Binomial, Geometric, Poisson Continuous: Uniform, Exponential, Gamma, Normal Expectation & Variance, Joint.
4-1 Continuous Random Variables 4-2 Probability Distributions and Probability Density Functions Figure 4-1 Density function of a loading on a long,
Random Variables and Probability Distributions
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 4 and 5 Probability and Discrete Random Variables.
Week71 Discrete Random Variables A random variable (r.v.) assigns a numerical value to the outcomes in the sample space of a random phenomenon. A discrete.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Discrete Random Variables Chapter 4.
Continuous Probability Distributions  Continuous Random Variable  A random variable whose space (set of possible values) is an entire interval of numbers.
Statistics for Engineer Week II and Week III: Random Variables and Probability Distribution.
Random Variables. A random variable X is a real valued function defined on the sample space, X : S  R. The set { s  S : X ( s )  [ a, b ] is an event}.
Chapter 5: Random Variables and Discrete Probability Distributions
CHAPTER Discrete Models  G eneral distributions  C lassical: Binomial, Poisson, etc Continuous Models  G eneral distributions 
Probability & Statistics I IE 254 Summer 1999 Chapter 4  Continuous Random Variables  What is the difference between a discrete & a continuous R.V.?
Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 5 Discrete Random Variables.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 5 Discrete Random Variables.
STA347 - week 51 More on Distribution Function The distribution of a random variable X can be determined directly from its cumulative distribution function.
Week 21 Conditional Probability Idea – have performed a chance experiment but don’t know the outcome (ω), but have some partial information (event A) about.
Math b (Discrete) Random Variables, Binomial Distribution.
STA347 - week 31 Random Variables Example: We roll a fair die 6 times. Suppose we are interested in the number of 5’s in the 6 rolls. Let X = number of.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 5-5 Poisson Probability Distributions.
Random Variables an important concept in probability.
Exam 2: Rules Section 2.1 Bring a cheat sheet. One page 2 sides. Bring a calculator. Bring your book to use the tables in the back.
IE 300, Fall 2012 Richard Sowers IESE. 8/30/2012 Goals: Rules of Probability Counting Equally likely Some examples.
Chapter 3 Discrete Random Variables and Probability Distributions  Random Variables.2 - Probability Distributions for Discrete Random Variables.3.
Random Variables Example:
Probability Distributions, Discrete Random Variables
Probability Theory Modelling random phenomena. Permutations the number of ways that you can order n objects is: n! = n(n-1)(n-2)(n-3)…(3)(2)(1) Definition:
Chapter 4 Continuous Random Variables and Probability Distributions
Chapter 3 Discrete Random Variables and Probability Distributions  Random Variables.2 - Probability Distributions for Discrete Random Variables.3.
CSE 474 Simulation Modeling | MUSHFIQUR ROUF CSE474:
Chapter 3 Discrete Random Variables and Probability Distributions  Random Variables.2 - Probability Distributions for Discrete Random Variables.3.
CHAPTER Discrete Models  G eneral distributions  C lassical: Binomial, Poisson, etc Continuous Models  G eneral distributions 
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 5 Discrete Random Variables.
Unit 4 Review. Starter Write the characteristics of the binomial setting. What is the difference between the binomial setting and the geometric setting?
Chap 5-1 Chapter 5 Discrete Random Variables and Probability Distributions Statistics for Business and Economics 6 th Edition.
Fractiles Given a probability distribution F(x) and a number p, we define a p-fractile x p by the following formulas.
Random Variables By: 1.
12.SPECIAL PROBABILITY DISTRIBUTIONS
Topic Overview and Study Checklist. From Chapter 7 in the white textbook: Modeling with Differential Equations basic models exponential logistic modified.
Supplemental Lecture Notes
CHAPTER 2 RANDOM VARIABLES.
Chapter 2 Discrete Random Variables
Example: X = Cholesterol level (mg/dL)
Chapter 4 Continuous Random Variables and Probability Distributions
Discrete Probability Distributions
Chapter 3 Discrete Random Variables and Probability Distributions
Chapter 3 Discrete Random Variables and Probability Distributions
Probability Review for Financial Engineers
x1 p(x1) x2 p(x2) x3 p(x3) POPULATION x p(x) ⋮ Total 1 “Density”
ASV Chapters 1 - Sample Spaces and Probabilities
Discrete Random Variables
ASV Chapters 1 - Sample Spaces and Probabilities
ASV Chapters 1 - Sample Spaces and Probabilities
M248: Analyzing data Block A UNIT A3 Modeling Variation.
Random Variables Binomial and Hypergeometric Probabilities
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Presentation transcript:

Chapter 3 Discrete Random Variables and Probability Distributions  Random Variables.2 - Probability Distributions for Discrete Random Variables.3 - Expected Values.4 - The Binomial Probability Distribution.5 - Hypergeometric and Negative Binomial Distributions.6 - The Poisson Probability Distribution

What is the connection between probability and random variables? Events (and their corresponding probabilities) that involve experimental measurements can be described by random variables. 2

POPULATION random variable X 3 x1x1 x2x2 x3x3 x4x4 x5x5 x6x6 …etc…. xnxn Data values x i Relative Frequencies f (x i ) = f i /n x1x1 f (x 1 ) x2x2 f (x 2 ) x3x3 f (x 3 ) ⋮⋮ xkxk f (x k ) Total1 Pop values x i Probabilities f (x i ) x1x1 f (x 1 ) x2x2 f (x 2 ) x3x3 f (x 3 ) ⋮⋮ Total1 SAMPLE of size n Example: X = Cholesterol level (mg/dL)

POPULATION Pop values x Probabilities f (x) x1x1 f (x 1 ) x2x2 f (x 2 ) x3x3 f (x 3 ) ⋮⋮ Total1 Example: X = Cholesterol level (mg/dL) random variable X Probability Histogram X Total Area = 1 f(x) = Probability that the random variable X is equal to a specific value x, i.e., |x|x “probability mass function” (pmf) f(x) = P(X = x)

X Consider the following discrete random variable… 5 Example: X = “value shown on a single random toss of a fair die (1, 2, 3, 4, 5, 6)” Probability Table xf(x)f(x) 11/ Probability Histogram “What is the probability of rolling a 4?” X is said to be uniformly distributed over the values 1, 2, 3, 4, 5, 6. Total Area = 1 P(X = x) Density

Probability Table xf(x)f(x) 11/ X Consider the following discrete random variable… 6 Example: X = “value shown on a single random toss of a fair die (1, 2, 3, 4, 5, 6)” Probability Histogram “What is the probability of rolling a 4?” X is said to be uniformly distributed over the values 1, 2, 3, 4, 5, 6. Total Area = 1 Density P(X = x)

X POPULATION Pop values x Probabilities f (x) x1x1 f (x 1 ) x2x2 f (x 2 ) x3x3 f (x 3 ) ⋮⋮ Total1 Example: X = Cholesterol level (mg/dL) random variable X Probability Histogram Total Area = 1 F(x) = Probability that the random variable X is less than or equal to a specific value x, i.e., “cumulative distribution function” (cdf) F(x) = P(X  x) |x|x

Motivation ~ Motivation ~ Consider the following discrete random variable… 8 Example: X = “value shown on a single random toss of a fair die (1, 2, 3, 4, 5, 6)” P(X = x) xf(x)f(x) 11/ X is said to be uniformly distributed over the values 1, 2, 3, 4, 5, 6. Cumulative distribution P(X  x)P(X  x) F(x)F(x) 1/6 2/6 3/6 4/6 5/6 1

Motivation ~ Motivation ~ Consider the following discrete random variable… 9 Example: X = “value shown on a single random toss of a fair die (1, 2, 3, 4, 5, 6)” P(X = x) xf(x)f(x) 11/ X is said to be uniformly distributed over the values 1, 2, 3, 4, 5, 6. Cumulative distribution P(X  x)P(X  x) F(x)F(x) 1/6 2/6 3/6 4/6 5/6 1 “staircase graph” from 0 to 1

POPULATION Pop vals xpmf f (x) cdf F(x) = P(X  x) x1x1 f (x 1 ) F(x 1 ) = f(x 1 ) x2x2 f (x 2 ) F(x 2 ) = f(x 1 ) + f(x 2 ) x3x3 f (x 3 ) F(x 3 ) = f(x 1 ) + f(x 2 ) + f(x 3 ) ⋮⋮⋮ Total1 increases from 0 to 1 Example: X = Cholesterol level (mg/dL) random variable X X Calculating “interval probabilities”… f (x) F( b ) – F( a – ) = F(b) = P(X  b) F( a – ) = P(X  a – ) P(X  b) – P(X  a – ) = P( a  X  b) |a|a |b|b |a–|a–

POPULATION Pop vals xpmf f (x) cdf F(x) = P(X  x) x1x1 f (x 1 ) F(x 1 ) = f(x 1 ) x2x2 f (x 2 ) F(x 2 ) = f(x 1 ) + f(x 2 ) x3x3 f (x 3 ) F(x 3 ) = f(x 1 ) + f(x 2 ) + f(x 3 ) ⋮⋮⋮ Total1 increases from 0 to 1 Example: X = Cholesterol level (mg/dL) random variable X X Calculating “interval probabilities”… F( b ) – F( a – ) f (x) F( b ) – F( a – ) = F(b) = P(X  b) P(X  b) – P(X  a – ) = P( a  X  b) F( a – ) = P(X  a – ) |a–|a– f (x) = |a|a FUNDAMENTAL THEOREM OF CALCULUS (discrete form) |b|b

POPULATION Pop vals xpmf f (x) cdf F(x) = P(X  x) x1x1 f (x 1 ) F(x 1 ) = f(x 1 ) x2x2 f (x 2 ) F(x 2 ) = f(x 1 ) + f(x 2 ) x3x3 f (x 3 ) F(x 3 ) = f(x 1 ) + f(x 2 ) + f(x 3 ) ⋮⋮⋮ Total1 increases from 0 to 1 Example: X = Cholesterol level (mg/dL) random variable X X Calculating “interval probabilities”… F( b ) – F( a – ) f (x) F( b ) – F( a – ) = F(b) = P(X  b) P(X  b) – P(X  a – ) = P( a  X  b) F( a – ) = P(X  a – ) |a–|a– f (x) = |a|a FUNDAMENTAL THEOREM OF CALCULUS (discrete form) |b|b Hey!!! What about the population mean  and the population variance  2 ???

Chapter 3 Discrete Random Variables and Probability Distributions  Random Variables.2 - Probability Distributions for Discrete Random Variables.3 - Expected Values.4 - The Binomial Probability Distribution.5 - Hypergeometric and Negative Binomial Distributions.6 - The Poisson Probability Distribution

14 POPULATION Pop values x Probabilities f (x) x1x1 f (x 1 ) x2x2 f (x 2 ) x3x3 f (x 3 ) ⋮⋮ Total1 Population mean Also denoted by E[X], the “expected value” of the variable X. Population variance Example: X = Cholesterol level (mg/dL) random variable X Just as the sample mean and sample variance s 2 were used to characterize “measure of center” and “measure of spread” of a dataset, we can now define the “true” population mean  and population variance  2, using probabilities.

Pop values x Probabilities f (x) x1x1 f (x 1 ) x2x2 f (x 2 ) x3x3 f (x 3 ) ⋮⋮ Total1 15 POPULATION Population mean Also denoted by E[X], the “expected value” of the variable X. Population variance Example: X = Cholesterol level (mg/dL) random variable X Just as the sample mean and sample variance s 2 were used to characterize “measure of center” and “measure of spread” of a dataset, we can now define the “true” population mean  and population variance  2, using probabilities.

Pop values x i Probabilities f (x i ) 2101/6 2401/3 2701/2 Total1 Example 1: 16 POPULATION /6 1/3 1/2 Example: X = Cholesterol level (mg/dL) random variable X

Example 2: 17 POPULATION Pop values x i Probabilities f (x i ) 1801/3 2101/3 2401/3 Total1 1/3 1/3 1/3 Equally likely outcomes result in a “uniform distribution.” ( clear from symmetry) Example: X = Cholesterol level (mg/dL) random variable X

To summarize… 18

19 POPULATION SAMPLE of size n x1x1 x2x2 x3x3 x4x4 x5x5 x6x6 …etc…. xnxn Data x i Relative Frequencies f (x i ) = f i /n x1x1 f (x 1 ) x2x2 f (x 2 ) x3x3 f (x 3 ) ⋮⋮ xkxk f (x k ) 1 Pop x i Probabilities f (x i ) x1x1 f (x 1 ) x2x2 f (x 2 ) x3x3 f (x 3 ) ⋮⋮ 1 Frequency Table Probability Table Probability Histogram X Total Area = 1 Density Histogram X Total Area = 1 Discrete random variable X

20 POPULATION SAMPLE of size n x1x1 x2x2 x3x3 x4x4 x5x5 x6x6 …etc…. xnxn Data x i Relative Frequencies f (x i ) = f i /n x1x1 f (x 1 ) x2x2 f (x 2 ) x3x3 f (x 3 ) ⋮⋮ xkxk f (x k ) 1 Pop x i Probabilities f (x i ) x1x1 f (x 1 ) x2x2 f (x 2 ) x3x3 f (x 3 ) ⋮⋮ 1 Frequency Table Probability Table Probability Histogram X Total Area = 1 Density Histogram X Total Area = 1 Discrete random variable X Continuous

21

22 Example 3: TWO INDEPENDENT POPULATIONS X 1 = Cholesterol level (mg/dL) xf 1 (x) 2101/6 2401/3 2701/2 Total1 X 2 = Cholesterol level (mg/dL) xf 2 (x) 1801/3 2101/3 2401/3 Total1  1 = 250  1 2 = 500  2 = 210  2 2 = 600 D = X 1 – X 2 ~ ??? dOutcomes -30(210, 240) 0(210, 210), (240, 240) +30(210, 180), (240, 210), (270, 240) +60(240, 180), (270, 210) +90(270, 180) NOTE: By definition, this is the sample space of the experiment! N O T E : B y d e f i n i t i o n, t h i s i s t h e s a m p l e s p a c e o f t h e e x p e r i m e n t ! What are the probabilities of the corresponding events “D = d” for d = -30, 0, 30, 60, 90? N O T E : B y d e f i n i t i o n, t h i s i s t h e s a m p l e s p a c e o f t h e e x p e r i m e n t ! W h a t a r e t h e p r o b a b i l i t i e s o f t h e c o r r e s p o n d i n g e v e n t s “ D = d ” f o r d = - 3 0, 0, 3 0, 6 0, 9 0 ?

dOutcomes -30(210, 240) 0(210, 210), (240, 240) +30(210, 180), (240, 210), (270, 240) +60(240, 180), (270, 210) +90(270, 180) dProbabilities f(d) -301/9 ? 02/9 ? +303/9 ? +602/9 ? +901/9 ? 23 Example 3: TWO INDEPENDENT POPULATIONS X 1 = Cholesterol level (mg/dL) xf 1 (x) 2101/6 2401/3 2701/2 Total1 X 2 = Cholesterol level (mg/dL) xf 2 (x) 1801/3 2101/3 2401/3 Total1  1 = 250  1 2 = 500  2 = 210  2 2 = 600 D = X 1 – X 2 ~ ??? The outcomes of D are NOT EQUALLY LIKELY!!!

dOutcomes -30(210, 240) 0(210, 210), (240, 240) +30(210, 180), (240, 210), (270, 240) +60(240, 180), (270, 210) +90(270, 180) dProbabilities f(d) -30(1/6)(1/3) = 1/18 via independence 0(210, 210), (240, 240) +30(210, 180), (240, 210), (270, 240) +60(240, 180), (270, 210) +90(270, 180) 24 Example 3: TWO INDEPENDENT POPULATIONS X 1 = Cholesterol level (mg/dL) xf 1 (x) 2101/6 2401/3 2701/2 Total1 X 2 = Cholesterol level (mg/dL) xf 2 (x) 1801/3 2101/3 2401/3 Total1  1 = 250  1 2 = 500  2 = 210  2 2 = 600 D = X 1 – X 2 ~ ???

dProbabilities f(d) -30(1/6)(1/3) = 1/18 via independence 0(210, 210), (240, 240) +30(210, 180), (240, 210), (270, 240) +60(240, 180), (270, 210) +90(270, 180) dProbabilities f(d) -30(1/6)(1/3) = 1/18 via independence 0(1/6)(1/3) + (1/3)(1/3) = 3/18 +30(210, 180), (240, 210), (270, 240) +60(240, 180), (270, 210) +90(270, 180) 25 Example 3: TWO INDEPENDENT POPULATIONS X 1 = Cholesterol level (mg/dL) xf 1 (x) 2101/6 2401/3 2701/2 Total1 X 2 = Cholesterol level (mg/dL) xf 2 (x) 1801/3 2101/3 2401/3 Total1  1 = 250  1 2 = 500  2 = 210  2 2 = 600 D = X 1 – X 2 ~ ???

dProbabilities f(d) -30(1/6)(1/3) = 1/18 via independence 0(1/6)(1/3) + (1/3)(1/3) = 3/18 +30(210, 180), (240, 210), (270, 240) +60(240, 180), (270, 210) +90(270, 180) 26 Example 3: TWO INDEPENDENT POPULATIONS X 1 = Cholesterol level (mg/dL) xf 1 (x) 2101/6 2401/3 2701/2 Total1 X 2 = Cholesterol level (mg/dL) xf 2 (x) 1801/3 2101/3 2401/3 Total1  1 = 250  1 2 = 500  2 = 210  2 2 = 600 D = X 1 – X 2 ~ ??? dProbabilities f(d) -30(1/6)(1/3) = 1/18 via independence 0(1/6)(1/3) + (1/3)(1/3) = 3/18 +30(1/6)(1/3) + (1/3)(1/3) + (1/2)(1/3) = 6/18 +60(1/3)(1/3) + (1/2)(1/3) = 5/18 +90(1/2)(1/3) = 3/18 1/18 3/18 6/18 5/18 3/18 Probability Histogram

dProbabilities f(d) -30(1/6)(1/3) = 1/18 via independence 0(1/6)(1/3) + (1/3)(1/3) = 3/18 +30(210, 180), (240, 210), (270, 240) +60(240, 180), (270, 210) +90(270, 180) 27 Example 3: TWO INDEPENDENT POPULATIONS X 1 = Cholesterol level (mg/dL) xf 1 (x) 2101/6 2401/3 2701/2 Total1 X 2 = Cholesterol level (mg/dL) xf 2 (x) 1801/3 2101/3 2401/3 Total1  1 = 250  1 2 = 500  2 = 210  2 2 = 600 D = X 1 – X 2 ~ ??? dProbabilities f(d) -30(1/6)(1/3) = 1/18 via independence 0(1/6)(1/3) + (1/3)(1/3) = 3/18 +30(1/6)(1/3) + (1/3)(1/3) + (1/2)(1/3) = 6/18 +60(1/3)(1/3) + (1/2)(1/3) = 5/18 +90(1/2)(1/3) = 3/18  1 = 250  1 2 = 500  2 = 210  2 2 = 600 1/18 3/18 6/18 5/18 3/18 Probability Histogram  D = (-30)(1/18) + (0)(3/18) + (30)(6/18) + (60)(5/18) + (90)(3/18) = 40  D 2 = (-70) 2 (1/18) + (-40) 2 (3/18) + (-10) 2 (6/18) + (20) 2 (5/18) + (50) 2 (3/18) = 1100  D =  1 –  2  D 2 =   2 2

General: TWO INDEPENDENT POPULATIONS dProbabilities f(d) -30(1/6)(1/3) = 1/18 via independence 0(1/6)(1/3) + (1/3)(1/3) = 3/18 +30(210, 180), (240, 210), (270, 240) +60(240, 180), (270, 210) +90(270, 180) 28 X1X1 xf 1 (x) 2101/6 2401/3 2701/2 Total1 X2X2 xf 2 (x) 1801/3 2101/3 2401/3 Total1  1 = 250  1 2 = 500  2 = 210  2 2 = 600 dProbabilities f(d) -30(1/6)(1/3) = 1/18 via independence 0(1/6)(1/3) + (1/3)(1/3) = 3/18 +30(1/6)(1/3) + (1/3)(1/3) + (1/2)(1/3) = 6/18 +60(1/3)(1/3) + (1/2)(1/3) = 5/18 +90(1/2)(1/3) = 3/18  1 = 250  1 2 = 500  2 = 210  2 2 = 600 1/18 3/18 6/18 5/18 3/18 Probability Histogram  D = (-30)(1/18) + (0)(3/18) + (30)(6/18) + (60)(5/18) + (90)(3/18) = 40  D 2 = (-70) 2 (1/18) + (-40) 2 (3/18) + (-10) 2 (6/18) + (20) 2 (5/18) + (50) 2 (3/18) = 1100 D = X 1 – X 2 ~ ???  D =  1 –  2  D 2 =   2 2 – 2 Cov (X 1, X 2 ) X 1 = Cholesterol level (mg/dL)X 2 = Cholesterol level (mg/dL) These two formulas are valid for continuous as well as discrete distributions. IF the two populations are dependent… …then this formula still holds, BUT……