Random Variables And Discrete Probability Distributions

Slides:



Advertisements
Similar presentations
Lecture (7) Random Variables and Distribution Functions.
Advertisements

© 2004 Prentice-Hall, Inc.Chap 5-1 Basic Business Statistics (9 th Edition) Chapter 5 Some Important Discrete Probability Distributions.
Chapter 5 Some Important Discrete Probability Distributions
Chapter 5 Discrete Random Variables and Probability Distributions
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Basic Business Statistics.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Basic Business Statistics.
© 2003 Prentice-Hall, Inc.Chap 5-1 Basic Business Statistics (9 th Edition) Chapter 5 Some Important Discrete Probability Distributions.
ฟังก์ชั่นการแจกแจงความน่าจะเป็น แบบไม่ต่อเนื่อง Discrete Probability Distributions.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Statistics.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 5-1 Chapter 5 Discrete Probability Distributions Statistics for Managers using Microsoft.
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 7 Probability.
Chapter 5 Discrete Probability Distributions
Chapter 4 Discrete Random Variables and Probability Distributions
1 1 Slide 2009 University of Minnesota-Duluth, Econ-2030 (Dr. Tadesse) Chapter 5: Probability Distributions: Discrete Probability Distributions.
1 Random Variables and Discrete probability Distributions Chapter 7.
Introduction to Probability and Statistics
Discrete Random Variables and Probability Distributions
Probability Distributions
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Statistics.
Discrete Probability Distributions
Chapter 5 Discrete Random Variables and Probability Distributions
Discrete Probability Distributions
Chap 5-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 5-1 Chapter 5 Discrete Probability Distributions Basic Business Statistics.
Keller: Stats for Mgmt & Econ, 7th Ed
Statistics Alan D. Smith.
Statistics for Managers Using Microsoft® Excel 5th Edition
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 7.1 Chapter 7 Random Variables and Discrete Probability Distributions.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 7.1 Chapter 7 Random Variables and Discrete Probability Distributions.
Binomial Probability Distribution.
1 Random Variables and Discrete probability Distributions SESSION 2.
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 4 and 5 Probability and Discrete Random Variables.
Chap 5-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 5 Discrete Probability Distributions Business Statistics: A First.
Lecture Note 5 Discrete Random Variables and Probability Distributions ©
Chapter 5 Discrete Random Variables and Probability Distributions ©
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Basic Business Statistics.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Review and Preview This chapter combines the methods of descriptive statistics presented in.
Random Variables… A random variable is a symbol that represents the outcome of an experiment. Alternatively, the value of a random variable can be either.
Chapter 5 Discrete Random Variables Statistics for Business 1.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 1 – Slide 1 of 34 Chapter 11 Section 1 Random Variables.
Random Variables and Discrete probability Distributions
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Basic Business Statistics.
Random Variables. A random variable X is a real valued function defined on the sample space, X : S  R. The set { s  S : X ( s )  [ a, b ] is an event}.
Binomial Experiment A binomial experiment (also known as a Bernoulli trial) is a statistical experiment that has the following properties:
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 5-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Chapter 7 Random Variables and Discrete Probability Distributions Sir Naseer Shahzada.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Mistah Flynn.
Probability Distribution
1 1 Slide University of Minnesota-Duluth, Econ-2030 (Dr. Tadesse) University of Minnesota-Duluth, Econ-2030 (Dr. Tadesse) Chapter 5: Probability Distributions:
Statistics Lecture Notes Dr. Halil İbrahim CEBECİ Chapter 06 Random Variables and Discrete Probability Distributions.
Chapter 5 Discrete Random Variables Probability Distributions
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 5-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Chapter 5 Probability Distributions 5-1 Overview 5-2 Random Variables 5-3 Binomial Probability Distributions 5-4 Mean, Variance and Standard Deviation.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Business Statistics,
Chapter 4 Discrete Random Variables and Probability Distributions
Conceptual Foundations © 2008 Pearson Education Australia Lecture slides for this course are based on teaching materials provided/referred by: (1) Statistics.
Probability Distributions ( 확률분포 ) Chapter 5. 2 모든 가능한 ( 확률 ) 변수의 값에 대해 확률을 할당하는 체계 X 가 1, 2, …, 6 의 값을 가진다면 이 6 개 변수 값에 확률을 할당하는 함수 Definition.
Discrete Probability Distributions
Keller: Stats for Mgmt & Econ, 7th Ed
Keller: Stats for Mgmt & Econ, 7th Ed
Chapter 5 Some Important Discrete Probability Distributions
Discrete Probability Distributions
Probability distributions
Discrete Probability Distributions
Discrete Probability Distributions
Discrete Probability Distributions
Discrete Probability Distributions
Discrete Probability Distributions
Discrete Random Variables and Probability Distributions
Presentation transcript:

Random Variables And Discrete Probability Distributions Statistics for Management and Economics Chapter 7

Objectives Random Variables and Probability Distributions Bivariate Distributions Binomial Distribution Poisson Distribution

Random Variables A random variable is a function or rule that assigns a number to each outcome of an experiment. Alternatively, the value of a random variable is a numerical event. Instead of talking about the coin flipping event as {heads, tails} think of it as “the number of heads when flipping a coin” so we have {1, 0}

Random Variables Continuous one whose values are not discrete, not countable Values resulting from the event time taken to walk to campus (anywhere from 5 to 30 minutes) Real Numbers Discrete one that takes on a countable number of values Values on the roll of dice: 2, 3, 4, …, 12 Integers

Probability Distributions A probability distribution is a table, formula, or graph that describes the values of a random variable and the probability associated with these values. Since we’re describing a random variable (which can be discrete or continuous) we have two types of probability distributions: Discrete Probability Distribution, (this chapter) Continuous Probability Distribution (Chapter 8)

Notation An upper-case letter will represent the name of the random variable, usually X. Its lower-case counterpart will represent the value of the random variable. The probability that the random variable X will equal x is: P(X = x) …or more simply P(x)

Discrete Probability Distributions The probabilities of the values of a discrete random variable may be derived by means of probability tools such as tree diagrams or by applying one of the definitions of probability, so long as these two conditions apply:

Discrete Distributions A survey of Amazon.com shoppers reveals the following probability distribution of he number of books purchased per hit: e.g. P(X=4) = P(4) = 0.06 = 6% # of books x P(x) 0.35 1 0.25 2 0.20 3 0.08 4 0.06 5 0.03 6 0.02 7 0.01 What is the probability that a shopper buys at least one book, but no more than 3 books? P(1 ≤ X ≤ 3) = P(1) + P(2) + P(3) = 0.25 + 0.20 + 0.08 = 0.53

Developing a Probability Distribution Probability calculation techniques can be used to develop probability distributions, for example, a new game in Vegas is developed where a fair coin is tossed three times. What is the probability distribution of the number of heads if I play this game in Vegas? Let H denote success, i.e. flipping a head P(H)=.50 Thus HC is not flipping a head, and P(HC)=.50

Probability Distribution The discrete probability distribution represents a population Since we have populations, we can describe them by computing various parameters. E.g. the population mean and population variance.

Population Mean Univariate Discrete Probability Distribution The population mean is the weighted average of all of its values. The weights are the probabilities. This parameter is also called the expected value of X and is represented by E(X).

Population Variance Univariate Discrete Probability Distribution The population variance is calculated similarly. It is the weighted average of the squared deviations from the mean. As before, there is a “short-cut” formulation… The standard deviation is the same as before:

Population Mean, Variance, and Standard Deviation # of books x P(x) 0.35 1 0.25 2 0.20 3 0.08 4 0.06 5 0.03 6 0.02 7 0.01 Find the mean, variance, and standard deviation for the population of the number of books purchased on Amazon.com = 0(0.35) + 1(0.25) + … + 7(0.01) = 1.51 = 2.53 = 1.59 = (0-1.51)2*0.35 + (1-1.51)2*0.25 +…+ (7-1.51)2*0.01 = 2.53

Laws of Expected Value E(c) = c E(X + c) = E(X) + c E(cX) = cE(X) The expected value of a constant (c) is just the value of the constant. E(X + c) = E(X) + c E(cX) = cE(X) We can “pull” a constant out of the expected value expression (either as part of a sum with a random variable X or as a coefficient of random variable X).

Laws of Variance V(c) = 0 V(X + c) = V(X) V(cX) = c2V(X) The variance of a constant (c) is zero. V(X + c) = V(X) The variance of a random variable and a constant is just the variance of the random variable (per 1 above). V(cX) = c2V(X) The variance of a random variable and a constant coefficient is the coefficient squared times the variance of the random variable.

Bivariate Distributions Up to now, we have looked at univariate distributions, i.e. probability distributions in one variable. As you might guess, bivariate distributions are probabilities of combinations of two variables. Bivariate probability distributions are also called joint probability. A joint probability distribution of X and Y is a table or formula that lists the joint probabilities for all pairs of values x and y, and is denoted P(x,y). P(x,y) = P(X=x and Y=y)

Discrete Bivariate Distribution As you might expect, the requirements for a bivariate distribution are similar to a univariate distribution, with only minor changes to the notation: for all pairs (x,y).

Bivariate Distribution After analyzing several months of sales data, the owner of an appliance store produced the following joint probability distribution of the number of refrigerators and stoves sold daily: Refrigerators (x) Stoves (y) 1 2 0.08 0.14 0.12 0.34 0.09 0.17 0.13 0.39 0.05 0.18 0.04 0.27 0.22 0.49 0.29 1.00 We interpret these joint probabilities as before. E.g. the probability that the store sells 0 Refrigerators and 1 Stove in the day is P(0, 1) = 0.09

Describing the Bivariate Distribution We can describe the mean, variance, and standard deviation of each variable in a bivariate distribution by working with the marginal probabilities… x P(x) 0.22 1 0.49 2 0.29 y P(y) 0.34 1 0.39 2 0.27 E(x) = 1.07 V(x) = 0.51 x = 0.71 E(y) = 0.93 V(y) = 0.61 y = 0.78 same formulae as for univariate distributions…

Covariance Bivariate Distribution The covariance of two discrete variables is defined as: or alternatively using this shortcut method:

Coefficient of Correlation Bivariate Distribution The coefficient of correlation is calculated in the same way as described earlier

Covariance and Correlation Bivariate Distribution Compute the covariance and the coefficient of correlation between the numbers of refrigerators and stoves sold. COV(X,Y) = (0 –1.07)(0 – 0.93)(0.08) + (1 – 1.07)(0 – 0.93)(0.14) + … … + (2 – 1.07)(2 – 0.93)(.04) = –0.045 = –0.045 ÷ [(.71)(.78)] = –0.081 There is a weak, negative relationship between the two variables.

Sum of Two Variables The bivariate distribution allows us to develop the probability distribution of any combination of the two variables, of particular interest is the sum of two variables. If we consider our example of refrigerators and stoves, we can create a probability distribution… …to answer questions like “what is the probability that two appliances are sold”? P(X+Y=2) = P(0,2) + P(1,1) + P(2,0) = 0.05 + 0.17 + 0.12 = 0.34 x+y 1 2 3 4 P(x+y) 0.08 0.23 0.34 0.31 0.04

Laws: Bivariate Distribution We can derive laws of expected value and variance for the sum of two variables as follows… E(X + Y) = E(X) + E(Y) V(X + Y) = V(X) + V(Y) + 2COV(X, Y) If X and Y are independent, COV(X, Y) = 0 and thus: V(X + Y) = V(X) + V(Y)

Binomial Distribution The binomial distribution is the probability distribution that results from doing a “binomial experiment”. Binomial experiments have the following properties: Fixed number of trials, represented as n. Each trial has two possible outcomes, a “success” and a “failure”. P(success)=p (and thus: P(failure)=1–p), for all trials. The trials are independent, which means that the outcome of one trial does not affect the outcomes of any other trials.

“Success” and “Failure” …are just labels for a binomial experiment, there is no value judgment implied. For an experiment to b binomial, it simply must have two possible outcomes: something happens or something doesn’t happen. For example a coin flip will result in either heads or tails. If we define “heads” as success then necessarily “tails” is considered a failure (inasmuch as we attempting to have the coin lands heads up). Other binomial experiment notions: An election candidate wins or loses An employee is male or female

Binomial Random Variable The random variable of a binomial experiment is defined as the number of successes in the n trials, and is called the binomial random variable. E.g. flip a fair coin 10 times… 1) Fixed number of trials  n=10 2) Each trial has two possible outcomes  {heads (success), tails (failure)} 3) P(success)= 0.50; P(failure)=1–0.50 = 0.50  4) The trials are independent  (i.e. the outcome of heads on the first flip will have no impact on subsequent coin flips). Hence flipping a coin ten times is a binomial experiment since all conditions were met.

Binomial Random Variable The binomial random variable counts the number of successes in n trials of the binomial experiment. It can take on values from 0, 1, 2, …, n. Thus, its a discrete random variable. To calculate the probability associated with each value we use combintorics: for x=0, 1, 2, …, n

Rule of Combinations The number of combiniations of selecting X objects out of n objects is given by n! X!(n-X)! Where [n-factorial] n! = n(n-1)(n-2)…(1) and 0!=1

Often denoted by the symbol n X Rule of Combinations Often denoted by the symbol n X Thus… with n=4 and x=3… n! 4! 4x3x2x1 4 X!(n-X)! 3!(4-3)! (3x2x1)(1) = = = So, there are 4 such sequences, each with the same probability of 0.0009. The probability of obtaining exactly three tagged order forms is equal to (number of possible sequences) X (probability of a particular sequence)

Binomial Distribution P(X) = n! X!(n-X)! pX(1-p)n-X Where… P(X) = probability of X successes given the parameters n and p n = sample size p = probability of success (1-p) = probability of failure X = number of successes in the sample (X = 0, 1, 2, …, n)

E.C.K. Pharmaceutical Company If the likelihood of a tagged order form is 0.1, what is the probability that three tagged order forms are found in the sample of four orders? P(3) = 4! 3!(4-3)! p3(1-p)4-3 = 0.0036 If the likelihood of a tagged order form is 0.1, what is the probability that three or more (i.e., at least three) tagged order forms are selected out of the sample of four order forms? Notation? How do we find this?

Cumulative Probability Thus far, we have been using the binomial probability distribution to find probabilities for individual values of x. To answer the question: “Find the probability that the order is tagged” requires a cumulative probability, that is, P(X ≤ x) What is the probability that we find fewer than three tagged order forms in the sample of four orders? Thus, we want to know what is: P(X ≤ 3) to answer

Cumulative Probability P(X ≤ 3) = P(X=0) + P(X=1) + P(X=2) + P(X=3) Use the binomial probability to find… P(0) = 4! 0!(4-0)! p0(1-p)4-0 = 0.6561 P(1) = 4! 1!(4-1)! p1(1-p)4-1 = 0.2916 P(2) = 4! 2!(4-2)! p2(1-p)4-2 = 0.0486 P(X ≤ 3) = 0.9963 Is there another way to get this?

Binomial Table The probabilities listed in the tables are sometimes cumulative, i.e. P(X ≤ x) – x (sometimes k) is the row index (our book is like this). In some cases, the probabilities are not cumulative – P(X=k) is shown. This should be indicated in a heading on the table. Take out your packet of tables now.

P(X = k) = P(X ≤ k) – P(X ≤ [k–1]) Binomial Table For a binomial table that gives cumulative probabilities for P(X ≤ k)… P(X = k) = P(X ≤ k) – P(X ≤ [k–1]) Likewise, for probabilities given as P(X ≥ k), we have: P(X ≥ k) = 1 – P(X ≤ [k–1]) However, if the table does not give cumulative probabilities, as we saw in the example, in order to find P(X ≤ k) you have to add the k-1, etc probabilities.

=BINOMDIST() Excel Function… There is a binomial distribution function in Excel that can also be used to calculate these probabilities. For example: What is the probability that two or fewer orders are tagged? # successes # trials P(success) cumulative (i.e. P(X≤x)?) P(X≤2)=.9963

Binomial Distribution As you might expect, statisticians have developed general formulas for the mean, variance, and standard deviation of a binomial random variable. They are:

Poisson Distribution Named for Simeon Poisson, the Poisson distribution is a discrete probability distribution and refers to the number of events (a.k.a. successes) within a specific time period or region of space. For example: The number of cars arriving at a service station in 1 hour. (The interval of time is 1 hour.) The number of flaws in a bolt of cloth. (The specific region is a bolt of cloth.) The number of accidents in 1 day on a particular stretch of highway. (The interval is defined by both time, 1 day, and space, the particular stretch of highway.)

The Poisson Experiment Like a binomial experiment, a Poisson experiment has four defining characteristic properties: The number of successes that occur in any interval is independent of the number of successes that occur in any other interval. The probability of a success in an interval is the same for all equal-size intervals The probability of a success is proportional to the size of the interval. The probability of more than one success in an interval approaches 0 as the interval becomes smaller.

Poisson Distribution The Poisson random variable is the number of successes that occur in a period of time or an interval of space in a Poisson experiment. E.g. On average, 96 trucks arrive at a border crossing every hour. E.g. The number of typographic errors in a new textbook edition averages 1.5 per 100 pages. successes time period successes (?!) interval

Poisson Probability Distribution The probability that a Poisson random variable assumes a value of x is given by: and e is the natural logarithm base. FYI:

=Poisson Excel Function