1 Random variables and probability distributions.

Slides:



Advertisements
Similar presentations
Probability Distributions
Advertisements

AP Statistics Chapter 7 Notes. Random Variables Random Variable –A variable whose value is a numerical outcome of a random phenomenon. Discrete Random.
6.2 Construct and Interpret Binomial Distributions
Business Statistics for Managerial Decision
Random Variables A random variable is a variable (usually we use x), that has a single numerical value, determined by chance, for each outcome of a procedure.
1 Set #3: Discrete Probability Functions Define: Random Variable – numerical measure of the outcome of a probability experiment Value determined by chance.
EXAMPLE 1 Construct a probability distribution
EXAMPLE 1 Construct a probability distribution Let X be a random variable that represents the sum when two six-sided dice are rolled. Make a table and.
Section 5.1 Random Variables
BCOR 1020 Business Statistics Lecture 9 – February 14, 2008.
Probability and Probability Distributions
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
Chapter 16: Random Variables
 The Law of Large Numbers – Read the preface to Chapter 7 on page 388 and be prepared to summarize the Law of Large Numbers.
Random variables and probability distributions
The probability of an event based on the fact that some other event has occurred, will occur, or is occurring. P(B/A) = Conditional Probability The probability.
Probability Distributions: Finite Random Variables.
Random Variables A random variable A variable (usually x ) that has a single numerical value (determined by chance) for each outcome of an experiment A.
Copyright ©2011 Nelson Education Limited. Probability and Probability Distributions CHAPTER 4 Part 2.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 4 and 5 Probability and Discrete Random Variables.
Week71 Discrete Random Variables A random variable (r.v.) assigns a numerical value to the outcomes in the sample space of a random phenomenon. A discrete.
Probability, contd. Learning Objectives By the end of this lecture, you should be able to: – Describe the difference between discrete random variables.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 1 – Slide 1 of 34 Chapter 11 Section 1 Random Variables.
Applied Business Forecasting and Regression Analysis Review lecture 2 Randomness and Probability.
1 Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND STATISTICS FOR SCIENTISTS AND ENGINEERS Systems.
A study of education followed a large group of fourth-grade children to see how many years of school they eventually completed. Let x be the highest year.
1 Chapter 16 Random Variables. 2 Expected Value: Center A random variable assumes a value based on the outcome of a random event.  We use a capital letter,
Chapter 5: Random Variables and Discrete Probability Distributions
Chapter 16 Random Variables.
The Mean of a Discrete RV The mean of a RV is the average value the RV takes over the long-run. –The mean of a RV is analogous to the mean of a large population.
Copyright © 2014 Pearson Education, Inc. All rights reserved Chapter 6 Modeling Random Events: The Normal and Binomial Models.
Chapter 5 Discrete Probability Distributions. Random Variable A numerical description of the result of an experiment.
June 11, 2008Stat Lecture 10 - Review1 Midterm review Chapters 1-5 Statistics Lecture 10.
Random Variables Presentation 6.. Random Variables A random variable assigns a number (or symbol) to each outcome of a random circumstance. A random variable.
Slide Slide 1 Section 6-4 Sampling Distributions and Estimators.
Class 2 Probability Theory Discrete Random Variables Expectations.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 6-4 Sampling Distributions and Estimators.
AP STATISTICS Section 7.1 Random Variables. Objective: To be able to recognize discrete and continuous random variables and calculate probabilities using.
1 7.3 RANDOM VARIABLES When the variables in question are quantitative, they are known as random variables. A random variable, X, is a quantitative variable.
7.2 Means & Variances of Random Variables AP Statistics.
Probability Distribution. Probability Distributions: Overview To understand probability distributions, it is important to understand variables and random.
1 Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND STATISTICS FOR SCIENTISTS AND ENGINEERS Systems.
1. 2 At the end of the lesson, students will be able to (c)Understand the Binomial distribution B(n,p) (d) find the mean and variance of Binomial distribution.
Week 5 Discrete Random Variables and Probability Distributions Statistics for Social Sciences.
Probability Distributions Section 7.6. Definitions Random Variable: values are numbers determined by the outcome of an experiment. (rolling 2 dice: rv’s.
WARM UP: Penny Sampling 1.) Take a look at the graphs that you made yesterday. What are some intuitive takeaways just from looking at the graphs?
Chapter 7 Lesson 7.4a Random Variables and Probability Distributions 7.4: Mean and Standard Deviation of a Random Variable.
Lesson 96 – Expected Value & Variance of Discrete Random Variables HL2 Math - Santowski.
The Binomial Probability Distribution
Discrete Random Variables Section 6.1. Objectives Distinguish between discrete and continuous random variables Identify discrete probability distributions.
Copyright © 2009 Pearson Education, Inc. Chapter 16 Random Variables.
4.2 Random Variables and Their Probability distributions
Random Variables and Probability Distribution (2)
Random Variables Random variables assigns a number to each outcome of a random circumstance, or equivalently, a random variable assigns a number to each.
Discrete Random Variables
Chapter 16 Random Variables.
Random Variables Random variables assigns a number to each outcome of a random circumstance, or equivalently, a random variable assigns a number to each.
Lecture 13 Sections 5.4 – 5.6 Objectives:
Discrete Probability Distributions
Week 8 Chapter 14. Random Variables.
Sampling Distribution Models
Probability Review for Financial Engineers
Random Variables 7.1.
Sampling Distributions
A study of education followed a large group of fourth-grade children to see how many years of school they eventually completed. Let x be the highest year.
7.1: Discrete and Continuous Random Variables
Elementary Statistics
Chapter 5: Discrete Probability Distributions
6.1 Construct and Interpret Binomial Distributions
Presentation transcript:

1 Random variables and probability distributions

2 Random variables are a key concept for statistical inference We know that in order to use results from a sample to infer about a larger population, we must avoid bias and chose a sample at random. If our sample is random, then the sample mean, the sample proportion, and other statistics that we calculate from the sample are called: random variables. The behavior of such variables is an essential step on the way to performing statistical inference.

3 Random variables A random variable X associates a numerical value with each elementary outcome of an experiment Example Let X be the number of boys born in a family of 2 children. List the possible values of X X =1 means that there is one boy in the family 2 children in the family X – number of boys BB2 BG1 GB1 GG0

4 Probability distribution of discrete variables In order to understand the pattern of behavior of a random variable, we need to know its possible values and how likely they are to occur. The probability distribution of a discrete random variable X is a list of the distinct numerical values of X, along with their associated probabilities. X x 1 x 2 x 3 …… x k Probability p 1 p 2 p 3 …… p k 0≤p i ≤1 i=1,…,k p 1 +p 2 +…+p k =1

5 Example If X represents the number of boys in a family of 2 children, find the probability distribution of X. The probability of BB is: p(first is a boy and second is a boy) = p(first is a boy)× p(second is a boy)=0.5×0.5=0.25 P(BG)= P(GB)= P(GG)= Family with 2 childrenX – number of boysProbability BB2 BG1 GB1 GG0 Since the 2 events are independent 0.25 Family with 2 childrenX – number of boysProbability BB20.25 BG10.25 GB10.25 GG00.25

6 Example - continued Now we can specify the probability of each value of X Family with 2 childrenX – number of boysProbability BB20.25 BG10.25 GB10.25 GG00.25 Value of XProbability =

7 Notations for random variables We denote the random variable by uppercase letters (e.g., X,V,Z ) We denote the values that the random variable gets by lowercase letters (e.g., x i, v i, z i ). The probability that a random variable X get a certain value x is denoted by: p(X=x i ), or, in short, p(x i ). For example Value of X P(x i ) P(X=0)=0.25  P(X=1)=0.5 P(X=2)=0.25 Remember that the sum of the probabilities must equal 1!!!

8 Example A probability distribution is given in the accompanying table with the additional information that the even values of X are equally likely. Determine the missing entries of the table. Value of XP(x) Answer: The sum of the probabilities in the table: =0.7 The remaining 0.3 probability is equally divided between the values 2,4,and 6 0.1

9 Example Here is the probability distribution p(x) for grade X of a randomly chosen student from a certain class. X =0 represents an F, X =4 represents an A. Xp(x) (F)00.08 (D)10.06 (C)20.24 (B)30.36 (A) What is the probability of getting at least a B? 2. What is the probability of getting higher than a D? 3. What is the probability of getting less than a C? 4. What is the probability of getting no better than a B?

10 Probability histogram A probability histogram serves as a display of a probability distribution Example Number of boys is a family of 2 children XP(x)

11 Example - rolling two dice: X=sum of two dice X=2,3,4,5,6,7,8,9,10,11,12 Outcome of 2 diceXProbability (1,1) Outcome of 2 diceXProbability (1,1)2 (1,2) (2,1) Outcome of 2 diceXProbability (1,1)2 (1,2) (2,1)3 (1,3) (2,2) (3,1) Outcome of 2 diceXProbability (1,1)2 (1,2) (2,1)3 (1,3) (2,2) (3,1)4 (1,4) (2,3) (3,2) (4,1) Outcome of 2 diceXProbability (1,1)2 (1,2) (2,1)3 (1,3) (2,2) (3,1)4 (1,4) (2,3) (3,2) (4,1)5 (1,5) (2,4) (3,3) (4,2) (5,1) Outcome of 2 diceXProbability (1,1)2 (1,2) (2,1)3 (1,3) (2,2) (3,1)4 (1,4) (2,3) (3,2) (4,1)5 (1,5) (2,4) (3,3) (4,2) (5,1)6 (1,6) (2,5) (3,4) (4,3) (5,2) (6,1) Outcome of 2 diceXProbability (1,1)2 (1,2) (2,1)3 (1,3) (2,2) (3,1)4 (1,4) (2,3) (3,2) (4,1)5 (1,5) (2,4) (3,3) (4,2) (5,1)6 (1,6) (2,5) (3,4) (4,3) (5,2) (6,1)7 (2,6) (3,5) (4,4) (5,3) (6,2) Outcome of 2 diceXProbability (1,1)2 (1,2) (2,1)3 (1,3) (2,2) (3,1)4 (1,4) (2,3) (3,2) (4,1)5 (1,5) (2,4) (3,3) (4,2) (5,1)6 (1,6) (2,5) (3,4) (4,3) (5,2) (6,1)7 (2,6) (3,5) (4,4) (5,3) (6,2)8 (3,6) (4,5) (5,4) (6,3) Outcome of 2 diceXProbability (1,1)2 (1,2) (2,1)3 (1,3) (2,2) (3,1)4 (1,4) (2,3) (3,2) (4,1)5 (1,5) (2,4) (3,3) (4,2) (5,1)6 (1,6) (2,5) (3,4) (4,3) (5,2) (6,1)7 (2,6) (3,5) (4,4) (5,3) (6,2)8 (3,6) (4,5) (5,4) (6,3)9 (4,6) (5,5) (6,4) Outcome of 2 diceXProbability (1,1)2 (1,2) (2,1)3 (1,3) (2,2) (3,1)4 (1,4) (2,3) (3,2) (4,1)5 (1,5) (2,4) (3,3) (4,2) (5,1)6 (1,6) (2,5) (3,4) (4,3) (5,2) (6,1)7 (2,6) (3,5) (4,4) (5,3) (6,2)8 (3,6) (4,5) (5,4) (6,3)9 (4,6) (5,5) (6,4)10 (5,6) (6,5)11 12 Outcome of 2 diceXProbability (1,1)2 (1,2) (2,1)3 (1,3) (2,2) (3,1)4 (1,4) (2,3) (3,2) (4,1)5 (1,5) (2,4) (3,3) (4,2) (5,1)6 (1,6) (2,5) (3,4) (4,3) (5,2) (6,1)7 (2,6) (3,5) (4,4) (5,3) (6,2)8 (3,6) (4,5) (5,4) (6,3)9 (4,6) (5,5) (6,4)10 (5,6) (6,5)11 (6,6)12 1/36 2/36 3/36 4/36 5/36 6/36 5/36 4/36 3/36 2/36 1/36

12 Probability histogram

13 Roll a die and let X denote the outcome 1.p(X≤2)= 2.p(X<3)= 3.p(2≤X≤4)= 4.p(X>1)= 5.p(X≤6)= Example Consider an unbalanced die with the following probabilities: Value of XProbability 11/ / /6 Roll a die and let X denote the outcome 1.p(X≤2)=p(X=1)+p(X=2)=1/6+1/6=1/3 2.p(X<3)= p(X=1)+p(X=2)=1/6+1/6=1/3 3.p(2≤X≤4)=p(X=2)+p(X=3)+p(X=4)=1/6+1/6+1/12=5/12 4.p(X>1)=1-p(X=1)=1-1/6=5/6 5.p(X≤6)=sum of all probabilities =1

14 Mean and standard deviation of a random variable

15 Mean of a random variable The mean of a list of numbers is their average The mean of a rv X is a weighted average of the possible values of X. Example – household size in the U.S X Probability Mean of X= =(1)(.251)+(2)(.321)+(3)(.171) +(4)(.154) +(5)(.067) +(6)(.022) +(7)(.014)= =2.587 X

16 X x 1 x 2 x 3 …… x k Probability p 1 p 2 p 3 …… p k μ X = E(X) = x 1 p 1 +x 2 p 2 +……+x k p k = Mean = expected value Mean of a random variable

17 Example Roll a die once and let X denote the outcome: Value of XProbability 11/ Calculate the mean outcome of the die: E(X)=μ X = =(1)(1/6)+(2)(1/6)+(3)(1/6)+(4)(1/6)+(5)(1/6)+(6)(1/6)=3.5

18 Standard deviation of a random variable We saw that we can describe a probability distribution by a probability histogram. For the dice example we had: In addition of the mean we need a measure of the spread. - variance of a random variable X - standard deviation of a random variable X

19 Standard deviation of a random variable Weighted average of the squared deviations of the rv X from its mean μ X. X x 1 x 2 x 3 …… x k Probability p 1 p 2 p 3 …… p k = (x 1 - μ X ) 2 P 1 +(x 2 - μ X ) 2 P 2 +……+(x k - μ X ) 2 P k = = Σ (x i - μ X ) 2 P i i=1 k

20 In the 2 dice example: μ X =2(1/36)+3(2/36)+4(3/36)+5(4/36)+6(5/36)+7(6/36)+ +8(5/36)+9(4/36)+ 10(3/36)+11(2/36)+12(1/36)=7 = Var(X)=(2-7) 2 (1/36)+(3-7) 2 (2/36)+ (4-7) 2 (3/36) +(5- 7) 2 (4/36) +(6-7) 2 (5/36) +(7-7) 2 (6/36)+(8-7) 2 (5/36) +(9- 7) 2 (4/36)+ (10-7) 2 (3/36)+(11-7) 2 (2/36)+(12- 7) 2 (1/36)= = SD(X)=2.415 Outcome of 2 diceXProbability (1,1)21/36 (1,2) (2,1)32/36 (1,3) (2,2) (3,1)43/36 (1,4) (2,3) (3,2) (4,1)54/36 (1,5) (2,4) (3,3) (4,2) (5,1)65/36 (1,6) (2,5) (3,4) (4,3) (5,2) (6,1)76/36 (2,6) (3,5) (4,4) (5,3) (6,2)85/36 (3,6) (4,5) (5,4) (6,3)94/36 (4,6) (5,5) (6,4)103/36 (5,6) (6,5)112/36 (6,6)121/36

21 Question At a news stand, the daily number X of requests for a certain out- of-town newspaper has the following probability distribution: Number Probability (i) What kind of random variable is X? discrete/continuous (ii) What is the probability of fewer than 2 requests? p(X<2)=p(X=1)+p(X=0)= =0.2 (iii) What is the expected number of requests? E(X)= μ X =0(.1)+1(.1)+2(.5)+3(.3)=2 (iv) What is the standard deviation of number of requests? SD(x)=(0-2) 2 (.1) + (1-2) 2 (.1) + (2-2) 2 (.5) + (3-2) 2 (.3)=√.8=.894 (v) What is the shape of the distribution of requests? left skewed / symmetric / right skewed