Download presentation
Presentation is loading. Please wait.
1
Chapter 6 Random Variables
AP Statistics Chapter 6 Random Variables
2
What is a random variable?
A random variable is a variable whose value is a numerical outcome of a random phenomenon. EXAMPLE If we toss four coins, how would we record the results? We could record it as a string of tails and heads like “HTTH” or “HTHH”. This is not a random variable because it has no numerical value to work with. Instead, we may elect to record the number of heads in the four tosses. This would make our sample space 0, 1 , 2, 3, 4 … all numerical outcomes.
3
Discrete vs. Continuous Variables
A discrete random variable has a countable number of possible values. A continuous random variable can take any possible value over an interval. EXAMPLES The number of heads in four coin tosses. A number generated by a spinner that covers the numbers between 0 and 1.
4
Discrete Random Variables
The probability distribution of a discrete variable lists the values and their probabilities. The probabilities must satisfy two requirements: Every probability is between 0 and 1. p1 + p2 + … + pk = 1 Find the probability of any event by adding the individual probabilities that make up that event. Value X X1 X2 X3 … Xk P(X) p1 p2 p3 pk
5
EXAMPLE Determine the probability distribution of the discrete random variable X that counts the number of heads in four coin tosses. We can do this if we make two reasonable assumptions: The coin is balanced, so each toss is equally likely to give an H or T. The coin has no memory, so each toss is independent. Since each outcome is equally likely, what is the probability of each combination?
6
Continued… The number X represents the number of heads in four tosses. These values are NOT equally likely. Use this information to complete your probability distribution. What is the probability of getting 2 or more heads?
7
Means and Variances The mean of a set of observations is:
The mean of a random variable X is also an average of the possible values of X. This average must take in to account that some values of X may occur more frequently than others. We can handle this adjustment by multiplying each outcome by its probability. Value X X1 X2 X3 … Xk P(X) p1 p2 p3 pk
8
EXAMPLE According to Benford’s Law, the distribution of the first digit V in a set of legitimate business records is: Use this information to compute the expected value of any randomly selected first digit. (expected value = mean) The mean of V is: First Digit V: 1 2 3 4 5 6 7 8 9 P(V) 0.301 0.176 0.125 0.097 0.079 0.067 0.058 0.051 0.046
9
EXAMPLE Continued… While the mean of is not a possible outcome of V, it still gives us an idea of where we can expect most values to occur. If each digit was truly random, we would have a uniform distribution. What would the mean be in this case? Notice how this compares to the distribution of Benford’s Law.
10
Variance In a set of discrete values, the variance is based off of how much each value “varies” from the expected amount. In the case of a random variable’s distribution, we must account for the differences in frequency among outcomes. …and the standard deviation is the square root of the variance. Value X X1 X2 X3 … Xk P(X) p1 p2 p3 pk
11
EXAMPLE Gain Communications sells aircraft communication units to both military and civilian markets. Gain uses the modern practice of using probability estimates to estimate sales for the upcoming year. The military division of the company estimates its sales as follows: Calculate the expected number of sales and the standard deviation. Units Sold (X) 1000 3000 5000 10,000 P(X) 0.1 0.3 0.4 0.2
12
HOMEWORK Complete the problems: pg. 353 (#1 – 16). This assignment will be due for completion at the start of the next session of class.
13
Continuous Random Variables
As mentioned before, continuous random variables deal with an infinite number of possible outcomes over a pre-determined interval. Since there are an infinite number of possibilities, the probability of any individual occurrence is practically zero. Suppose we wanted to make a probability distribution for an event like, What would be the theoretical probability assigned to 0.47?
14
Density Curves In order to assign probabilities to events we can use density curves to describe a distribution. The horizontal axis of the density curve will represent all of the occurrences and its height over each occurrence will represent its frequency. The area under the curve over an interval will represent the probability of an event within that interval occurring. The total area under the curve will equal 1.
15
EXAMPLE Let’s revisit the spinner that generates a random number between 0 and 1. What would be the probability of generating a number X between 0.3 and 0.7 ?
16
EXAMPLE Continued Since each number on the spinner has an equal chance of being generated, we will call this a uniform distribution. The area under the curve is 1. Since this is uniform, the curve will be rectangular in shape. The probability of getting a value between 0.3 and 0.7 will be the area between those two values.
17
Taking it further… With the same example in mind, what would be the following: Is there a difference between P(X>8) and P(X>8)?
18
The Normal Distribution
We have discussed a density curve in prior chapters. It was the NORMAL CURVE. The normal distribution is considered a probability distribution. Recall that N(μ, σ) is our shorthand way of referring to the normal distribution having a mean of μ and a standard deviation of σ. To standardize our values and use our normal distribution table, we must use a z-score.
19
EXAMPLE An opinion poll ask an SRS of 1500 American adults what the biggest issue facing schools was. Based on the sample data, 30% of the adults said drugs. We will learn how to analyze this later, but for now, we will say that this is an estimate of the population with a distribution mean of 0.3 and a standard deviation of In other words… What is the probability that the result differs from the truth by more than two percentage points? Hint: Start off by “standardizing” the data.
20
EXAMPLE Continued…
21
HOMEWORK Complete the problems: pg. 355 (#17 – 30). This assignment will be due for completion at the start of the next session of class.
22
Rules for Means If the values of a random variable, X, are increased or decreased by addition or subtraction, then the mean value of X is also increased in the same manner. If the values of a random variable, X, are increased or decreased by multiplication, then the mean value of X is also increased in the same manner. In other words,
23
Rules for Means If we have two random variables, X and Y, then the sum of those two variables will have a mean that is equal to the sum of their individual means. In other words,
24
EXAMPLE Gain Communications sells aircraft communication units to both military and civilian markets. Gain uses the modern practice of using probability estimates to estimate sales for the upcoming year. The military division of the company estimates its sales as follows: The civilian division of the company estimates its sales as follows: Compute the mean sales of each. Units Sold (X) 1000 3000 5000 10,000 P(X) 0.1 0.3 0.4 0.2 Units Sold (Y) 300 500 750 P(Y) 0.4 0.5 0.1
25
EXAMPLE Gain makes a profit of $2000 on each military unit and $3500 on each civilian unit that is sold. The mean military sales profit is: The mean civilian sales profit is: The total profit, Z, is the sum of all sales profits. The mean value of Z would be:
26
Rules for Variance We can apply similar rules to the variances of random variables. In order to do this, we must know if there the two random variables are independent of one another. This would mean that there was a correlation of ZERO between them. If there is a correlation between them, we must account for that correlation when we try to combine variances. It should also be noted that we are working with variances here and not standard deviations.
27
Rules for Variance If X is a random variable and a and b are fixed numbers, then: Notice that addition to X does not affect the variation. Only multiplication does. If X and Y are random variables with complete independence (no correlation):
28
EXAMPLE A college uses SAT scores as one criterion for admission. Experience has shown that the distribution of SAT scores among its entire population of applicants is: What are the mean and standard deviation of the total score X + Y among students applying to this college? NOTE: This is based on the assumption that the scores are independent, which many may argue that they are not. SAT Math Score (X) μx = 625 σx = 90 SAT Verbal Score (Y) μY = 590 σY = 100
29
EXAMPLE A large auto dealership keeps track of sales and lease agreements made during each hour of the day. Let X = the number of cars sold, and let Y = the number of cars leased during the first hour of a randomly selected Friday. Based on previous records, the distributions of X and Y are: Sold X 1 2 3 p 0.3 0.4 0.2 0.1 Leased Y 1 2 p 0.4 0.5 0.1
30
CONTINUED… Find the mean and standard deviation of both X and Y.
Now let’s define the total number of deals as T. (T = X + Y) Find and interpret the mean of T. Now compute the standard deviation of T.
31
CONTINUED Remember that you must deal with variances instead of standard deviations. The dealership’s manager receives a $500 bonus for each car sold and a $300 bonus for each car leased. Find the mean and standard deviation of the manager’s total bonus.
32
Check Your Understanding
Complete the Check Your Understanding problem on the top of pg We will discuss the answers in a moment.
33
HOMEWORK Complete the problems: pg. 378 (#37 – 51). This assignment will be due for completion at the start of the next session of class.
34
The Binomial Setting We have a binomial situation when the following things are in place: Each observation will fall in to one of two categories, usually considered “success” or “failure”. There is a fixed number of observations, “n”. All of the n observations are independent. The probability of success is the same for each observation.
35
Binomial Distributions
In a binomial setting, the random variable X is equal to the number of successes. The probability distribution of X in this case is considered a binomial distribution. The parameters of the distribution are n and p. n represents the number of observations p is the probability of success on any observation. As an abbreviation, we say that X is B(n, p).
36
EXAMPLE Blood type is a trait that is passed through heredity. If both parents carry the genes for both O and A blood types, there is a probability of 0.25 of having a child with Type O blood. If these parents have 5 children, how many children would have Type O blood? This is a binomial distribution B(5, 0.25). Deal 10 cards and let X be the count of the number of red cards. This would not be a binomial distribution because each occurrence is not independent.
37
Computing Binomial Probabilities
If X has the binomial distribution with n observations, having a probability of p for success on each, then the possible values of X are 0, 1, 2, …, n. If k is any of these values, This formula can be applied to find the probability of k number of successes in the situation described.
38
EXAMPLE A quality engineer selects an SRS of 10 switches from a large shipment for detailed inspection. Unknown to the engineer, 10% of the switches in the shipment fail to meet the specifications. What is the probability that exactly 1 of the ten switches in the sample will fail inspection? This is a distribution defined as B(10, .1). In this situation, k = 1. What would be the probability of the engineer finding 1 or fewer defective switches?
39
EXAMPLE 2 Each child of a particular pair of parents has a probability 0.25 of having type O blood. If they have 5 children, what is the probability that exactly 3 of the children have type o blood? There is basically, an 8.8% chance that this could happen! What is the probability that MORE THAN 3 of the children have type O blood?
40
HOMEWORK Complete the problems pg. 403 (#69 – 80). This assignment will be due for completion at the start of the next session of class.
41
Geometric Probability
We have a geometric setting when the following characteristics are in place: 1. Each observation will fall in to one of two categories, usually considered “success” or “failure”. 2. The probability of success is the same for each observation. 3. All of the n observations are independent. 4. The variable of interest, X, is the number of trials required to obtain the first success.
42
EXAMPLE GEOMETRIC DISTRIBUTION BINOMIAL DISTRIBUTION
If we are rolling a single die, and we want to roll a “5”, then how many rolls would it take to get a five for the first time? If we are rolling a die four times, and we want to count the number of fives that we roll … GEOMETRIC DISTRIBUTION BINOMIAL DISTRIBUTION
43
Calculating Geometric Probabilities
If X has a probability p of occurring, and a probability q of not occurring, the possible values of X are 1, 2, 3, … If n is any of these values, the probability that the first success occurs on the nth trial is: What would be the probability that it would take 3 rolls before we got our first five? 6 rolls?
44
Using the TI-84: pdf Just as with binomial probabilities, we can use the calculator to quickly compute geometric probabilities. The geometpdf function will quickly compute the probability for a set number of trials being required to achieve first success. To compute the probability that it would take five rolls to roll a “5” for the first time, we would use:
45
The Geometric Distribution
The geometric probability distribution also has a mean and standard deviation. The mean, or expected value, of a geometric random variable is: The standard deviation of a geometric random variable is:
46
HOMEWORK Complete the problems 8.37 – This assignment will be due for completion at the start of the next session of class.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.