Ch. 16 Resources

Chapter 16: Random Variables

Like the previous two chapters, the computations in this chapter are fairly elementary and require no new calculator or computer techniques, so we will simply work through a few examples.

Playing the lottery

To play the Washington State Lottery's Daily Game, you pay $1 and then choose three digits (each may be 0 through 9, with repeated digits allowed). If the three digits chosen in the daily drawing match the three digits on your ticket (in the same order) you win $500; otherwise, you win nothing. (There are variations of this game, but let's stick with the basic scenario for this example.)

Let's let the random variable represent your net profit when playing this game one time. The possible values that may take on are $499 (the $500 prize if you win, minus the $1 you paid to play the game) or -$1 (in other words, the dollar you lose when you don't win the game). Let's construct a probability model for this game:

outcome profit probability

win $499 0.001
lose -$1 0.999

To compute the probabilities listed in this table we note that there are 10 possible digits for each of the three positions on the ticket, so the probability of matching any one digit is . Since each digit is selected independently of the others, we can compute:

and then note that

Now we compute the mean amount of money you win playing this game:

In other words, each time you play the game you lose an average of $0.50. Of course, you never actually lose $0.50: you either lose $1 or gain $499 (-$0.50 is just an average).

Next we compute the standard deviation but (as we will always do) we start with the variance:

Now take the square root of the variance to get the standard deviation:

Double or nothing

The Daily Game offers the opportunity to wager amounts other than $1 (there is a minimum bet of $0.50) and the prizes are adjusted accordingly. If you double the amount of your original wager (to $2), then the amount of the payout is also doubled (to $1000). Now the possible profits are $998 and -$2, which are just twice the amounts from the previous problem. Thus it makes sense to call the new random variable for the profit on one play of a $2 ticket .

We could note that the probabilities of winning and losing remain the same and create a new probability distribution for :

outcome profit probability

win $998 0.001
lose -$2 0.999

We could then compute the mean as before:

but notice this is the same as:

which uses the general formula from page 415 in the text.

For the variance we could compute:

and then take the square root to get:

Notice, however, that we can also use the general formulas and from the text:

Two tickets

Now suppose that instead of doubling the bet on a single lottery ticket, you simply buy two $1 tickets. You now have two independent events: whether the first ticket is a winner has no influence on whether or not the second ticket is also a winner (assuming you let the lottery computer choose the numbers on each ticket). By contrast, when we doubled the cost and prize money our profit was either $998 or -$2; with two tickets we could gain $998, gain $498, or lose $2.

Since we now have two random variables, let's call your profit on the first ticket and your profit on the second ticket . For any single ticket, we already know the expected value and standard deviation:

and

while

and .

Now let's define a third random variable: , the total profit for the two tickets combined. We could constuct a probability distribution for the new random variable:

outcome profit probability

win both $998 0.000001
win one, lose one   $498 0.001998
lose both -$2 0.998001

To compute the probabilities in the preceding table we note that

and

and finally deduce that

using the "Probability Assignment Rule" Rule.

We can then compute the mean:

and the variance:

and then the standard deviation:

While the above computations do lead to the mean and the standard deviation, there is a much easier way to arrive at the same answers, using the general formulas on pages 415 and 416 of the text for the mean:

the variance:

and the standard deviation:

Two players

Finally, suppose that you buy one ticket and your friend buys another ticket; we can now let the random variable represent your profit from placing a single $1 bet and represent your friend's profit. If you and your friend have a friendly competition to see who has the greater profit, we can consider a new random variable: the difference between your profit and your friend's profit, which we can denote . As above we can create a probability distribution listing the possible outcomes:

outcome difference probability

both win $0 0.000001
you win, she loses   $500 0.000999
you lose, she wins -$500 0.000999
both lose $0 0.998001

You should check that you understand where all of the numbers in table come from. It's not hard to compute the mean, variance and standard deviation using the definitions, as we have in previous cases (you can do this yourself for practice) but it's much quicker to use the appropriate formulas listed on pages 415 and 416 of the text:

Note that we always add variances, even when we're computing the variance of the difference of two independent random variables. It's also important to check, before we use these formulas, that we have two independent random variables.

Homework

Work the following exercises in Chapter 16: 1, 5, 11, 13, 23–29 odd, 37, 39, 41 and 45.

Errata

On page 415, the second equation in the "Double the love" For Example should read:

On page 416, the line below the second displayed equation should read "four times as big" (not "twice as big").

The Just Checking problem on page 418 should be #2 (not #1).

The last line of the first paragraph on page 419 should read "50 tyiyn" (not "tylyn").

The TI-83/84 instructions on page 427 should read 1-VarStats L1,L2.

ActivStats

Work the activities on pages 16-1 and 16-2 in the ActivStats lesson book, as time permits.

Additional Resources

Random Variables
Episode 16 from Against All Odds features a discussion of random variables.
Carnegie Mellon: Introduction to Statistics
Module 8 of Carnegie Mellon's open source Introduction to Statistics course covers many of the same ideas.