Joint distributions

Prev: Moments Next: Transformations

Problems

Exercises marked with s have detailed solutions at http://stat110.net.

Joint, Marginal, and Conditional Distributions

7.1

Alice and Bob arrange to meet for lunch on a certain day at noon. However, neither is known for punctuality. They both arrive independently at uniformly distributed times between noon and 1 pm on that day. Each is willing to wait up to 15 minutes for the other to show up. What is the probability they will meet for lunch that day?


7.2

Alice, Bob, and Carl arrange to meet for lunch on a certain day. They arrive independently at uniformly distributed times between 1 pm and 1:30 pm on that day.

(a) What is the probability that Carl arrives first?

For the rest of this problem, assume that Carl arrives first at 1:10 pm, and condition on this fact.

(b) What is the probability that Carl will be waiting alone for more than 10 minutes?

(c) What is the probability that Carl will have to wait more than 10 minutes until his party is complete?

(d) What is the probability that the person who arrives second will have to wait more than 5 minutes for the third person to show up?


7.3

One of two doctors, Dr. Hibbert and Dr. Nick, is called upon to perform a series of surgeries. Let be the indicator r.v. for Dr. Hibbert performing the surgeries, and suppose that . Given that Dr. Hibbert is performing the surgeries, each surgery is successful with probability , independently. Given that Dr. Nick is performing the surgeries, each surgery is successful with probability , independently. Let be the number of successful surgeries.

(a) Find the joint PMF of and .

(b) Find the marginal PMF of .

(c) Find the conditional PMF of given .


7.4

A fair coin is flipped twice. Let be the number of Heads in the two tosses, and be the indicator r.v. for the tosses landing the same way.

(a) Find the joint PMF of and .

(b) Find the marginal PMFs of and .

(c) Are and independent?

(d) Find the conditional PMFs of given and of given .


7.5

A fair die is rolled, and then a coin with probability of Heads is flipped as many times as the die roll says, e.g., if the result of the die roll is a 3, then the coin is flipped 3 times. Let be the result of the die roll and be the number of times the coin lands Heads.

(a) Find the joint PMF of and . Are they independent?

(b) Find the marginal PMFs of and .

(c) Find the conditional PMFs of given and of given .


7.6

A committee of size is chosen from a group of women and men. All possible committees of size are equally likely. Let and be the numbers of women and men on the committee, respectively.

(a) Find the joint PMF of and . Be sure to specify the support.

(b) Find the marginal PMF of in two different ways: by doing a computation using the joint PMF, and using a story.

(c) Find the conditional PMF of given that .


7.7

A stick of length (a positive constant) is broken at a uniformly random point . Given that , another breakpoint is chosen uniformly on the interval .

(a) Find the joint PDF of and . Be sure to specify the support.

(b) We already know that the marginal distribution of is . Check that marginalizing out from the joint PDF agrees that this is the marginal distribution of .

(c) We already know that the conditional distribution of given is . Check that using the definition of conditional PDFs (in terms of joint and marginal PDFs) agrees that this is the conditional distribution of given .

(d) Find the marginal PDF of .

(e) Find the conditional PDF of given .


7.8

(a) Five cards are randomly chosen from a standard deck, one at a time with replacement. Let be the numbers of chosen queens, kings, and other cards. Find the joint PMF of .

(b) Find the joint PMF of and .

Hint: In summing the joint PMF of over the possible values of , note that most terms are 0 because of the constraint that the number of chosen cards is five.

(c) Now assume instead that the sampling is without replacement (all 5-card hands are equally likely). Find the joint PMF of .

Hint: Use the naive definition of probability.


7.9

Let and be i.i.d. , and .

(a) Find the joint PMF of .

(b) Find the joint PMF of and .

(c) Find the conditional PMF of given , and give a simple description in words of what the result says.


7.10

Let and be i.i.d. , and .

(a) Find the conditional CDF of given . Be sure to specify where it is zero.

(b) Find the conditional PDF , and verify that it is a valid PDF.

(c) Find the conditional PDF , and verify that it is a valid PDF.

Hint: This can be done using Bayes’ rule without having to know the marginal PDF of , by recognizing what the conditional PDF is up to a normalizing constant; then the normalizing constant must be whatever is needed to make the conditional PDF valid.

(d) In Example 8.2.4, we will show that the marginal PDF of is , for . Give a short alternative proof of this fact, based on the previous parts and Bayes’ rule.


7.11

Let be r.v.s such that and conditional on , and are i.i.d. .

(a) Find the joint PDF of .

(b) By definition, and are conditionally independent given . Discuss intuitively whether or not and are also unconditionally independent.

(c) Find the joint PDF of and . You can leave your answer as an integral, though the integral can be done with some algebra (such as completing the square) and facts about the Normal distribution.


7.12

Let , and let be a positive constant.

(a) If you remember the memoryless property, you already know that the conditional distribution of given is the same as the distribution of (think of waiting minutes for a “success” and then having a fresh additional waiting time). Derive this in another way, by finding the conditional CDF of given and the conditional PDF of given .

(b) Find the conditional CDF of given and the conditional PDF of given .


7.13

Let and be i.i.d. . Find the conditional distribution of given in two different ways:

(a) by using calculus to find the conditional PDF.

(b) without using calculus, by arguing that the conditional distribution of given is the same distribution as the unconditional distribution of , and then applying an earlier result about the minimum of independent Exponentials.


7.14

Stat110 solution available.

(a) A stick is broken into three pieces by picking two points independently and uniformly along the stick, and breaking the stick at those two points. What is the probability that the three pieces can be assembled into a triangle?

Hint: A triangle can be formed from 3 line segments of lengths if and only if . The probability can be interpreted geometrically as proportional to an area in the plane, avoiding all calculus, but make sure for that approach that the distribution of the random point in the plane is Uniform over some region.

(b) Three legs are positioned uniformly and independently on the perimeter of a round table. What is the probability that the table will stand?


7.15

Let and be continuous r.v.s., with joint CDF . Show that the probability that falls into the rectangle is


7.16

Let and have joint PDF

(a) Check that this is a valid joint PDF.

(b) Are and independent?

(c) Find the marginal PDFs of and .

(d) Find the conditional PDF of given .


7.17

Let and have joint PDF

(a) Find to make this a valid joint PDF.

(b) Are and independent?

(c) Find the marginal PDFs of and .

(d) Find the conditional PDF of given .


7.18

Stat110 solution available.

Let be a uniformly random point in the triangle in the plane with vertices , , . Find the joint PDF of and , the marginal PDF of , and the conditional PDF of given .


7.19

Stat110 solution available.

A random point is chosen uniformly in the ball

(a) Find the joint PDF of .

(b) Find the joint PDF of .

(c) Find an expression for the marginal PDF of , as an integral.


7.20

Stat110 solution available.

Let be i.i.d. , and let , .

(a) Find the marginal CDF and marginal PDF of , and the joint CDF and joint PDF of .

Hint: For the latter, start by considering .

(b) Find the conditional PDF of given .


7.21

Find the probability that the quadratic polynomial , where the coefficients and are determined by drawing i.i.d. random variables, has at least one real root.

Hint: By the quadratic formula, the polynomial has a real root if and only if .


7.22

Let and each have support marginally, and suppose that the joint PDF of and is positive for and otherwise.

(a) What is the support of the conditional PDF of given ?

(b) Show that and can’t be independent.


7.23

The volume of a region in -dimensional Euclidean space is the integral of 1 over that region. The unit ball in is , the ball of radius 1 centered at 0. As mentioned in Section A.7 of the math appendix, the volume of the unit ball in dimensions is

where is the gamma function, a very famous function which is defined by

for all , and which will play an important role in the next chapter. A few useful facts about the gamma function (which you can assume) are that for any , and that and . Using these facts, it follows that for a positive integer, and we can also find when is a nonnegative integer. For practice, verify that (the area of the unit disk in 2 dimensions) and (the volume of the unit ball in 3 dimensions).

Let be i.i.d.

(a) Find the probability that is in the unit ball in .

(b) Evaluate the result from (a) numerically for , and plot the results (using a computer unless you are extremely good at making hand-drawn graphs). The facts above about the gamma function are sufficient so that you can do this without doing any integrals, but you can also use the command gamma in R to compute the gamma function.

(c) Let be a constant with , and let count how many of the satisfy . What is the distribution of ?

(d) For , use the result of Part (c) to give a simple, short derivation of what happens to the probability from (a) as .


7.24

Stat110 solution available.

Two students, and , are working independently on homework assignments. Student takes hours to finish their homework, while takes hours.

(a) Find the CDF and PDF of , the ratio of their problem-solving times.

(b) Find the probability that finishes their homework before does.


7.25

Two companies, Company 1 and Company 2, have just been founded. Stock market crashes occur according to a Poisson process with rate . Such a crash would put both companies out of business. For , there may be an adverse event of type , which puts Company out of business (if it is not already out of business) but does not affect the other company; such events occur according to a Poisson process with rate . If there has not been a stock market crash or an adverse event of type , then Company remains in business. The three Poisson processes are independent of each other. Let and be how long Company 1 and Company 2 stay in business, respectively.

(a) Find the marginal distributions of and .

(b) Find , and use this to find the joint CDF of and .


7.26

Stat110 solution available.

The bus company from Blissville decides to start service in Blotchville, sensing a promising business opportunity. Meanwhile, Fred has moved back to Blotchville. Now when Fred arrives at the bus stop, either of two independent bus lines may come by (both of which take him home). The Blissville company’s bus arrival times are exactly 10 minutes apart, whereas the time from one Blotchville company bus to the next is . Fred arrives at a uniformly random time on a certain day.

(a) What is the probability that the Blotchville company bus arrives first?

Hint: One good way is to use the continuous law of total probability.

(b) What is the CDF of Fred’s waiting time for a bus?


7.27

A longevity study is being conducted on married hobbit couples. Let be the probability that an individual hobbit lives at least until their eleventy-first birthday, and assume that the lifespans of different hobbits are independent. Let be the number of couples in which neither hobbit reaches age eleventy-one, one hobbit does but not the other, and both hobbits reach eleventy-one, respectively.

(a) Find the joint PMF of .

For the rest of this problem, suppose that it is observed that exactly of the cohort of hobbits reach their eleventy-first birthdays.

(b) Using (a) and the definition of conditional probability, find the conditional PMF of given this information, up to a normalizing constant (that is, you do not need to find the normalizing constant in this part, but just to give a simplified expression that is proportional to the conditional PMF). For simplicity, you can and should ignore multiplicative constants in this part; this includes multiplicative factors that are functions of , since is now being treated as a known constant.

(c) Now obtain the conditional PMF of using a direct counting argument, now including any normalizing constants needed in order to have a valid conditional PMF.

(d) Discuss intuitively whether or not should appear in the answer to (c).

(e) What is the conditional expectation of , given the above information (simplify fully)? This can be done without doing any messy sums.


7.28

There are stores in a shopping center, labeled from 1 to . Let be the number of customers who visit store in a particular month, and suppose that are i.i.d. with PMF . Let be the label of a randomly chosen store, so is the number of customers at a randomly chosen store.

(a) For , find in terms of a sum involving the PMF .

(b) Find the joint PMF of and . Are they independent?

(c) Does , the number of customers for a random store, have the same marginal distribution as , the number of customers for store 1?

(d) Let also be the label of a randomly chosen store, with and independent. Find in terms of a sum involving the PMF . How does compare to for fixed with ?


7.29

Let and be i.i.d. , , and .

(a) Find the joint PMF of and . Are they independent?

(b) Find the marginal distribution of in two ways: using the joint PMF, and using a story.

(c) Find .

Hint: A quick way is to use (b) and the fact that .

(d) Find the joint PMF of and . Are they independent?


7.30

Let have the joint CDF

for , (and otherwise), where the parameter is in .

(a) Find the joint PDF of . For which values of (if any) are they independent?

(b) Explain why we require to be in .

(c) Find the marginal PDFs of and by working directly from the joint PDF from (a). When integrating, do not use integration by parts or computer assistance; rather, pattern-match to facts we know about moments of famous distributions.

(d) Find the marginal CDFs of and by working directly from the joint CDF.


2D LOTUS

7.31

Stat110 solution available.

Let and be i.i.d. . Find the standard deviation of the distance between and .


7.32

Stat110 solution available.

Let be i.i.d. . Find in two different ways: (a) using 2D LOTUS and (b) using the memoryless property without any calculus.


7.33

Alice walks into a post office with 2 clerks. Both clerks are in the midst of serving customers, but Alice is next in line. The clerk on the left takes an time to serve a customer, and the clerk on the right takes an time to serve a customer. Let be the amount of time Alice has to wait until it is her turn.

(a) Write down expressions for the mean and variance of , in terms of double integrals (which you do not need to evaluate).

(b) Find the distribution, mean, and variance of , without using calculus.


7.34

Let be a uniformly random point in the triangle in the plane with vertices , , . Find . (Exercise 18 is about joint, marginal, and conditional PDFs in this setting.)


7.35

A random point is chosen uniformly in the unit disk . Let be its distance from the origin.

(a) Find using 2D LOTUS.

Hint: To do the integral, convert to polar coordinates (see the math appendix).

(b) Find the CDFs of and of without using calculus, using the fact that for a Uniform distribution on a region, probability within that region is proportional to area. Then get the PDFs of and of , and find in two more ways: using the definition of expectation, and using a 1D LOTUS by thinking of as a function of .


7.36

Let and be discrete r.v.s.

(a) Use 2D LOTUS (without assuming linearity) to show that .

(b) Now suppose that and are independent. Use 2D LOTUS to show that they are uncorrelated, i.e., .


7.37

Let and be i.i.d. continuous random variables with PDF , mean , and variance . We know that the expected squared distance of from its mean is , and likewise for ; this problem is about the expected squared distance of from .

(a) Use 2D LOTUS to express as a double integral.

(b) By expanding and evaluating the double integral from (a), show that

(c) Give an alternative proof of the result from (b), based on the trick of adding and subtracting :


Covariance

7.38

Stat110 solution available.

Let and be r.v.s. Is it correct to say ""? Is it correct to say ” since either the max is and the min is or vice versa, and covariance is symmetric”? Explain.


7.39

Stat110 solution available.

Two fair, six-sided dice are rolled (one green and one orange), with outcomes and for the green die and the orange die, respectively.

(a) Compute the covariance of and .

(b) Are and independent?


7.40

Let and be i.i.d. .

(a) Compute the covariance of and .

(b) Are and independent?


7.41

Stat110 solution available.

Let and be standardized r.v.s (i.e., marginally they each have mean 0 and variance 1) with correlation . Find (in terms of ) such that and are uncorrelated but still standardized.


7.42

Stat110 solution available.

Let be the number of distinct birthdays in a group of 110 people (i.e., the number of days in a year such that at least one person in the group has that birthday). Under the usual assumptions (no February 29, all the other 365 days of the year are equally likely, and the day when one person is born is independent of the days when the other people are born), find the mean and variance of .


7.43

(a) Let and be Bernoulli r.v.s, possibly with different parameters. Show that if and are uncorrelated, then they are independent.

(b) Give an example of three Bernoulli r.v.s such that each pair of them is uncorrelated, yet the three r.v.s are dependent.


7.44

Find the variance of the number of toys needed until you have a complete set in Example 4.3.12 (the coupon collector problem), as a sum.


7.45

A random triangle is formed in some way, such that all pairs of angles have the same joint distribution. What is the correlation between two of the angles (assuming that the variance of the angles is nonzero)?


7.46

Each of people puts their name on a slip of paper (no two have the same name). The slips of paper are shuffled in a hat, and then each person draws one (uniformly at random at each stage, without replacement). Find the standard deviation of the number of people who draw their own names.


7.47

As in Example 4.4.7, an urn contains white balls and black balls. The balls are randomly drawn one by one without replacement until white balls have been drawn. Let be the number of black balls drawn before drawing the th white ball. In this exercise, you will derive .

As explained in Example 4.4.7, we can assume that we continue drawing balls until the urn has been emptied out. Label the black balls as , and write

where is the indicator of black ball being drawn before the th white ball is drawn.

(a) Show that

(b) Give an intuitive explanation of whether and are positively correlated, uncorrelated, or negatively correlated, for .

(c) Show that for ,

Hint: Imagine slots, into which black balls and and the white balls will be placed. All orderings for these balls are equally likely.

(d) Find an expression for . With some algebra (which you don’t have to do), your expression should simplify to


7.48

Stat110 solution available.

Athletes compete one at a time at the high jump. Let be how high the th jumper jumped, with i.i.d. with a continuous distribution. We say that the th jumper sets a record if is greater than all of .

Find the variance of the number of records among the first jumpers (as a sum). What happens to the variance as ?


7.49

Stat110 solution available.

A chicken lays a number of eggs. Each egg hatches a chick with probability , independently. Let be the number which hatch, so .

Find the correlation between (the number of eggs) and (the number of eggs which hatch). Simplify; your final answer should work out to a simple function of (the should cancel out).


7.50

Let be random variables such that for all . Show that . This is a bound on how negatively correlated a collection of r.v.s can all be with each other.

Hint: Assume for all ; this can be done without loss of generality, since rescaling two r.v.s does not affect the correlation between them. Then use the fact that .


7.51

Let and be independent r.v.s. Show that

Hint: It is often useful when working with a second moment to express it as .


7.52

Stat 110 shirts come in 3 sizes: small, medium, and large. There are shirts of each size (where ). There are students. For each size, of the students have that size as the best fit. This seems ideal. But suppose that instead of giving each student the right size shirt, each student is given a shirt completely randomly (all allocations of the shirts to the students, with one shirt per student, are equally likely). Let be the number of students who get their right size shirt.

(a) Find .

(b) Give each student an ID number from 1 to , such that the right size shirt is small for students 1 through , medium for students through , and large for students through . Let be the event that student gets their right size shirt. Find and .

(c) Find .


7.53

Stat110 solution available.

A drunken man wanders around randomly in a large space. At each step, he moves one unit of distance North, South, East, or West, with equal probabilities. Choose coordinates such that his initial position is and if he is at at some time, then one step later he is at , , , or . Let and be his position and distance from the origin after steps, respectively.

General hint: Note that is a sum of r.v.s with possible values , and likewise for , but be careful throughout the problem about independence.

(a) Determine whether or not is independent of .

(b) Find .

(c) Find .


7.54

A scientist makes two measurements, considered to be independent standard Normal r.v.s. Find the correlation between the larger and smaller of the values.

Hint: Note that and .


7.55

Stat110 solution available.

Let and .

(a) Find the distribution of (give the PDF and, if it is a named distribution we have studied, its name and parameters).

Hint: Find the support of , and then find the CDF of by reducing to probability calculations about .

(b) Show that and are uncorrelated, but not independent. This is also another example illustrating the fact that knowing the marginal distributions of two r.v.s does not determine the joint distribution.


7.56

Stat110 solution available.

Consider the following method for creating a bivariate Poisson (a joint distribution for two r.v.s such that both marginals are Poissons). Let , where are i.i.d. (the idea is to have something borrowed and something new but not something old or something blue).

(a) Find .

(b) Are and independent? Are they conditionally independent given ?

(c) Find the joint PMF of (as a sum).


7.57

You are playing an exciting game of Battleship. Your opponent secretly positions ships on a 10 by 10 grid and you try to guess where the ships are. Each of your guesses is a hit if there is a ship there and a miss otherwise.

The game has just started and your opponent has 3 ships: a battleship (length 4), a submarine (length 3), and a destroyer (length 2). (Usually there are 5 ships to start, but to simplify the calculations we are considering 3 here.) You are playing a variation in which you unleash a salvo, making 5 simultaneous guesses. Assume that your 5 guesses are a simple random sample drawn from the 100 grid positions, i.e., all sets of 5 grid positions are equally likely.

Find the mean and variance of the number of distinct ships you will hit in your salvo. (Give exact answers in terms of binomial coefficients or factorials, and also numerical values computed using a computer.)

Hint: First work in terms of the number of ships missed, expressing this as a sum of indicator r.v.s. Then use the fundamental bridge and naive definition of probability.


7.58

This problem explores a visual interpretation of covariance. Data are collected for individuals, where for each individual two variables are measured (e.g., height and weight). Assume independence across individuals (e.g., person 1’s variables gives no information about the other people), but not within individuals (e.g., a person’s height and weight may be correlated).

Let be the data points, with . The data are considered here as fixed, known numbers: they are the observed values after performing an experiment. Imagine plotting all the points in the plane, and drawing the rectangle determined by each pair of points. For example, the points and determine the rectangle with vertices , , , .

The signed area contributed by and is the area of the rectangle they determine if the slope of the line between them is positive, and is the negative of the area of the rectangle they determine if the slope of the line between them is negative. (Define the signed area to be 0 if or , since then the rectangle is degenerate.) So the signed area is positive if a higher value goes with a higher value for the pair of points, and negative otherwise. Assume that the are all distinct and the are all distinct.

(a) The sample covariance of the data is defined to be

where

are the sample means. (There are differing conventions about whether to divide by or in the definition of sample covariance, but that need not concern us here.)

Let be one of the pairs, chosen uniformly at random. Determine precisely how is related to the sample covariance.

(b) Let be as in (a), and be an independent draw from the same distribution. That is, and are randomly chosen from the points, independently (so it is possible for the same point to be chosen twice).

Express the total signed area of the rectangles as a constant times . Then show that the sample covariance of the data is a constant times the total signed area of the rectangles.

Hint: Consider in two ways: as the average signed area of the random rectangle formed by and , and using properties of expectation to relate it to . For the former, consider the possibilities for which point is and which point ; note that such choices result in degenerate rectangles.

(c) Based on the interpretation from (b), give intuitive explanations of why for any r.v.s and constants , covariance has the following properties:

(i) ;

(ii) ;

(iii) ;

(iv) .


7.59

A statistician is trying to estimate an unknown parameter based on some data. She has available two independent estimators and (an estimator is a function of the data, used to estimate a parameter). For example, could be the sample mean of a subset of the data and could be the sample mean of another subset of the data, disjoint from the subset used to calculate . Assume that both of these estimators are unbiased, i.e., .

Rather than having a bunch of separate estimators, the statistician wants one combined estimator. It may not make sense to give equal weights to and since one could be much more reliable than the other, so she decides to consider combined estimators of the form

a weighted combination of the two estimators. The weights and are nonnegative and satisfy .

(a) Check that is also unbiased, i.e., .

(b) Determine the optimal weights , in the sense of minimizing the mean squared error . Express your answer in terms of the variances of and . The optimal weights are known as Fisher weights.

Hint: As discussed in Exercise 53 from Chapter 5, mean squared error is variance plus squared bias, so in this case the mean squared error of is . Note that there is no need for multivariable calculus here, since .

(c) Give a simple description of what the estimator found in (b) amounts to if the data are i.i.d. random variables , is the sample mean of , and is the sample mean of .


Chicken-Egg

7.60

Stat110 solution available.

A number of people vote in a certain election. Each voter votes for candidate with probability and for candidate with probability , independently of all the other voters. Let be the difference in votes, defined as the number of votes for minus the number for .

(a) Find .

(b) Find .


7.61

A traveler gets lost times on a long journey. When lost, the traveler asks someone for directions with probability . Let be the number of times that the traveler is lost and asks for directions, and be the number of times that the traveler is lost and does not ask for directions.

(a) Find the joint PMF of . Are they independent?

(b) Find the joint PMF of . Are they independent?

(c) Find the joint PMF of . Are they independent?


7.62

The number of people who visit the Leftorium store in a day is . Suppose that 10% of customers are sinister (left-handed), and 90% are dexterous (right-handed). Half of the sinister customers make purchases, but only a third of the dexterous customers make purchases. The characteristics and behavior of people are independent, with probabilities as described in the previous two sentences. On a certain day, there are 42 people who arrive at the store but leave without making a purchase. Given this information, what is the conditional PMF of the number of customers on that day who make a purchase?


7.63

A chicken lays eggs. Each egg independently does or doesn’t hatch, with probability of hatching. For each egg that hatches, the chick does or doesn’t survive (independently of the other eggs), with probability of survival. Let be the number of eggs which hatch, be the number of chicks which survive, and be the number of chicks which hatch but don’t survive (so ). Find the marginal PMF of , and the joint PMF of and . Are and independent?


7.64

There will be courses offered at a certain school next year.

(a) Find the expected number of choices of 4 courses (in terms of , fully simplified), assuming that simultaneous enrollment is allowed if there are time conflicts.

(b) Now suppose that simultaneous enrollment is not allowed. Suppose that most faculty only want to teach on Tuesdays and Thursdays, and most students only want to take courses that start at 10 am or later, and as a result there are only four possible time slots: 10 am, 11:30 am, 1 pm, 2:30 pm (each course meets Tuesday-Thursday for an hour and a half, starting at one of these times). Rather than trying to avoid major conflicts, the school schedules the courses completely randomly: after the list of courses for next year is determined, they randomly get assigned to time slots, independently and with probability for each time slot.

Let and be the number of morning and afternoon courses for next year, respectively (where “morning” means starting before noon). Find the joint PMF of and , i.e., find for all .

(c) Continuing as in (b), let be the number of 10 am, 11:30 am, 1 pm, 2:30 pm courses for next year, respectively. What is the joint distribution of ? (The result is completely analogous to that of ; you can derive it by thinking conditionally, but for this part you are also allowed to just use the fact that the result is analogous to that of (b).) Use this to find the expected number of choices of 4 non-conflicting courses (in terms of , fully simplified). What is the ratio of the expected value from (a) to this expected value?


Multinomial

7.65

Stat110 solution available.

Let be Multinomial with parameters and . Use indicator r.v.s to show that for .


7.66

Stat110 solution available.

Consider the birthdays of 100 people. Assume people’s birthdays are independent, and the 365 days of the year (exclude the possibility of February 29) are equally likely. Find the covariance and correlation between how many of the people were born on January 1 and how many were born on January 2.


7.67

A certain course has freshmen, sophomores, juniors, and seniors. Let be the number of freshmen and sophomores (total), the number of juniors, and the number of seniors in a random sample of size , where for Part (a) the sampling is with replacement and for Part (b) the sampling is without replacement (for both parts, at each stage the allowed choices have equal probabilities).

(a) Find the joint PMF of , for sampling with replacement.

(b) Find the joint PMF of , for sampling without replacement.


7.68

Stat110 solution available.

A group of people decide to play an exciting game of Rock-Paper-Scissors. As you may recall, Rock smashes Scissors, Scissors cuts Paper, and Paper covers Rock (despite Bart Simpson once saying “Good old rock, nothing beats that!”).

Usually this game is played with 2 players, but it can be extended to more players as follows. If exactly 2 of the 3 choices appear when everyone reveals their choice, say where beats , the game is decisive: the players who chose win, and the players who chose lose. Otherwise, the game is indecisive and the players play again.

For example, with 5 players, if one player picks Rock, two pick Scissors, and two pick Paper, the round is indecisive and they play again. But if 3 pick Rock and 2 pick Scissors, then the Rock players win and the Scissors players lose the game.

Assume that the players independently and randomly choose between Rock, Scissors, and Paper, with equal probabilities. Let be the number of players who pick Rock, Scissors, Paper, respectively in one game.

(a) Find the joint PMF of .

(b) Find the probability that the game is decisive. Simplify your answer.

(c) What is the probability that the game is decisive for ? What is the limiting probability that a game is decisive as ? Explain briefly why your answer makes sense.


7.69

Stat110 solution available.

Emails arrive in an inbox according to a Poisson process with rate (so the number of emails in a time interval of length is distributed as , and the numbers of emails arriving in disjoint time intervals are independent). Let be the numbers of emails that arrive from 9 am to noon, noon to 6 pm, and 6 pm to midnight (respectively) on a certain day.

(a) Find the joint PMF of .

(b) Find the conditional joint PMF of given that .

(c) Find the conditional PMF of given that .

(d) Find and . (Conditional expectation and conditional variance given an event are defined in the same way as expectation and variance, using the conditional distribution given the event in place of the unconditional distribution; these concepts are explored more in Chapter 9.)


7.70

Let be the number of statistics majors in a certain college in the Class of 2030, viewed as an r.v. Each statistics major chooses between two tracks: a general track in statistical principles and methods, and a track in quantitative finance. Suppose that each statistics major chooses randomly which of these two tracks to follow, independently, with probability of choosing the general track. Let be the number of statistics majors who choose the general track, and be the number of statistics majors who choose the quantitative finance track.

(a) Suppose that . (This isn’t the exact distribution in reality since a Poisson is unbounded, but it may be a very good approximation.) Find the correlation between and .

(b) Let be the size of the Class of 2030, where is a known constant. For this part and the next, instead of assuming that is Poisson, assume that each of the students chooses to be a statistics major with probability , independently. Find the joint distribution of , and the number of non-statistics majors, and their marginal distributions.

(c) Continuing as in (b), find the correlation between and .


7.71

In humans (and many other organisms), genes come in pairs. Consider a gene of interest, which comes in two types (alleles): type and type . The genotype of a person for that gene is the types of the two genes in the pair: , , or ( is equivalent to ). According to the Hardy-Weinberg law, for a population in equilibrium, the frequencies of , , will be , , , respectively, for some with .

Suppose that the Hardy-Weinberg law holds, and that people are drawn randomly from the population, independently. Let be the number of people in the sample with genotypes , , , respectively.

(a) What is the joint PMF of ?

(b) What is the distribution of the number of people in the sample who have an ?

(c) What is the distribution of how many of the genes among the people are ‘s?

(d) Now suppose that is unknown, and must be estimated using the observed data . The maximum likelihood estimator (MLE) of is the value of for which the observed data are as likely as possible. Find the MLE of .

(e) Now suppose that is unknown, and that our observations can’t distinguish between and . So for each person in the sample, we just know whether or not that person is an (in genetics terms, and have the same phenotype, and we only get to observe the phenotypes, not the genotypes). Find the MLE of .


Multivariate Normal

7.72

Stat110 solution available.

Let be Bivariate Normal, with and marginally and with correlation between and .

(a) Show that is also Bivariate Normal.

(b) Find the joint PDF of and (without using calculus), assuming that .


7.73

Let the joint PDF of and be

for all and , where is a constant.

(a) Find to make this a valid joint PDF.

(b) What are the marginal distributions of and ? Are and independent?

(c) Is Bivariate Normal?


7.74

Let the joint PDF of and be

for , where is a constant (the joint PDF is 0 for ).

(a) Find to make this a valid joint PDF.

(b) What are the marginal distributions of and ? Are and independent?

(c) Is Bivariate Normal?


7.75

Let be i.i.d. . Find the joint MGF of .


7.76

Let and be i.i.d. , and let be a random sign ( or , with equal probabilities) independent of .

(a) Determine whether or not is Multivariate Normal.

(b) Determine whether or not is Multivariate Normal.

(c) Determine whether or not is Multivariate Normal.


7.77

Let be Bivariate Normal with and marginally and with . Find a constant such that is independent of .

Hint: First find (in terms of ) such that and are uncorrelated.


7.78

A mother and a father have 6 children. The 8 heights in the family (in inches) are r.v.s (with the same distribution, but not necessarily independent).

(a) Assume for this part that the heights are all independent. On average, how many of the children are taller than both parents?

(b) Let be the height of the mother, be the height of the father, and be the heights of the children. Suppose that is Multivariate Normal, with marginals and for , with . On average, how many of the children are more than 1 inch taller than their mother?


Mixed Practice

7.79

Cars pass by a certain point on a road according to a Poisson process with rate cars/minute. Let be the number of cars that pass by that point in the time interval , with measured in minutes.

(a) A certain device is able to count cars as they pass by, but it does not record the arrival times. At time 0, the counter on the device is reset to 0. At time 3 minutes, the device is observed and it is found that exactly 1 car had passed by. Given this information, find the conditional CDF of when that car arrived. Also describe in words what the result says.

(b) In the late afternoon, you are counting blue cars. Each car that passes by is blue with probability , independently of all other cars. Find the joint PMF and marginal PMFs of the number of blue cars and number of non-blue cars that pass by the point in 10 minutes.


7.80

In a U.S. election, there will be registered voters. Suppose each registered voter is a registered Democrat with probability and a registered Republican with probability , independent of other voters. Also, each registered voter shows up to the polls with probability and stays home with probability , independent of other voters and independent of their own party affiliation. In this problem, we are interested in , the number of registered Democrats who actually vote.

(a) What is the distribution of , before we know anything about the number of registered voters?

(b) Suppose we learn that ; that is, people registered to vote. What is the conditional distribution of given this information?

(c) Suppose we learn there were registered Democrats and registered Republicans (where ). What is the conditional distribution of given this information?

(d) Finally, we learn in addition to all of the above information that people showed up at the polls on election day. What is the conditional distribution of given this information?


7.81

A certain college has freshmen, sophomores, juniors, and seniors. A certain class at the college consists of a simple random sample of size students, i.e., all sets of of the students are equally likely. Let be the numbers of freshmen, sophomores, juniors, and seniors in the class.

(a) Find the joint PMF of .

(b) Give both an intuitive explanation and a mathematical justification for whether or not the distribution from (a) is Multinomial.

(c) Find , fully simplified.

Hint: Take the variance of both sides of .


7.82

Let and let be a nonnegative random variable, discrete or continuous, whose MGF is finite everywhere. Show that for a certain value of (which you should specify).


7.83

A publishing company employs two proofreaders, Prue and Frida. When Prue is proofreading a book, for each typo she has probability of catching it and of missing it, independently. When Frida is proofreading a book, for each typo she has probability of catching it and of missing it, independently.

(a) A certain book draft has typos. The company randomly assigns it to one of the two proofreaders, with equal probabilities. Find the distribution of the number of typos that get detected.

(b) Another book is being written. When a draft of the book is complete, it will have a number of typos, and will be assigned to Prue to proofread. Find the probability that Prue catches exactly typos, given that she misses exactly typos.


7.84

Two authors, Bob and Martha, are about to begin writing an epic co-authored book, The Adventures of Aaron the Aardwolf. It will take them years to write. When they finish this book, they will immediately begin work on new, individually authored books. Bob will spend years writing The Bilinear Bonanza of Bonnie the Butterfly, and Martha will spend years writing Memoirs of Maude the Magnificent Mangabey, independently. Suppose that are i.i.d. . On a timeline where time 0 is defined as the time when they begin their collaboration, consider the following quantities.

: time at which The Adventures of Aaron the Aardwolf is completed;

: time at which The Bilinear Bonanza of Bonnie the Butterfly is completed;

: time at which Memoirs of Maude the Magnificent Mangabey is completed;

: time at which the last to be completed of these three books is completed.

(a) Find the distribution of (which is also the distribution of ).

(b) Find .

(c) Find .


7.85

A DNA sequence can be represented as a sequence of letters, where the alphabet has 4 letters: A,C,G,T. Suppose that a random DNA sequence of length is formed by independently generating letters one at a time, with the probabilities of A,C,G,T, respectively, where .

(a) Find the covariance between the number of A’s and the number of C’s in the sequence.

(b) It is observed that the sequence contains exactly A’s, C’s, G’s, and T’s, where and . Given this information, find the probability that the first A in the sequence is followed immediately by another A.

Hint: How does this part relate to Exercise 74 in Chapter 2?

(c) Given the information from (b) about how many times each letter occurs, find the expected number of occurrences of the expression CAT in the sequence.


7.86

To test for a certain disease, the level of a certain substance in the blood is measured. Let be this measurement, considered as a continuous r.v. The patient tests positive (i.e., is declared to have the disease) if and tests negative if , where is a threshold decided upon in advance. Let be the indicator of having the disease. As discussed in Example 2.3.9, the sensitivity of the test is the probability of testing positive given that the patient has the disease, and the specificity of the test is the probability of testing negative given that the patient does not have the disease.

(a) The ROC (receiver operator characteristic) curve of the test is the plot of sensitivity vs. 1 minus specificity, where sensitivity (the vertical axis) and 1 minus specificity (the horizontal axis) are viewed as functions of the threshold . ROC curves are widely used in medicine and engineering as a way to study the performance of procedures for classifying individuals into two groups (in this case, the two groups are “diseased people” and “non-diseased people”).

Given that , has CDF and PDF ; given that , has CDF and PDF . Here and are positive on an interval and 0 outside this interval. Show that the area under the ROC curve is the probability that a randomly selected diseased person has a higher value than a randomly selected non-diseased person.

(b) Explain why the result of (a) makes sense in two extreme cases: when , and when there is a threshold such that and are very close to 1.


7.87

Let be Discrete Uniform on .

(a) Find and , fully simplified, using results from Section A.8 of the math appendix.

(b) Discuss intuitively whether the results in (a) should be approximately the same as the mean and variance (respectively) of a Uniform distribution on a certain interval.

(c) Let be i.i.d. r.v.s, and let be their ranks (the smallest has rank 1, the next has rank 2, …, and the largest has rank ). Explain why

where . Then use this to find and directly using symmetry, linearity, the fundamental bridge, and properties of covariance.

(d) Explain how the results of (a) and (c) relate. Then prove the identities

by giving them probabilistic interpretations.


7.88

Stat110 solution available.

A network consists of nodes, each pair of which may or may not have an edge joining them. For example, a social network can be modeled as a group of nodes (representing people), where an edge between and means they know each other. Assume the network is undirected and does not have edges from a node to itself (for a social network, this says that if knows , then knows and that, contrary to Socrates’ advice, a person does not know himself or herself). A clique of size is a set of nodes where every node has an edge to every other node (i.e., within the clique, everyone knows everyone). An anticlique of size is a set of nodes where there are no edges between them (i.e., within the anticlique, no one knows anyone else). For example, the picture below shows a network with nodes labeled , where is a clique of size 4, and is an anticlique of size 3.

(a) Form a random network with nodes by independently flipping fair coins to decide for each pair whether there is an edge joining them. Find the expected number of cliques of size (in terms of and ).

(b) A triangle is a clique of size 3. For a random network as in (a), find the variance of the number of triangles (in terms of ).

Hint: Find the covariances of the indicator random variables for each possible clique. There are such indicator r.v.s, some pairs of which are dependent.

*(c) Suppose that . Show that there is a network with nodes containing no cliques of size or anticliques of size .

Hint: Explain why it is enough to show that for a random network with nodes, the probability of the desired property is positive; then consider the complement.


7.89

Stat110 solution available.

Shakespeare wrote a total of 884647 words in his known works. Of course, many words are used more than once, and the number of distinct words in Shakespeare’s known writings is 31534 (according to one computation). This puts a lower bound on the size of Shakespeare’s vocabulary, but it is likely that Shakespeare knew words which he did not use in these known writings.

More specifically, suppose that a new poem of Shakespeare were uncovered, and consider the following (seemingly impossible) problem: give a good prediction of the number of words in the new poem that do not appear anywhere in Shakespeare’s previously known works.

Ronald Thisted and Bradley Efron studied this problem in the papers [8] and [9], developing theory and methods and then applying the methods to try to determine whether Shakespeare was the author of a poem discovered by a Shakespearean scholar in 1985. A simplified version of their method is developed in the problem below. The method was originally invented by Alan Turing (the founder of computer science) and I.J. Good as part of the effort to break the German Enigma code during World War II.

Let be the number of distinct words that Shakespeare knew, and assume these words are numbered from 1 to . Suppose for simplicity that Shakespeare wrote only two plays, and . The plays are reasonably long and they are of the same length. Let be the number of times that word appears in play , and be the number of times it appears in play , for .

(a) Explain why it is reasonable to model as being Poisson, and as being Poisson with the same parameter as .

(b) Let the numbers of occurrences of the word “eyeball” (which was coined by Shakespeare) in the two plays be independent r.v.s. Show that the probability that “eyeball” is used in play but not in play is

(c) Now assume that from (b) is unknown and is itself taken to be a random variable to reflect this uncertainty. So let have a PDF . Let be the number of times the word “eyeball” appears in play and be the corresponding value for play . Assume that the conditional distribution of given is that they are independent r.v.s. Show that the probability that “eyeball” is used in play but not in play is the alternating series

Hint: Condition on and use Part (b).

(d) Assume that every word’s numbers of occurrences in and are distributed as in Part (c), where may be different for different words but is fixed. Let be the number of words that appear exactly times in play . Show that the expected number of distinct words appearing in play but not in play is

(This shows that is an unbiased predictor of the number of distinct words appearing in play but not in play : on average it is correct. Moreover, it can be computed just from having seen play , without needing to know or any of the . This method can be extended in various ways to give predictions for unobserved plays based on observed plays.)