Problem 13
Question
A box contains an unknown number \(N\) of identical bolts. In order to get an idea of the size \(N\), we randomly mark one of the bolts from the box. Next we select at random a bolt from the box. If this is the marked bolt we stop, otherwise we return the bolt to the box, and we randomly select a second one, etc. We stop when the selected bolt is the marked one. Let \(X\) be the number of times a bolt was selected. Later (in Exercise 21.11) we will try to find an estimate of \(N\). Here we look at the probability distribution of \(X\). a. What is the probability distribution of \(X ?\) Specify its parameter(s)! b. The drawback of this approach is that \(X\) can attain any of the values \(1,2,3, \ldots\), so that if \(N\) is large we might be sampling from the box for quite a long time. We decide to sample from the box in a slightly different way: after we have randomly marked one of the bolts in the box, we select at random a bolt from the box. If this is the marked one, we stop, otherwise we randomly select a second bolt (we do not return the selected bolt). We stop when we select the marked bolt. Let \(Y\) be the number of times a bolt was selected. Show that \(\mathrm{P}(Y=k)=1 / N\) for \(k=1,2, \ldots, N\) ( \(Y\) has a so-called discrete uniform distribution). c. Instead of randomly marking one bolt in the box, we mark \(m\) bolts, with \(m\) smaller than \(N\). Next, we randomly select \(r\) bolts; \(Z\) is the number of marked bolts in the sample. Show that $$ \mathrm{P}(Z=k)=\frac{\left(\begin{array}{c} m \\ k \end{array}\right)\left(\begin{array}{c} N-m \\ r-k \end{array}\right)}{\left(\begin{array}{c} N \\ r \end{array}\right)}, \quad \text { for } \quad k=0,1,2, \ldots, r $$ ( \(Z\) has a so-called hypergeometric distribution, with parameters \(m, N\), and \(r .\) )
Step-by-Step Solution
VerifiedKey Concepts
Geometric Distribution
- The random variable, denoted as \(X\), counts the number of times we select a bolt before getting the marked one.
- The probability \(p\) of choosing the marked bolt in any single attempt is \( \frac{1}{N} \), where \(N\) is the total number of bolts.
This formula reflects the concept that multiple failures occur before the first success. Hence, it's suitable for scenarios where repeated attempts are involved.
Discrete Uniform Distribution
- Here, the random variable \(Y\) represents the number of trials needed until the marked bolt is selected.
- All values from \(1\) to \(N\) are equally likely outcomes given an unknown number of unmarked and marked bolts.
Such distributions are crucial in simulations and games of chance, where each outcome is supposed to occur with equal likelihood.
Hypergeometric Distribution
- This scenario is represented by the random variable \(Z\), which counts the number of marked bolts in a sample of \(r\) bolts.
- The population includes \(N\) bolts in total, with \(m\) of them being marked.
Applications of hypergeometric distributions include quality control and lottery drawings, where the success of selections is sensitive to the lack of replacement.