Problem 4
Question
During the Second World War, London was hit by numerous flying bombs. The following data are from an area in South London of 36 square kilometers. The area was divided into 576 squares with sides of length \(1 / 4\) kilometer. For each of the 576 squares the number of hits was recorded. In this way we obtain a dataset \(x_{1}, x_{2}, \ldots, x_{576}\), where \(x_{i}\) denotes the number of hits in the \(i\) th square. The data are summarized in the following table which lists the number of squares with no hits, 1 hit, 2 hits, etc. $$ \begin{array}{lcccccccc} \hline \hline \text { Number of hits } & 0 & 1 & 2 & 3 & 4 & 5 & 6 & 7 \\ \text { Number of squares } & 229 & 211 & 93 & 35 & 7 & 0 & 0 & 1 \\ \hline \hline \end{array} $$ An interesting question is whether London was hit in a completely random manner. In that case a Poisson distribution should fit the data. a. If we model the dataset as the realization of a random sample from a Poisson distribution with parameter \(\mu\), then what would you choose as an estimate for \(\mu\) ? b. Check the fit with a Poisson distribution by comparing some of the observed relative frequencies of 0 's, 1 's, 2 's, etc., with the corresponding probabilities for the Poisson distribution with \(\mu\) estimated as in part a.
Step-by-Step Solution
VerifiedKey Concepts
Random Sample
- Randomness ensures every unit or individual from the population deserves equal opportunity to be chosen.
- Maintaining randomness helps in making generalizations about the broader population from which the sample was drawn.
Parameter Estimation
To estimate \( \mu \) for the bombing data, we need to calculate the sample mean, which is the total number of hits divided by the total number of squares. This step gives us an estimate of the average number of hits per square. The formula used is:
\[ \mu = \frac{\text{Total Hits}}{\text{Total Squares}} \]
This estimate serves as a crucial step in applying the Poisson model to predict probabilities of different numbers of hits. Once \( \mu \) is estimated as 0.9323, it sharply informs the subsequent calculation of Poisson probabilities and plays a central role in evaluating the fit of the Poisson model to the observed data.
Probability Distribution
The Poisson distribution is defined by its parameter \( \mu \), which is both the mean and variance of the distribution. The probability of observing exactly \( k \) events is computed using the formula:
\[ P(X = k) = \frac{e^{-\mu} \mu^k}{k!} \]
This expression allows us to determine the likelihood of different numbers of hits, providing a theoretical framework to match with our actual data. When the calculated probabilities closely align with observed relative frequencies, it indicates a particularly good fit of the Poisson model to the dataset. This congruence helps us confirm that the distribution describes the random processes in the scenario accurately.
Relative Frequency
In our example, we calculate the relative frequency for each number of hits over all the squares. For example, the relative frequency for 0 hits is calculated by dividing the number of squares with 0 hits by the total squares: \( P(0) = \frac{229}{576} \). This process is repeated for each hit count.
- Relative frequencies provide an intuitive understanding of the dataset's distribution.
- These frequencies can be directly compared to theoretical probabilities from the Poisson distribution.