Exploring Data
The Practice of Statistics for AP ยท 193 exercises
Q 73.
The histogram below shows the distribution of the per cent of women aged 15 and over who have never married in each of the 50 states and the District of Columbia.
The center of this distribution is in the interval
(a) to
(b) to
(c)
(d) to
(e) to
3 step solution
Q 74.
The histogram below shows the distribution of the percent of women aged and over who have never married in each of the states and the District of Columbia.
In about what percent of states have at least of women aged and over never married?
(a) (b) (c)(d) (e)
3 step solution
Q 75.
Baseball players (Introduction) Here is a small part of a data set that describes Major League Baseball players as of opening day of the 2009 season:
(a) What individuals does this data set describe?
(b) In addition to the player’s name, how many variables does the data set contain? Which of these variables are categorical and which are quantitative?
(c) What do you think are the units of measurement for each of the quantitative variables?
4 step solution
Q 76.
The rating service Arbitron asked adults who used several high-tech devices and services whether they “loved” using them. Below is a graph of the percent who said they did.
(a) Summarize what this graph tells you in a sentence or two.
(b) Would it be appropriate to make a pie chart of these data? Why or why not?
4 step solution
Q 77.
A study in Sweden looked at former elite soccer players, people who had played soccer but not at the elite level, and people of the same age who did not play soccer. Here is a two-way table that classifies these individuals by whether or not they had arthritis of the hip or knee by their mid-fifties:
(a) What percent of the people in this study were elite soccer players? What percent had arthritis?
(b) What percent of the elite soccer players had arthritis? What percent of those who had arthritis were elite soccer players?
4 step solution
Q 78.
Refer to Exercise . We suspect that the more serious soccer players have more arthritis later in life. Do the data confirm this suspicion? Give graphical and numerical evidence to support your answer.
3 step solution
Q 1.1.
Here, once again, is the stem plot of travel times to work for randomly selected New Yorkers. Earlier, we found that the median was minutes.
Based only on the stem plot, would you expect the mean travel time to be less than, about the same as, or larger than the median? Why?
3 step solution
Q 1.2.
Here, once again, is the stem plot of travel times to work for randomly selected New Yorkers. Earlier, we found that the median was
Use your calculator to find the mean travel time. Was your answer to Question correct?
3 step solution
Q 1.3.
Here, once again, is the stem plot of travel times to work for randomly selected New Yorkers. Earlier, we found that the median was minutes.
Interpret your result from Question 2 in context without using the words “mean” or “average".
3 step solution
Q 1.4.
Here, once again, is the stem plot of travel times to work for randomly selected New Yorkers. Earlier, we found that the median was minutes.
Would the mean or the median be a more appropriate summary of the center of this distribution of drive times? Justify your answer.
3 step solution
Q. 1.1
Based only on the stemplot, would you expect the mean travel time to be less than, about the same as, or larger than the median? Why?
2 step solution
Q 2.1.
The roster of the Dallas Cowboys professional football team included offensive linemen. Their weights (in pounds) were
Find the five-number summary for these data by hand. Show your work.
3 step solution
Q 2.2.
The roster of the Dallas Cowboys professional football team included offensive linemen. Their weights (in pounds) were
Calculate the IQR. Interpret this value in the context
3 step solution
Q 2.3.
The roster of the Dallas Cowboys professional football team included 10 offensive linemen. Their weights (in pounds) were
Determine whether there are any outliers using the rule.
3 step solution
Q 2.4.
The roster of the Dallas Cowboys professional football team included offensive linemen. Their weights (in pounds) were
Draw a boxplot of the data.
3 step solution
Q 3.1.
The heights (in inches) of the five starters on a basketball team are . Find the interpretation of mean.
3 step solution
Q 3.2.
The heights (in inches) of the five starters on a basketball team are
Make a table that shows, for each value, its deviation from the mean and its squared deviation from the mean.
3 step solution
Q 3.3.
The heights (in inches) of the five starters on a basketball team are
Show how to calculate the variance and standard deviation from the values in your table.
3 step solution
Q 3.4.
The heights (in inches) of the five starters on a basketball team are
Interpret the meaning of the standard deviation in the given setting.
3 step solution
Q 79.
Quiz grades Joey’s first quiz grades in a marking period were
| 86 | 84 | 91 | 75 | 78 | 80 | 74 |
| 87 | 76 | 96 | 82 | 90 | 98 | 93 |
Calculate the mean. Show your work. Interpret your result in context.
3 step solution
Q 80.
Cowboys the roster of the Dallas Cowboys football team included defensive linemen. Their weights (in pounds) were
Calculate the mean. Show your work. Interpret your result in context.
3 step solution
Q 81.
Refer to Exercise .
(a) Find the median by hand. Show your work. Interpret your result in context.
(b) Suppose Joey has an unexcused absence for the quiz, and he receives a score of zero. Recalculate the mean and the median. What property of measures of center does this illustrate?
4 step solution
Q 82.
Cowboys Refer to Exercise
(a) Find the median by hand. Show your work. Interpret your result in context.
(b) Suppose the lightest lineman had weighed pounds instead of pounds. How would this change affect the mean and the median? What property of measures of center does this illustrate?
5 step solution
Q 83.
Incomes of college grads According to the Census Bureau, the mean and median income of people at least years old who had a bachelor’s degree but no higher degree were and Which of these numbers is the mean and which is the median? Explain your reasoning.
3 step solution
Q 84.
House prices: The mean and median selling prices of existing single-family homes sold in November were and . Which of these numbers is the mean and which is the median? Explain how you know.
3 step solution
Q 85.
Suppose that a Major League Baseball team’s mean yearly salary for its players is $1.2 million and that the team has 25 players on its active roster. What is the team’s total annual payroll?
If you knew only the median salary, would you be able to answer this question? Why or why not?
3 step solution
Q 86.
Mean salary? Last year a small accounting firm paid each of its five clerks , two junior accountants each, and the firm’s owner . What is the mean salary paid at this firm? How many of the employees earn less than the mean? What is the median salary? Write a sentence to describe how an unethical recruiter could use statistics to mislead
prospective employees.
3 step solution
Q 87.
Domain names When it comes to Internet domain names, is shorter better? According to one ranking of Web sites in , the top sites (by number of
“hits”) were yahoo.com, google.com, youtube.com, live.com, msn.com, myspace.com, wikipedia.org, and facebook.com. These familiar sites certainly have short domain names. The histogram below shows the domain name lengths (in a number of letters in the name, not including the extensions .com and .org) for the most popular Web sites.
(a) Estimate the mean and median of the distribution. Explain your method clearly.
(b) If you wanted to argue that shorter domain names were more popular, which measure of the center would you choose—the mean or the median? Justify your answer.
4 step solution
Q 88.
We all know that fruit is good for us. Below is a histogram of the number of servings of fruit per day claimed by seventeen-year-old girls in a study in Pennsylvania?
(a) With a little care, you can find the median and the quartiles from the histogram. What are these numbers? How did you find them?
(b) Estimate the mean of the distribution. Explain your method clearly.
4 step solution
Q 89.
Refer to Exercise 79
(a) Find and interpret the interquartile range (IQR)
(b) Determine whether there are any outliers. Show your work
4 step solution
Q 90.
Cowboys Refer to Exercise
(a) Find and interpret the interquartile range (IQR).
(b) Determine whether there are any outliers. Show your work.
4 step solution
Q 91.
Don’t call me In a September , article titled “Letting Our Fingers Do the Talking,” the New York Times reported that Americans now send more text messages than they make phone calls. According to a study by Nielsen Mobile, “Teenagers ages to are by far the most prolific texters, sending or receiving messages a month.” Mr. Williams, a high school statistics teacher, was skeptical about the claims in the article. So he collected data from his first-period statistics class on the number of text messages and calls they had sent or received in the past hours. Here are the texting data:
(a) Make a boxplot of these data by hand. Be sure to check for outliers.
(b) Do these data support the claim in the article about the number of texts sent by teens? Justify your answer with appropriate evidence.
4 step solution
Q 92.
Acing the first test Here are the scores of Mrs. Liao’s students on their first statistics test:
(a) Make a boxplot of the test score data by hand. Be sure to check for outliers.
(b) How did the students do on Mrs. Liao’s first test? Justify your answer.
4 step solution
Q 93.
Texts or calls? Refer to Exercise 91. A boxplot of the difference (texts – calls) in the number of texts and calls for each student is shown below.
(a) Do these data support the claim in the article about texting versus calling? Justify your answer with appropriate evidence.
(b) Can we draw any conclusion about the preferences of all students in the school based on the data from Mr. Williams’s statistics class? Why or
why not?
4 step solution
Q 94.
Electoral votes To become president of the United States, a candidate does not have to receive a majority of the popular vote. The candidate does have to win a majority of the electoral votes that are cast in the Electoral College. Here is a stemplot of the number of electoral votes for each of the states and the District of Columbia.
(a) Make a boxplot of these data by hand. Be sure to check for outliers.
(b) Which measure of center and spread would you use to summarize the distribution—the mean and standard deviation or the median and IQR? Justify your answer.
4 step solution
Q 95.
Comparing investments Should you put your money into a fund that buys stocks or a fund that invests in real estate? The boxplots compare the daily returns (in percent) on a “total stock market” fund and a real estate fund over a year ending in November
(a) Read the graph: about what were the highest and lowest daily returns on the stock fund?
(b) Read the graph: the median return was about the same on both investments. About what was the median return?
(c) What is the most important difference between the two distributions?
5 step solution
Q 96.
Income and education level Each March, the Bureau of Labor Statistics compiles an Annual Demographic Supplement to its monthly Current Population Survey.44 Data on about individuals between the ages of data-custom-editor="chemistry" who were employed full-time were collected in one of these surveys. The boxplots below compare the distributions of income for people with five levels of education. This figure is a variation of the boxplot idea: because large data sets often contain very extreme observations, we omitted the individuals in each category with the top and bottom of incomes. Write a brief description of how the distribution of income changes with the highest level of education reached. Give specifics from the graphs to support your statements.
3 step solution
Q 97.
The level of various substances in the blood influences our health. Here are measurements of the level of phosphate in the blood of a patient, in milligrams of phosphate per deciliter of blood, made on consecutive visits to a clinic: . A graph of only observations gives little information, so we proceed to compute the mean and standard deviation.
(a) Find the standard deviation from its definition. That is, find the deviations of each observation from the mean, square the deviations, and then obtain the variance and the standard deviation.
(b) Interpret the value of six you obtained in (a)
4 step solution
Q. 97
Phosphate levels The level of various substances in the blood influences our health. Here are measurements of the level of phosphate in the blood of a patient, in milligrams of phosphate per deciliter of blood, made on 6 consecutive visits to a clinic: 5.6, 5.2, 4.6, 4.9, 5.7, 6.4. A graph of only 6 observations gives little information, so we proceed to compute the mean and standard deviation. (a) Find the standard deviation from its definition. That is, find the deviations of each observation from the mean, square the deviations, then obtain the variance and the standard deviation. (b) Interpret the value of sx you obtained in (a)
2 step solution
Q 98.
Feeling sleepy? The first four students to arrive for a first-period statistics class were asked how much sleep (to the nearest hour) they got last night. Their responses were and
(a) Find the standard deviation from its definition. That is, find the deviations of each observation from the mean, square the deviations, then obtain the variance and the standard deviation.
(b) Interpret the value of you obtained in (a).
(c) Do you think it’s safe to conclude that the mean amount of sleep for all students in this class is close to hours? Why or why not?
5 step solution
Q 99.
The figure displays computer output from Data Desk for data on the amount spent by grocery shoppers.
(a) What would you guess is the shape of the distribution based only on the computer output? Explain.
(b) Interpret the value of the standard deviation.
(c) Are there any outliers? Justify your answer.
5 step solution
Q 100.
C-sections Do male doctors perform more cesarean sections (C-sections) than female doctors? A study in Switzerland examined the number of cesarean sections (surgical deliveries of babies) performed in a . Section Describing Quantitative Data with Numbers year by samples of male and female doctors. Here are summary statistics for the two distributions:
(a) Based on the computer output, which distribution would you guess has a more symmetrical shape? Explain.
(b) Explain how the IQRs of these two distributions can be so similar even though the standard deviations are quite different.
(c) Does it appear that males perform more C-sections? Justify your answer
5 step solution
Q 101.
The IQR Is the interquartile range a resistant measure of spread? Give an example of a small data set that supports your answer.
2 step solution
Q 102.
Which of the distributions shown has a larger standard deviation? Justify your answer.
3 step solution
Q 103.
This is a standard deviation contest. You must choose four numbers from the whole numbers to , with repeats allowed.
(a) Choose four numbers that have the smallest possible standard deviation.
(b) Choose four numbers that have the largest possible standard deviation.
(c) Is more than one choice possible in either (a) or (b)? Explain.
5 step solution
Q 104.
What do they measure? For each of the following summary statistics, decide (i) whether it could be used to measure center or spread and (ii) whether it is resistant.
(a)
(b)
5 step solution
Q 105.
Here are the scores on the Survey of Study Habits and Attitudes (SSHA) for 18 first-year college women:
and for 20 first-year college men:
Do these data support the belief that women have better study habits and attitudes toward learning than men? (Note that high scores indicate good study habits and attitudes toward learning.) Follow the four-step process.
3 step solution
Q 106.
Researchers from Amherst College studied the relationship between varieties of the tropical flower Heliconia on the island of Dominica and the different species of hummingbirds that fertilize the flowers. Over time, the researchers believe, the lengths of the flowers and the forms of the hummingbirds’ beaks have evolved to match each other. If that is true, flower varieties fertilized by different hummingbird species should have distinct distributions of length. The table below gives length measurements (in millimeters) for samples of three varieties of Heliconia, each fertilized by a different species of hummingbird. Do these data support the researchers’ belief? Follow the four-step process
3 step solution
Q 107.
If a distribution is skewed to the right with no outliers,
(a) mean median. (d) mean > median.
(b) mean median. (e) We can’t tell without examining the data.
(c) mean median.
3 step solution
Q 108.
You have data on the weights in grams of baby pythons. The mean weight is 31.8 and the standard deviation of the weights is . The correct units for the standard deviation are
(a) no units—it’s just a number.
(b) grams.
(c) grams squared.
(d) pythons.
(e) pythons squared
3 step solution