Q. 3.182

Question

Women Students. The US Department of Education sponsors a report on educational institutions, including colleges and universities, titled Digest of Education Statistics. Among many of the statistics provided are the numbers of men and women enrolled in 2 year and 4 year degree-granting institutions. During one year, the percentage of full-time enrolled students that were women, for each of the 50states and the District of Columbia, is as presented on the WeissStats site.

a. obtain and interpret the quartiles. 

b. determine and interpret the interquartile range.

e. find and interpret the five-number summary

d. identify potential outliers, if any.

e. obtain and interpret boxplot.

Step-by-Step Solution

Verified
Answer

a) The quartiles are 535556

(b) The interquartile range is, 3

(c) Five-number summary is, 48, 53, 55, 56, 58

(d) Potential outliers are, 48

(e) The required boxplot is given below.

1Part (a) Step 1: Given information

We are given that,

Sorted data is given in the Weiss stats which are as follows,

48, 48, 50, 52, 52, 52, 52, 52, 52, 53, 53, 53, 53, 53, 53, 53, 53, 54, 54,54, 54, 54, 54, 54, 54, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 56, 56, 56,56, 56, 56, 56, 56, 57, 57, 57, 57, 57, 58, 58, 58

2Part (a) Step 2: Simplify

As we know that median is the middle value of a sorted data set. Because the number of data values is odd, the median is:- 

     Q2=55

So, roughly 50%{"x":[[34,6,7,5,6,13,25,32,34,32,23,10,4],[55,43,41,42,49,60,69,70,69,67,55],[178.5,171.5,156.5,127.5,125.5,123.5,122.5],[114.5],[198.5]],"y":[[9,9,9,51,51,50,49,58,84,110,116,114,95],[116,113,63,19,9,9,21,55,94,113,117],[16,37,78,174,179,182,182],[29],[169]],"t":[[0,0,0,0,0,0,0,0,0,0,0,0,0],[0,0,0,0,0,0,0,0,0,0,0],[1650707898532,1650707898629,1650707898661,1650707898765,1650707898784,1650707898814,1650707898906],[1650707899773],[1650707900772]],"version":"2.0.0"} of the states have a percentage of full-time enrolled students that were women below 55%{"x":[[34,6,7,5,6,13,25,32,34,32,23,10,4],[71,43,43,42,43,52,64,70,71,70,65,46,41],[179,166,140,125,120],[118],[163]],"y":[[9,9,9,51,51,50,49,58,84,110,116,114,95],[9,9,9,51,51,48,51,63,88,107,116,116,97],[14.5,52.5,116.5,161.5,177.5],[57.5],[137.5]],"t":[[0,0,0,0,0,0,0,0,0,0,0,0,0],[0,0,0,0,0,0,0,0,0,0,0,0,0],[1650708006529,1650708006604,1650708006637,1650708006656,1650708006670],[1650708007590],[1650708008347]],"version":"2.0.0"}

Now, the first quartile is the median of values below Q2 i.e.

     Q1=53

So, roughly 25%{"x":[[5,4,16,30,35,27,4,4,35],[73,45,45,44,45,54,66,72,73,72,67,48,43],[167,125,109],[142],[153]],"y":[[30,16,8,11,25,51,116,116,116],[9,9,9,51,51,48,51,63,88,107,116,116,97],[12,123,180],[25],[160]],"t":[[0,0,0,0,0,0,0,0,0],[0,0,0,0,0,0,0,0,0,0,0,0,0],[1650708344414,1650708344609,1650708344675],[1650708345674],[1650708346595]],"version":"2.0.0"} of the states have a percentage of full-time enrolled students that were women below 53%{"x":[[34,6,7,5,6,13,25,32,34,32,23,10,4],[43,72,73,43,44,56,67,72,71,59,47,41],[162,148,105],[110],[127]],"y":[[9,9,9,51,51,50,49,58,84,110,116,114,95],[9,9,9,51,51,50,56,72,108,117,115,101],[22,54,136],[69],[113]],"t":[[0,0,0,0,0,0,0,0,0,0,0,0,0],[0,0,0,0,0,0,0,0,0,0,0,0],[1650708391317,1650708391492,1650708391614],[1650708392735],[1650708393748]],"version":"2.0.0"}

Now,  the third quartile is the median of values below Q2 I.E.

      Q3=56

So, roughly 75%{"x":[[4,32,32,4],[71,43,43,42,43,52,64,70,71,70,65,46,41],[166],[166],[168,161,138,118,116,115,112,111,107,105],[118],[188]],"y":[[9,9,9,115],[9,9,9,51,51,48,51,63,88,107,116,116,97],[0.5],[0.5],[-1.5,15.5,68.5,126.5,129.5,131.5,138.5,141.5,152.5,155.5],[24.5],[142.5]],"t":[[0,0,0,0],[0,0,0,0,0,0,0,0,0,0,0,0,0],[1650708528222],[1650708528379],[1650708531845,1650708531989,1650708532067,1650708532134,1650708532148,1650708532165,1650708532183,1650708532198,1650708532243,1650708532265],[1650708533333],[1650708534264]],"version":"2.0.0"} of the states have a percentage of full-time enrolled students that were women below 56%{"x":[[34,6,7,5,6,13,25,32,34,32,23,10,4],[69,43,41,48,65,67,57,44],[140,132,126,114,113,112,112],[96],[161]],"y":[[9,9,9,51,51,50,49,58,84,110,116,114,95],[9,47,93,119,112,66,49,60],[6,66,104,167,171,174,175],[3],[157]],"t":[[0,0,0,0,0,0,0,0,0,0,0,0,0],[0,0,0,0,0,0,0,0],[1650708581198,1650708581371,1650708581405,1650708581499,1650708581525,1650708581541,1650708581554],[1650708582460],[1650708583445]],"version":"2.0.0"}

3Part (b) Step 1: Given information

We need to find out the interquartile range 

4Part (b) Step 2: Simplify

The interquartile range is calculated as the difference between Q1 and Q3

     IQR=Q3-Q1=56-53=3

The middle 50%{"x":[[34,6,7,5,6,13,25,32,34,32,23,10,4],[55,43,41,42,49,60,69,70,69,67,55],[143.5,125.5,90.5,88.5,83.5,83.5,83.5],[111.5],[116.5],[116.25]],"y":[[9,9,9,51,51,50,49,58,84,110,116,114,95],[116,113,63,19,9,9,21,55,94,113,117],[13,49,117,122,139,140,141],[29],[96],[59]],"t":[[0,0,0,0,0,0,0,0,0,0,0,0,0],[0,0,0,0,0,0,0,0,0,0,0],[1650708861670,1650708861835,1650708861902,1650708861913,1650708861972,1650708861981,1650708861998],[1650708862888],[1650708863820],[1650708882803]],"version":"2.0.0"} of the states have a percentage varying by 3%{"x":[[5,35,36,5,5,17,27,34,35,32,20,8,4],[131.5,131.5,128.5,127.5,111.5,78.5,74.5],[81.5],[122.5]],"y":[[8,8,8,50,50,50,54,64,88,111,116,113,101],[17,18,22,27,68,165,174],[23],[125]],"t":[[0,0,0,0,0,0,0,0,0,0,0,0,0],[1650708969864,1650708969974,1650708969997,1650708970013,1650708970033,1650708970148,1650708970180],[1650708970998],[1650708972534]],"version":"2.0.0"}

5Part (c) Step 1: Given information

We need to find out the five-number summary 

6Part (c) Step 2: Explanation

The five-number summary is minimum=48, first quartile=53, second quartile=55, third quartile=56 and maximum=58

7Part (d) Step 1: Given information

We need to find out the potential outliers. 

8Part (d) Step 2: Simplify

An outlier is more than 1.5 IQR or greater than Q3 or less than Q1

Therefore,

        Q3+1.5 IQR=56+1.5×3=60.5Q1-1.5 IQR=53-1.5×3=48.5

Hence, there is one outlier i.e. 48 because it does not lie between 48.5 and 60.5

9Part (e) Step 1: Given information

We need to find the boxplot which is given below 

10Part(e) Step 2: Simplify

The whiskers of the boxplot are at a low and high value. The box starts at Q1 and ends at Q3  and has a straight line at the median.