Which box plot represents a set of data with the largest interquartile range

  1. Statistics
  2. Box and Whisker Plot

What does a box plot tell you?

By , published 2019


What is a box plot?

In descriptive statistics, a box plot or boxplot (also known as box and whisker plot) is a type of chart often used in explanatory data analysis. Box plots visually show the distribution of numerical data and skewness through displaying the data quartiles (or percentiles) and averages.

Box plots show the five-number summary of a set of data: including the minimum score, first (lower) quartile, median, third (upper) quartile, and maximum score.

Which box plot represents a set of data with the largest interquartile range

Definitions

Minimum Score

The lowest score, excluding outliers (shown at the end of the left whisker).

Lower Quartile

Twenty-five percent of scores fall below the lower quartile value (also known as the first quartile).

Median

The median marks the mid-point of the data and is shown by the line that divides the box into two parts (sometimes known as the second quartile). Half the scores are greater than or equal to this value and half are less.

Upper Quartile

Seventy-five percent of the scores fall below the upper quartile value (also known as the third quartile). Thus, 25% of data are above this value.

Maximum Score

The highest score, excluding outliers (shown at the end of the right whisker).

Whiskers

The upper and lower whiskers represent scores outside the middle 50% (i.e. the lower 25% of scores and the upper 25% of scores).

The Interquartile Range (or IQR)

This is the box plot showing the middle 50% of scores (i.e., the range between the 25th and 75th percentile).

Why are box plots useful?

Box plots divide the data into sections that each contain approximately 25% of the data in that set.

Which box plot represents a set of data with the largest interquartile range

Box plots are useful as they provide a visual summary of the data enabling researchers to quickly identify mean values, the dispersion of the data set, and signs of skewness.

Note the image above represents data which is a perfect normal distribution and most box plots will not conform to this symmetry (where each quartile is the same length).

Box plots are useful as they show the average score of a data set.

The median is the average value from a set of data and is shown by the line that divides the box into two parts. Half the scores are greater than or equal to this value and half are less.

Box plots are useful as they show the skewness of a data set

The box plot shape will show if a statistical data set is normally distributed or skewed.

Which box plot represents a set of data with the largest interquartile range

When the median is in the middle of the box, and the whiskers are about the same on both sides of the box, then the distribution is symmetric.

When the median is closer to the bottom of the box, and if the whisker is shorter on the lower end of the box, then the distribution is positively skewed (skewed right).

When the median is closer to the top of the box, and if the whisker is shorter on the upper end of the box, then the distribution is negatively skewed (skewed left).

Box plots are useful as they show the dispersion of a data set.

In statistics, dispersion (also called variability, scatter, or spread) is the extent to which a distribution is stretched or squeezed.

The smallest value and largest value are found at the end of the ‘whiskers’ and are useful for providing a visual indicator regarding the spread of scores (e.g. the range).

Which box plot represents a set of data with the largest interquartile range

The interquartile range (IQR) is the box plot showing the middle 50% of scores and can be calculated by subtracting the lower quartile from the upper quartile (e.g. Q3−Q1).

Box plots are useful as they show outliers within a data set.

An outlier is an observation that is numerically distant from the rest of the data.

When reviewing a box plot, an outlier is defined as a data point that is located outside the whiskers of the box plot.

Which box plot represents a set of data with the largest interquartile range

Source: https://towardsdatascience.com/understanding-boxplots-5e2df7bcbd51

For example, outside 1.5 times the interquartile range above the upper quartile and below the lower quartile (Q1 - 1.5 * IQR or Q3 + 1.5 * IQR).

How to compare box plots

Box plots are a useful way to visualize differences among different samples or groups. They manage to provide a lot of statistical information, including — medians, ranges, and outliers.

Note, although box plots have been presented horizontally in this article, it is more common to view them vertically in research papers

Step 1: Compare the medians of box plots

Compare the respective medians of each box plot. If the median line of a box plot lies outside of the box of a comparison box plot, then there is likely to be a difference between the two groups.

Which box plot represents a set of data with the largest interquartile range

Source: https://blog.bioturing.com/2018/05/22/how-to-compare-box-plots/


Step 2: Compare the interquartile ranges and whiskers of box plots

Compare the interquartile ranges (that is, the box lengths), to examine how the data is dispersed between each sample. The longer the box the more dispersed the data. The smaller the less dispersed the data.

Which box plot represents a set of data with the largest interquartile range

Next, look at the overall spread as shown by the extreme values at the end of two whiskers. This shows the range of scores (another type of dispersion). Larger ranges indicate wider distribution, that is, more scattered data.


Step 3: Look for potential outliers (see above image)

When reviewing a box plot, an outlier is defined as a data point that is located outside the whiskers of the box plot.


Step 4: Look for signs of skewness

If the data do not appear to be symmetric, does each sample show the same kind of asymmetry?

Which box plot represents a set of data with the largest interquartile range

Which box plot represents a set of data with the largest interquartile range

How to reference this article:

How to reference this article:

McLeod, S. A. (2019, July 19). What does a box plot tell you? Simply psychology: https://www.simplypsychology.org/boxplots.html

Home | About Us | Privacy Policy | Advertise | Contact Us

Simply Psychology's content is for informational and educational purposes only. Our website is not intended to be a substitute for professional medical advice, diagnosis, or treatment.

© Simply Scholar Ltd - All rights reserved

Which box plot represents a set of data with the largest interquartile range

How can you tell which Boxplot has a bigger IQR?

Notice: A long box in the boxplot indicates a large IQR, so the middle half of the data has a lot of variability. A short box in the boxplot indicates a small IQR.

Which Boxplot has higher range?

7/13 to 7/15 has the largest maximum, the largest median, the largest range and largest interquartile range.

Which part of a box plot indicates the interquartile range?

The figure shows the shape of a box and whisker plot and the position of the minimum, lower quartile, median, upper quartile and maximum. In a box and whisker plot: The left and right sides of the box are the lower and upper quartiles. The box covers the interquartile interval, where 50% of the data is found.