To find the IQR from a cumulative frequency graph, you need to locate the quartiles using the total frequency (). Because the exact raw data is grouped, values read from a graph are estimates.
A cumulative frequency graph shows the masses, in grams, of 120 apples. The total frequency () is 120. Calculate an estimate for the interquartile range.
Step 1: Find the Lower Quartile () position and value.
Step 2: Find the Upper Quartile () position and value.
Step 3: Calculate the Interquartile Range.
A box plot (or box-and-whisker diagram) is a visual graph drawn on a linear scale that represents the five-number summary of a dataset.
Step-by-Step: Drawing a Box Plot from Raw Data To draw a box plot for the data set:
An outlier is an extreme value that does not fit the overall pattern. On Higher Tier papers, an outlier is any value or . These are marked with a cross () and the whisker stops at the next nearest "normal" value.
When asked to compare two populations using box plots, you must provide a paired comparison of location (average) and dispersion (spread).
Example: Side-by-Side Comparison of Test Scores Imagine two box plots, Class A and Class B, are drawn on the same scale.
| Five-Number Summary | Class A | Class B |
|---|---|---|
| Minimum | 10 | 22 |
| Lower Quartile () | 35 | 42 |
| Median | 48 | 55 |
| Upper Quartile () | 60 | 68 |
| Maximum | 95 | 92 |
| IQR () | 25 | 26 |
Standard Comparison Structure:
Note: Always use the context of the question (e.g., 'test scores', 'running times') to gain full marks.
Students often calculate the full range (Maximum - Minimum) when asked for the interquartile range. Always ensure you subtract the Lower Quartile from the Upper Quartile.
When reading quartiles from a cumulative frequency graph, always use a ruler to draw straight horizontal and vertical lines; examiners look for these lines to award method marks.
In a 4-mark comparison question, you must: 1. State the numerical values; 2. Use comparative words (higher/lower); 3. Interpret what it means in context (e.g., 'faster', 'heavier').
When comparing box plots, never use the word 'mean', as a box plot only provides the median as an average.
Check the scale carefully! In side-by-side comparisons, one box plot might be shifted along the axis. Always read the values from the provided linear scale, not by measuring the boxes with a ruler.
Summary statistics
Single numerical values, such as the median or interquartile range, used to describe and summarize a larger dataset.
Measure of central tendency
A mathematical value representing the center or typical value of a dataset, such as the mean, median, or mode.
Measure of spread
A value that describes how stretched out or varied a set of data is, such as the range or interquartile range. Also known as a measure of dispersion.
Cumulative frequency
The running total of frequencies as you move through the class intervals in a grouped frequency table.
Lower Quartile
The value that is 25% of the way through an ordered dataset, often denoted as Q₁.
Upper Quartile
The value that is 75% of the way through an ordered dataset, often denoted as Q₃.
Interquartile Range
A measure of spread representing the range of the middle 50% of the data, calculated as Q₃ - Q₁.
Box plot
A graphical representation of the distribution of a dataset based on its five-number summary.
Five-number summary
The five key values used to draw a box plot: the minimum, lower quartile, median, upper quartile, and maximum.
Outlier
An extreme data point that lies outside the overall pattern of the distribution, marked with a cross on a box plot.
Consistent
A term used to describe a dataset with a small measure of spread, meaning the values are closely packed together and reliable.
Contextual interpretation
Explaining the mathematical differences between two datasets using the real-world subject matter provided in the question.
Put your knowledge into practice — try past paper questions for Mathematics
Summary statistics
Single numerical values, such as the median or interquartile range, used to describe and summarize a larger dataset.
Measure of central tendency
A mathematical value representing the center or typical value of a dataset, such as the mean, median, or mode.
Measure of spread
A value that describes how stretched out or varied a set of data is, such as the range or interquartile range. Also known as a measure of dispersion.
Cumulative frequency
The running total of frequencies as you move through the class intervals in a grouped frequency table.
Lower Quartile
The value that is 25% of the way through an ordered dataset, often denoted as Q₁.
Upper Quartile
The value that is 75% of the way through an ordered dataset, often denoted as Q₃.
Interquartile Range
A measure of spread representing the range of the middle 50% of the data, calculated as Q₃ - Q₁.
Box plot
A graphical representation of the distribution of a dataset based on its five-number summary.
Five-number summary
The five key values used to draw a box plot: the minimum, lower quartile, median, upper quartile, and maximum.
Outlier
An extreme data point that lies outside the overall pattern of the distribution, marked with a cross on a box plot.
Consistent
A term used to describe a dataset with a small measure of spread, meaning the values are closely packed together and reliable.
Contextual interpretation
Explaining the mathematical differences between two datasets using the real-world subject matter provided in the question.