Have you ever tried to make sense of a massive spreadsheet full of survey results? Staring at raw numbers is confusing, which is why we use statistical diagrams to spot patterns instantly. But choosing the wrong diagram is like using a ruler to measure weight—it just doesn't work. The exam requires you to know exactly which diagram fits which type of data, and more importantly, why.
Before selecting a diagram, you must classify your data.
When dealing with categories or counted values, your goal is usually to compare totals or look at fractions of a whole.
To draw a pie chart, calculate the angle for each sector using this formula:
Continuous data is measured along an unbroken scale, meaning there can be no gaps between the groups.
Here are the key formulas for working with histograms, using the class width (the difference between the upper and lower boundaries of a group):
Sometimes, you need to understand how spread out the data is, or you need to keep the exact original numbers visible.
When analysing two variables or tracking data over time, you need specialised graphs.
Students often plot frequency instead of frequency density on the y-axis when drawing histograms. You must calculate frequency density if the class widths are unequal.
Do not use a bar chart for continuous data. Continuous scales cannot have gaps, which is why histograms (where bars touch) must be used instead.
When asked to justify your choice of diagram, stating 'it shows the data' will score zero marks. Use specific rationale from the spec: e.g., 'A pie chart shows proportions' or 'A stem and leaf diagram retains the raw data values'.
In 2-mark comparison questions for box plots, examiners expect you to compare one measure of average (the median) and one measure of spread (the IQR or range), and relate both back to the real-world context of the question.
If an exam question asks you to 'describe the correlation', answer with 'Positive' or 'Negative'. If it asks you to 'describe the relationship', you must use context (e.g., 'As temperature increases, ice cream sales increase').
Categorical data
Non-numerical data described in words, sorted into groups or labels.
Discrete data
Numerical data resulting from counting that can only take specific values.
Continuous data
Numerical data resulting from measurement that can take any value within a range.
Bivariate data
Data consisting of pairs of values for two variables, used to explore relationships.
Frequency
The number of times a specific value or category occurs in a dataset.
Sector
The "slice" of a pie chart representing a specific category's proportion.
Frequency density
The frequency per unit of class width, plotted on the y-axis of a histogram.
Class width
The difference between the upper and lower boundaries of a grouped class interval.
Median
The middle value of an ordered dataset (the 50th percentile).
Interquartile range
A measure of spread representing the middle 50% of the data, calculated by subtracting the lower quartile from the upper quartile.
Lower quartile
The value at the 25th percentile of an ordered dataset.
Upper quartile
The value at the 75th percentile of an ordered dataset.
Interpolation
Making a prediction within the known range of data on a scatter graph.
Extrapolation
Making a prediction outside the known range of data on a scatter graph, which is often unreliable.
Outlier
A data point that does not fit the general pattern or trend of the dataset.
Causation
When a change in one variable directly causes a change in another variable.
Bar Chart
A diagram used to compare absolute frequencies of categorical or discrete data, featuring bars of equal width separated by gaps.
Pie Chart
A circular chart divided into sectors, used to show proportions or 'part-to-whole' relationships for categorical data.
Vertical Line Chart
A chart used for discrete numerical data with many possible outcomes, using vertical lines instead of thick bars to prevent clutter.
Histogram
A diagram for continuous grouped data where the area of touching bars represents frequency, and the vertical axis shows frequency density.
Frequency Polygon
A graph drawn by joining the midpoints of the tops of histogram bars with straight lines, used to compare continuous distributions.
Box Plot
A diagram that summarises the median, quartiles, and spread of continuous data.
Stem and Leaf Diagram
A diagram that organises numerical data by splitting numbers into a stem and a leaf, retaining the raw data values.
Scatter Graph
A graph used to plot pairs of values for bivariate data to explore relationships and correlation.
Time Series Graph
A line graph with time plotted on the x-axis, used to show trends in data recorded at intervals over time.
Put your knowledge into practice — try past paper questions for Mathematics
Categorical data
Non-numerical data described in words, sorted into groups or labels.
Discrete data
Numerical data resulting from counting that can only take specific values.
Continuous data
Numerical data resulting from measurement that can take any value within a range.
Bivariate data
Data consisting of pairs of values for two variables, used to explore relationships.
Frequency
The number of times a specific value or category occurs in a dataset.
Sector
The "slice" of a pie chart representing a specific category's proportion.
Frequency density
The frequency per unit of class width, plotted on the y-axis of a histogram.
Class width
The difference between the upper and lower boundaries of a grouped class interval.
Median
The middle value of an ordered dataset (the 50th percentile).
Interquartile range
A measure of spread representing the middle 50% of the data, calculated by subtracting the lower quartile from the upper quartile.
Lower quartile
The value at the 25th percentile of an ordered dataset.
Upper quartile
The value at the 75th percentile of an ordered dataset.
Interpolation
Making a prediction within the known range of data on a scatter graph.
Extrapolation
Making a prediction outside the known range of data on a scatter graph, which is often unreliable.
Outlier
A data point that does not fit the general pattern or trend of the dataset.
Causation
When a change in one variable directly causes a change in another variable.
Bar Chart
A diagram used to compare absolute frequencies of categorical or discrete data, featuring bars of equal width separated by gaps.
Pie Chart
A circular chart divided into sectors, used to show proportions or 'part-to-whole' relationships for categorical data.
Vertical Line Chart
A chart used for discrete numerical data with many possible outcomes, using vertical lines instead of thick bars to prevent clutter.
Histogram
A diagram for continuous grouped data where the area of touching bars represents frequency, and the vertical axis shows frequency density.
Frequency Polygon
A graph drawn by joining the midpoints of the tops of histogram bars with straight lines, used to compare continuous distributions.
Box Plot
A diagram that summarises the median, quartiles, and spread of continuous data.
Stem and Leaf Diagram
A diagram that organises numerical data by splitting numbers into a stem and a leaf, retaining the raw data values.
Scatter Graph
A graph used to plot pairs of values for bivariate data to explore relationships and correlation.
Time Series Graph
A line graph with time plotted on the x-axis, used to show trends in data recorded at intervals over time.