6.1.2.3 Selecting the Appropriate Statistical Diagram — Statistical Charts — AQA Mathematics 8300 | GradeGen.AI

Selecting the Appropriate Statistical Diagram

AQA GCSE Mathematics (8300) — Statistics

Sorting the Noise

Have you ever tried to make sense of a massive spreadsheet full of survey results? Raw numbers are hard to read, which is why we use statistical diagrams to reveal patterns at a glance. But choosing the wrong diagram is like using a ruler to measure weight: it just doesn't work. The exam requires you to know exactly which diagram fits which type of data and, more importantly, why.

Step 1: Identify Your Data Type

Before selecting a diagram, you must classify your data.

Categorical data is non-numerical data described in words, sorted into groups or labels (e.g., eye colour, favourite car brand).
Discrete data is numerical data that can only take specific, separate values, usually because it results from counting (e.g., number of siblings, or shoe size, which goes up in fixed steps).
Continuous data is numerical data resulting from measurement. It can take any value within a given range, including decimals (e.g., exact height, weight, or time).
Bivariate data consists of pairs of values for two different variables, used to see if they are connected (e.g., ice cream sales versus daily temperature).
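
One way to practise this step is to let a few lines of Python make a first guess. The sketch below is a rough heuristic, not an exam method: the function name `guess_data_type` and the sample lists are made up for illustration. It cannot tell that a value such as shoe size 5.5 is discrete, so always check the context yourself. Bivariate data is left out because it involves pairs of values rather than a single list.

```python
def guess_data_type(values):
    """Make a rough first guess at the data type of a single list of values."""
    # Any text label means the data is described in words.
    if any(isinstance(v, str) for v in values):
        return "categorical"
    # Whole numbers usually come from counting.
    if all(isinstance(v, int) for v in values):
        return "discrete"
    # Any other numbers are treated as measurements.
    return "continuous"


print(guess_data_type(["blue", "brown", "green"]))  # eye colour
print(guess_data_type([0, 2, 1, 3]))                # number of siblings
print(guess_data_type([1.62, 1.75, 1.80]))          # height in metres
```

The three calls should report categorical, discrete and continuous in that order, matching the definitions above.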

Diagrams for Categorical and Discrete Data

When dealing with categories or counted values, your goal is usually to compare totals or look at fractions of a whole.

Choose Bar Charts to compare frequencies (the number of times each category or value occurs), such as the number of students who travel to school by bus, by car, or on foot. Bars must be of equal width and separated by gaps.
Choose Pie Charts to show proportions or "part-to-whole" relationships, such as the market share of different phone brands. Each sector's angle shows a category's share of the whole, so you cannot read off actual frequencies unless the total is given.
Choose Vertical Line Charts for discrete numerical data that has many possible outcomes (e.g., plotting the frequency of specific test scores out of 50), as thick bars would look too cluttered.
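
To see how a pie chart turns frequencies into sectors, each sector angle is the category's share of the total multiplied by 360°. Here is a minimal Python sketch, assuming made-up survey numbers and a hypothetical helper called `sector_angles`:

```python
def sector_angles(frequencies):
    """Return the pie-chart sector angle (in degrees) for each category."""
    total = sum(frequencies.values())
    # angle = frequency / total * 360, written so the whole-number maths stays exact
    return {category: freq * 360 / total for category, freq in frequencies.items()}


# Hypothetical survey of 30 students (numbers invented for illustration)
travel = {"bus": 12, "car": 8, "walk": 10}
angles = sector_angles(travel)
print(angles)  # {'bus': 144.0, 'car': 96.0, 'walk': 120.0}
```

The angles add up to 360°, which is a quick check worth doing in the exam too. Notice that the same angles would appear for 120, 80 and 100 students, which is why a pie chart on its own does not show actual frequencies.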

Key Terms (25)

Categorical data: Non-numerical data described in words, sorted into groups or labels.
Discrete data: Numerical data resulting from counting that can only take specific values.
Continuous data: Numerical data resulting from measurement that can take any value within a range.
Bivariate data: Data consisting of pairs of values for two variables, used to explore relationships.
Frequency: The number of times a specific value or category occurs in a dataset.
Sector: The "slice" of a pie chart representing a specific category's proportion.
Frequency density: The frequency per unit of class width, plotted on the y-axis of a histogram.
Class width: The difference between the upper and lower boundaries of a grouped class interval.
Median: The middle value of an ordered dataset (the 50th percentile).
Interquartile range: A measure of spread representing the middle 50% of the data, calculated by subtracting the lower quartile from the upper quartile.
Lower quartile: The value at the 25th percentile of an ordered dataset.
Upper quartile: The value at the 75th percentile of an ordered dataset.
Interpolation: Making a prediction within the known range of data on a scatter graph.
Extrapolation: Making a prediction outside the known range of data on a scatter graph, which is often unreliable.
Outlier: A data point that does not fit the general pattern or trend of the dataset.
Causation: When a change in one variable directly causes a change in another variable.
Bar Chart: A diagram used to compare absolute frequencies of categorical or discrete data, featuring bars of equal width separated by gaps.
Pie Chart: A circular chart divided into sectors, used to show proportions or 'part-to-whole' relationships for categorical data.
Vertical Line Chart: A chart used for discrete numerical data with many possible outcomes, using vertical lines instead of thick bars to prevent clutter.
Histogram: A diagram for continuous grouped data where the area of touching bars represents frequency, and the vertical axis shows frequency density.
Frequency Polygon: A graph drawn by joining the midpoints of the tops of histogram bars with straight lines, used to compare continuous distributions.
Box Plot: A diagram that summarises the median, quartiles, and spread of continuous data.
Stem and Leaf Diagram: A diagram that organises numerical data by splitting numbers into a stem and a leaf, retaining the raw data values.
Scatter Graph: A graph used to plot pairs of values for bivariate data to explore relationships and correlation.
Time Series Graph: A line graph with time plotted on the x-axis, used to show trends in data recorded at intervals over time.
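
Two of the definitions above are really formulas: frequency density is frequency divided by class width, and the interquartile range is the upper quartile minus the lower quartile. The Python sketch below works through both with invented numbers. The helper names are hypothetical, and the quartiles come from Python's `statistics.quantiles`, whose default method may not match the rule your textbook uses for every dataset.

```python
from statistics import quantiles


def frequency_density(lower, upper, frequency):
    """Frequency density = frequency / class width."""
    class_width = upper - lower
    return frequency / class_width


def interquartile_range(data):
    """Interquartile range = upper quartile - lower quartile."""
    # quantiles(data, n=4) returns [lower quartile, median, upper quartile]
    lower_q, _median, upper_q = quantiles(data, n=4)
    return upper_q - lower_q


# Made-up grouped data: (lower boundary, upper boundary, frequency)
classes = [(0, 10, 5), (10, 30, 12), (30, 40, 8)]
for lower, upper, freq in classes:
    print(f"{lower} to {upper}: frequency density = {frequency_density(lower, upper, freq)}")

# Made-up ordered dataset with 7 values: lower quartile 3, upper quartile 11
print(interquartile_range([1, 3, 5, 7, 9, 11, 13]))  # 8.0
```

The middle class has the highest frequency (12) but not the highest frequency density, because it is twice as wide as the others. This is why a histogram uses area, not height, to represent frequency.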
