Data Presentation (Edexcel A Level Maths: Statistics): Exam Questions

Exam code: 9MA0

3 hours33 questions
1a
1 mark

Each member of a group of 27 people was timed when completing a puzzle.

The time taken, x minutes, for each member of the group was recorded.

These times are summarised in the following box and whisker plot.

Box plot showing data distribution in minutes from 0 to 70. 'Box' goes from 14 to 25 with a line at 20. 'Whiskers' extend to 7 and 40. 'x' marks at 46 and 68.

Find the range of the times.

1b
1 mark

Find the interquartile range of the times.

2a
3 marks

The box and whisker diagram below shows the train journey times, in minutes, between March and Peterborough on a weekday.

Box and whisker diagram showing weekday train journey times in minutes between March and Peterborough, with whiskers extending from approximately 17 to 25 minutes and a box from 18 to 21 minutes

(i) Find the median journey time.

(ii) Find the lower and upper quartiles.

(iii) Find the interquartile range.

2b
2 marks

The box and whisker diagram in part (a) shows the journey times on a weekday. The table below summarises the times for the same journey on a Saturday.

Journey time (minutes)

Fastest

16

Lower quartile

18

Median

19

Upper quartile

20

Slowest

25

On the grid below, draw a box plot for the information given in the table.

Blank grid with a horizontal axis labelled 'Train journey times (mins)' showing tick marks at 15, 20, and 25
3a
3 marks

In a paper aeroplane competition, 55 contestants flew their paper aeroplanes in the airtime pre-eliminations round. The flight times achieved by the contestants' paper aeroplanes are shown in the table below.

Time, t seconds

Frequency f

0 \leq t < 4

12

4 \leq t < 8

25

8 \leq t < 12

16

12 \leq t < 16

2

On the grid below, draw a cumulative frequency graph for the information in the table.

Blank cumulative frequency grid with x-axis labelled 'Time (seconds)' from 0 to 20 and y-axis labelled 'Cumulative frequency' from 0 to 60
3b
1 mark

Use your graph to estimate the median time.

4a
3 marks

The heart rates, in beats per minute (bpm), of 60 randomly selected athletes during a training session were recorded. The data are summarised in the table below.

Heart rate, b (bpm)

Frequency

140 \leq b < 160

10

160 \leq b < 170

20

170 \leq b < 175

20

175 \leq b < 180

10

On the grid below, draw a histogram to represent these data.

Blank histogram grid with x-axis labelled 'BPM' from 130 to 190 and y-axis labelled 'Frequency density' from 0 to 4
4b
2 marks

Use the histogram to estimate the number of athletes with a heart rate less than 150 bpm.

5a
1 mark

A company recorded the commute times, in minutes, for a sample of 15 employees travelling to work one morning. The times are given below.

4

8

12

9

7

14

6

5

8

7

9

10

7

3

6

Find the median commute time.

5b
2 marks

Find the interquartile range of the commute times.

5c
1 mark

On the grid below, draw a box plot for the commute times.

Blank horizontal number line from 0 to 15 minutes, labelled “Commute times (mins)” with major ticks at 0, 5, 10 and 15 and a right arrow.
6a
2 marks

To quality-control the elasticity of elastic bands, a company selects random elastic bands from the end of their production line and has a machine stretch them until they snap. The length, in millimetres, of an elastic band at the moment it snaps is recorded. The histogram and frequency table below show the results, but each is incomplete.

Incomplete histogram showing the snap length distribution of elastic bands; bars for the classes 100–150, 150–175 and 175–200 mm are drawn with frequency densities 0.1, 0.4 and 0.8, but no bars are drawn for 200–225 mm or 225–275 mm

Snap length, l (mm)

Frequency

Frequency density

100 \leq l < 150

5

0.1

150 \leq l < 175

0.4

175 \leq l < 200

0.8

Use the histogram to complete the frequency table.

6b
2 marks

Use the frequency table below to complete the histogram by drawing the missing bars for the classes 200 \leq l < 225 and 225 \leq l < 275.

Snap length, l (mm)

Frequency

200 \leq l < 225

15

225 \leq l < 275

10

Incomplete histogram showing the snap length distribution of elastic bands; bars for the classes 100–150, 150–175 and 175–200 mm are drawn with frequency densities 0.1, 0.4 and 0.8, but no bars are drawn for 200–225 mm or 225–275 mm
7a
3 marks

In a paper aeroplane competition, 40 contestants flew their paper aeroplanes in the distance pre-eliminations round. The distances achieved by the contestants' paper aeroplanes are shown in the cumulative frequency diagram below.

Cumulative frequency diagram for the distances thrown, with the horizontal axis showing distance in metres from 0 to 60 and the vertical axis showing cumulative frequency from 0 to 40. The curve rises slowly to about cumulative frequency 8 by 25 metres, then steeply through about (33, 20) before levelling off near (50, 40)

Use the cumulative frequency graph to estimate

(i) the median distance

(ii) the interquartile range.

7b
2 marks

The contestants whose paper aeroplanes flew the furthest 9 distances qualified for the super finals. Use the graph to estimate the minimum distance a paper aeroplane needed to fly to qualify for the super finals.

1a
Sme Calculator
4 marks
Partially completed box plot on a grid with horizontal axis numbered from 7 to 33. A rectangle is drawn going from 19.4 to 26.6, with a vertical line at 23.6 separating it into two parts.
Figure 1

The partially completed box plot in Figure 1 shows the distribution of daily mean air temperatures using the data from the large data set for Beijing in 2015.

An outlier is defined as a value

  • more than 1.5 cross times IQR below Q subscript 1 or

  • more than 1.5 cross times IQR above Q subscript 3

The three lowest air temperatures in the data set are 7.6 °C, 8.1 °C and 9.1 °C.

The highest air temperature in the data set is 32.5 °C.

Complete the box plot in Figure 1 showing clearly any outliers.

1b
1 mark

Using your knowledge of the large data set, suggest from which month the two outliers are likely to have come.

2a
1 mark

Jiang is studying the variable Daily Mean Pressure from the large data set.

He drew the following box and whisker plot for these data for one of the months for one location using a linear scale but

  • he failed to label all the values on the scale

  • he gave an incorrect value for the median

Box plot showing daily mean pressure in hPa with a central box, whiskers, and an arrowed axis labelled Daily Mean Pressure (hPa). The centre line of the 'box' is directly above 1200 on the horizontal axis.

Using your knowledge of the large data set, suggest a suitable value for the median.

(You are not expected to have memorised values from the large data set. The question is simply looking for sensible answers.)

2b
1 mark

Using your knowledge of the large data set, suggest a suitable value for the range.

(You are not expected to have memorised values from the large data set. The question is simply looking for sensible answers.)

3
Sme Calculator
2 marks

A medical researcher is studying the number of hours, T, a patient stays in hospital following a particular operation.

The histogram summarises the results for a random sample of 90 patients.

Histogram showing frequency density of time in hours. 0-4 bar has height 2.5, 4-7 bar has height 3, 7-12 bar has height 4.2, 12-16 bar has height 4, 16-20 bar has height 3.5, 20-40 bar has height 1.

Use the histogram to estimate straight P open parentheses 10 less than T less than 30 close parentheses.

4a
Sme Calculator
2 marks

The histogram and its frequency polygon below give information about the weights, in grams, of 50 plums.

Histogram and line graph showing frequency density of weight in grams, ranging from 60 to 68. Bars are divided into intervals with superimposed line plot.

Show that an estimate for the mean weight of the 50 plums is 63.72 grams.

4b
Sme Calculator
2 marks

Calculate an estimate for the standard deviation of the 50 plums.

4c
1 mark

Later it was discovered that the scales used to weigh the plums were broken.

Each plum actually weighs 5 grams less than originally thought.

State the effect this will have on the estimate of the standard deviation in part (b).

Give a reason for your answer.

5a
4 marks

A meteorologist is investigating the Daily Total Rainfall, r mm, in Heathrow using a random sample of 120 days from the large data set.

The results are summarised in the table below.

Rainfall, bold italic r (mm)

Frequency

0 less or equal than r less than 2

18

2 less or equal than r less than 5

36

5 less or equal than r less than 10

42

10 less or equal than r less than 20

16

20 less or equal than r less than 40

8

On the grid below, draw a histogram to represent these data.

Grid paper with small squares, featuring horizontal and vertical axes with arrowheads, suggesting a blank graph or plot area.
5b
2 marks

A "light-rain" day is defined as a day with rainfall between 1 mm and 5 mm.

Calculate an estimate for the number of "light-rain" days recorded in this sample.

5c
1 mark

Before producing the grouped frequency table, the meteorologist had to clean the data.

Using your knowledge of the large data set, explain why the daily total rainfall data needed to be cleaned.

6a
Sme Calculator
2 marks

The box plot in Figure 1 shows the Daily Mean Wind Speed, w knots, for the 31 days in October 2015 in Hurn from the large data set.

Box plot with lowest line at 2, next line at 5, next at 7, next at 8, and last at 10. Another point is indicated at 13
Figure 1

Show that the value 13 is an outlier.

6b
2 marks

The Daily Mean Wind Speed data for Leuchars for the same period (October 2015) is summarised below.

Lowest Value

3

Lower Quartile

4

Median

6

Upper Quartile

9

Highest Value

22

Compare the Daily Mean Wind Speed in Hurn and Leuchars for October 2015.

6c
2 marks

A meteorologist wants to calculate the mean wind speed for Leuchars. The data in the large data set contains some entries recorded as "n/a".

State what "n/a" represents in the large data set and how the meteorologist should handle these entries.

7a
3 marks

A biologist studying otter populations records the mass, in grams, of each baby otter at a research centre. The data is illustrated in the box and whisker diagram below.

Box and whisker diagram showing the masses in grams of baby otters, with whiskers extending from approximately 95 to 136 grams and a box from 104 to 120 grams

(i) Find the median mass of the otters.

(iii) Find the interquartile range.

7b
3 marks

The same otters are weighed monthly to track their growth. Summary data on the masses, in grams, of the otters after one month is shown in the table below.

Mass (g)

Smallest mass

125

Range

48

Median

152

Upper quartile

164

Interquartile range

33

On the grid below, draw a box plot for the information given above.

Blank grid with a horizontal axis arrow for drawing a box plot of otter masses
8a
3 marks

120 competitors enter an elimination race for charity. Runners set off from the same start running as many laps of the course as possible. Their total distance is tracked and the competitor who runs the furthest over a 6-hour period is the winner.

The distances runners achieved are recorded in the table below.

Distance, bold italic d (miles)

Frequency, bold space bold italic f

25 less or equal than d less than 30

8

30 less or equal than d less than 35

10

35 less or equal than d less than 40

32

40 less or equal than d less than 45

54

45 less or equal than d less than 50

10

50 less or equal than d less than 55

6

On the grid below, draw a cumulative frequency graph for the information in the table.

Blank cumulative frequency grid with horizontal axis distance in miles from 25 to 55 and vertical axis cumulative frequency from 0 to 130
8b
3 marks

Use your graph to estimate

(i) the median distance run

(ii) the interquartile range.

9a
Sme Calculator
3 marks

The total amount of time cleaners spent dealing with unplanned incidents in a supermarket was recorded each day. Data collected over 49 days is summarised in the table below.

Time, bold italic t (minutes)

Frequency, bold space bold italic f

0 less or equal than t less than 90

9

90 less or equal than t less than 120

24

120 less or equal than t less than 200

12

200 less or equal than t less than 250

4

On the grid below, draw a histogram to represent this data.

Blank histogram grid with horizontal axis labelled time in minutes and vertical axis for frequency density
9b
2 marks

Estimate the number of days that cleaners spent longer than 3 hours dealing with incidents.

10a
2 marks

Filmworld cinemas collected data on the ages of visitors to their cinemas during a 24-hour period. The incomplete histogram and frequency table show some of the information they collected.

Incomplete histogram showing four drawn bars on age intervals 0-5, 5-10, 10-20, and 20-30 with no bars yet drawn for 30-50 and 50-60

Age, bold italic a (years)

Frequency, bold space bold italic f

0 less or equal than a less than 5

15

5 less or equal than a less than 10

10 less or equal than a less than 20

20 less or equal than a less than 30

12

30 less or equal than a less than 50

18

50 less or equal than a less than 60

7

Use the histogram to complete the frequency table.

10b
2 marks

Use the frequency table to complete the histogram.

11a
1 mark

Safety officers check the speed of vehicles travelling along a stretch of highway. The cumulative frequency curve below summarises the data for the speeds, in kmph, of 80 vehicles.

Cumulative frequency curve showing speed in kmph on the horizontal axis from 0 to 100, with cumulative frequency on the vertical axis from 0 to 80

Use the graph to estimate the median speed.

11b
Sme Calculator
3 marks

The speed limit for this section of road is 80 kmph.

Vehicles travelling above the speed limit are issued with a speeding ticket. Those travelling more than 10% over the speed limit are pulled over.

Use the graph to estimate the percentage of vehicles that the safety officers pull over.

1a
1 mark

Taruni is studying the time it takes members of her company to travel to the office.

Taruni decided to ask every member of the company the time, x minutes, it takes them to travel to the office.

Taruni’s results are summarised by the box plot below.

Box plot showing journey times from 20 to 90 minutes, with outliers at 118 and 124 minutes. The median is 40 minutes. The quartiles are at 26 and 58.

Write down the interquartile range for these data.

1b
3 marks

Rana and David both work for the company and have both moved house since Taruni collected her data.

Rana’s journey to work has changed from 75 minutes to 35 minutes and David’s journey to work has changed from 60 minutes to 33 minutes.

Taruni drew her box plot again and only had to change two values.

Explain which two values Taruni must have changed and whether each of these values has increased or decreased.

2a
3 marks

The partially completed table and partially completed histogram give information about the ages of passengers on an airline.

There were no passengers aged 90 or over.

Age (x years)

0 less or equal than x less than 5

5 less or equal than x less than 20

20 less or equal than x less than 40

40 less or equal than x less than 65

65 less or equal than x less than 80

80 less or equal than x less than 90

Frequency

5

45

90

1

Histogram showing frequency density of age groups, 5-20 has height of 6 large squares, 40-65 has height of 5.2 large squares, 65-80 has height of 4 large squares and 80-90 has height of 0.1 large squares.

Complete the histogram.

2b
Sme Calculator
5 marks

Use linear interpolation to estimate the median age.

2c
Sme Calculator
2 marks

An outlier is defined as a value greater than Q subscript 3 plus 1.5 cross times interquartile space range.

Given that Q subscript 1 equals 27.3 and Q subscript 3 equals 58.9, determine, giving a reason, whether or not the oldest passenger could be considered as an outlier.

3a
1 mark

The amounts of time engineers spent dealing with individual faults in a power plant were recorded to the nearest minute. Data on 30 different faults is summarised in the table below.

Time, t (minutes)

Frequency, f

90 – 129

6

130 – 169

8

170 – 199

12

200 – 249

4

Give a reason to justify the use of a histogram to represent these data.

3b
3 marks

On the grid below, draw a histogram to represent the data.

Blank histogram grid for plotting frequency density against time in minutes
3c
Sme Calculator
3 marks

Use your histogram to estimate the proportion of individual faults on which engineers spent longer than three hours.

4a
3 marks

A teacher took 19 students on an international trip. The incomplete box plot below shows part of the summary of the weights, in kg, of the luggage brought by each student. Each student's luggage weighed a different amount.

Incomplete box plot showing the median at 20 kg, the upper quartile at 23 kg, and the right whisker extending to 28 kg, with the left half of the box and the left whisker missing

The median weight is 4 kg more than the lower quartile, and the range of weights is three times the interquartile range.

Use this information to complete the box plot.

4b
1 mark

Calculate the proportion of luggage weights which were less than 20 kg.

4c
2 marks

Students had to pay an additional fee if the weight of their luggage exceeded 23 kg.

Find the number of students who had to pay the additional fee.

5a
1 mark

A biologist times how long it takes each of 80 mice to find the exit to a maze. Every two and a half minutes she records the number of mice which have found the exit, which she then represents as a cumulative frequency curve.

Cumulative frequency curve showing time in minutes on the horizontal axis from 0 to 35 and cumulative frequency on the vertical axis from 0 to 80, with the curve starting at (2.5, 0) and rising to (35, 80)

Based on the graph, state an inequality for the time, t, taken by the fastest mouse.

5b
3 marks

The biologist also recorded the actual times taken by the fastest and slowest mice. She has used this information to begin constructing a box plot to represent the data.

Use the cumulative frequency curve to complete the box plot for the times.

Partial box plot showing only the left whisker tip at 4 minutes and the right whisker tip at 33 minutes, with no box or median line drawn
6a
Sme Calculator
1 mark

An annual cheese-rolling contest involves participants chasing a 4 kg round of cheese down a steep 200 yard long hillside. A group of 60 friends participated in the contest and the table below summarises the distances travelled by each before first falling over.

Distance, d (yards)

Frequency, f

0 \leq d < 40

23

40 \leq d < 80

11

80 \leq d < 120

9

120 \leq d < 160

7

160 \leq d < 200

6

Find how many of the 60 friends made it to the bottom without falling over.

6b
3 marks

On the grid, draw a cumulative frequency graph for the information in the table.

Blank cumulative frequency grid with horizontal axis distance in yards from 0 to 200 and vertical axis cumulative frequency from 0 to 60
6c
2 marks

The steepest part of the hill is between 100 and 140 yards away from the start.

Use your graph to estimate the number of people who fell during this section of the hill.

7a
Sme Calculator
5 marks

The histogram below shows the masses, in grams, of 80 apples.

Histogram showing the masses of 80 apples in grams, with frequency density on the vertical axis and mass on the horizontal axis from 40 to 140

Use the histogram to find an estimate for

(i) the median mass

(ii) the interquartile range.

7b
3 marks

Given that the lightest apple weighs 41 g and that the range of masses is 97 g, draw a box plot to show the distribution of the masses of the apples.

Blank grid with a horizontal axis arrow for drawing a box plot of apple masses
8a
1 mark

Wendy is using the large data set to learn about the daily mean wind speeds for Leeming in June 2015. She lists the data below.

4

4

4

5

5

5

5

6

6

6

6

7

7

7

7

7

8

8

8

9

9

9

10

10

10

11

11

16

17

17

Using your knowledge of the large data set, state the units for the values in the table.

8b
Sme Calculator
3 marks

An outlier is defined as a value more than 1.5 \times \text{IQR} below Q_{1} or more than 1.5 \times \text{IQR} above Q_{3}.

On the grid below, draw a box plot for the information above.

Blank grid with a horizontal axis arrow for drawing a box plot of windspeeds
1a
Sme Calculator
4 marks

The histogram summarises the heights of 256 seedlings two weeks after they were planted.

Histogram with 'Frequency density' on the vertical axis and 'Height of seedling (cm)' on the horizontal address. There are 6 bars: from 0 to 1 cm with a height of 15,  from 1 to 2 cm with a height of 35,  from 2 to 3.5 cm with a height of 50, from 3.5 to 4.5 cm with a height of 55,  from 4.5 to 6.5 cm with a height of 26,  and from 6.5 to 8 cm with a height of 16.

Use linear interpolation to estimate the median height of the seedlings.

1b
Sme Calculator
3 marks

Chris decides to model the frequency density for these 256 seedlings by a curve with equation

y equals k x open parentheses 8 space – space x close parentheses space space space space space space space space space space space 0 less or equal than x less or equal than 8

where k is a constant.

Find the value of k.

1c
1 mark

Using this model, write down the median height of the seedlings.

2a
4 marks

The cumulative frequency graph below shows the times, in seconds, taken by 160 employees of a company to complete a training task.

Cumulative frequency graph showing the time in seconds on the horizontal axis from 60 to 130, and cumulative frequency on the vertical axis from 0 to 160

Use your graph to estimate the number of employees whose time is within 5 seconds of the median time.

2b
2 marks

The fastest 10\% of employees are invited to join an advanced project.

Estimate the highest time of an employee who is invited.

2c
2 marks

The slowest 5\% of employees are offered additional support.

Estimate the lowest time of an employee who is offered support.

3a
1 mark

There are 180 dogs at a vet's clinic. The histogram below shows the highest sound level reached by each individual dog's bark, measured in decibels (dB).

Histogram showing the highest sound levels in decibels reached by 180 dogs' barks, with frequency density on the unlabelled vertical axis and sound level on the horizontal axis from 90 to 110 dB

State what is represented by the area of each bar in a histogram.

3b
Sme Calculator
4 marks

Use the histogram to estimate the number of dogs which had a bark with a highest sound level between 99 dB and 107 dB.

4a
Sme Calculator
3 marks

Mr Shapesphere, a history teacher, records the time, to the nearest minute, it takes him to mark each student's essay. The times were summarised in a grouped frequency table and an extract is shown below.

Time, t (minutes)

Frequency, f

0 – 10

7

11 – 30

16

31 – 35

4

A histogram was drawn to represent these data. The 1130 group was represented by a bar of width 6 cm and height 4.5 cm.

Find the width and height of the bar for the 010 group.

4b
Sme Calculator
3 marks

The total area under the histogram is 60.75 \text{ cm}^{2}.

Find the number of essays which Mr Shapesphere recorded as taking longer than 35 minutes, to the nearest minute.

5a
3 marks

The grouped frequency table below contains information about the lengths of time of Susie's calls with her customers. The table was used to draw the cumulative frequency curve also shown below.

Time (minutes)

4 < t \leq 8

8 < t \leq 12

12 < t \leq 16

16 < t \leq 20

Frequency

16

a

b

c

Cumulative frequency curve showing time in minutes on the horizontal axis from 0 to 20 and cumulative frequency on the vertical axis from 0 to 160

Use the graph to find the values of a, b and c.

5b
3 marks

Use the graph to estimate the interquartile range of times for Susie's calls.

5c
Sme Calculator
2 marks

Use the graph to estimate the percentage of customers whose calls lasted longer than 10 minutes.

6a
Sme Calculator
5 marks

Crystal is given an incomplete box plot showing the lengths of 99 unicorn horns. She also knows that the median length is the midpoint of the minimum and maximum lengths, and that the range is 2.5 times as big as the interquartile range.

Complete the diagrams below to show that there are two possible distributions given the information above.

Two identical incomplete box plots showing a box from 56 to 72 with the right whisker tip at 81 cm, and the median and left whisker missing
6b
2 marks

The box plot below shows the masses of the 99 unicorn horns.

Box plot showing the masses of 99 unicorn horns in kg, with whisker low at 3 kg, lower quartile at 7.6 kg, median at 9.2 kg, upper quartile at 10.2 kg and whisker high at 11.2 kg

Crystal discovers that two masses were recorded incorrectly: 11 kg should have been 8 kg, and 9 kg should have been 10 kg.

Explain why at most one feature of the box plot will need to change.

6c
1 mark

Explain why it is possible that the box plot will remain unchanged when it is fixed.

7a
Sme Calculator
2 marks

The table below shows the daily maximum relative humidity, rounded to the nearest per cent, for Leuchars between June and August 2015.

Daily maximum relative humidity, x (%)

Frequency, f

80 – 89

7

90 – 95

21

96 – 98

21

99 – 100

43

Using your knowledge of the large data set, explain why roughly 70\% of these days contained fog and/or mist.

7b
Sme Calculator
2 marks

The data from the table are to be presented on a statistical diagram.

For a histogram, the frequency density for the 9698 class is 7.

Find the frequency density for the 8089 class.

7c
2 marks

For a cumulative frequency graph, state the coordinates of all the points that should be plotted.

7d
1 mark

Explain why an exact box plot cannot be drawn using only the information from the table.