Large Data Set | Edexcel A Level Maths: Statistics Exam Questions & Answers 2017 [PDF]

1a

1 mark

Jiang is studying the variable Daily Mean Pressure from the large data set.

He drew the following box and whisker plot for these data for one of the months for one location using a linear scale but

he failed to label all the values on the scale
he gave an incorrect value for the median

Box plot showing daily mean pressure in hPa with a central box, whiskers, and an arrowed axis labelled Daily Mean Pressure (hPa). The centre line of the 'box' is directly above 1200 on the horizontal axis.

Using your knowledge of the large data set, suggest a suitable value for the median.

(You are not expected to have memorised values from the large data set. The question is simply looking for sensible answers.)

How did you do?

1b

1 mark

Using your knowledge of the large data set, suggest a suitable value for the range.

(You are not expected to have memorised values from the large data set. The question is simply looking for sensible answers.)

How did you do?

Was this exam question helpful?

2a

4 marks

A meteorologist is investigating the Daily Total Rainfall, $r$ mm, in Heathrow using a random sample of 120 days from the large data set.

The results are summarised in the table below.

Rainfall, $r$ (mm)	Frequency
$0 \leq r < 2$	$18$
$2 \leq r < 5$	$36$
$5 \leq r < 10$	$42$
$10 \leq r < 20$	$16$
$20 \leq r < 40$	$8$

On the grid below, draw a histogram to represent these data.

Grid paper with small squares, featuring horizontal and vertical axes with arrowheads, suggesting a blank graph or plot area.

How did you do?

2b

2 marks

A "light-rain" day is defined as a day with rainfall between 1 mm and 5 mm.

Calculate an estimate for the number of "light-rain" days recorded in this sample.

How did you do?

2c

1 mark

Before producing the grouped frequency table, the meteorologist had to clean the data.

Using your knowledge of the large data set, explain why the daily total rainfall data needed to be cleaned.

How did you do?

Was this exam question helpful?

7a

2 marks

The box plot in Figure 1 shows the Daily Mean Wind Speed, $w$ knots, for the 31 days in October 2015 in Hurn from the large data set.

Box plot with lowest line at 2, next line at 5, next at 7, next at 8, and last at 10. Another point is indicated at 13 — **Figure 1**

Show that the value 13 is an outlier.

How did you do?

7b

2 marks

The Daily Mean Wind Speed data for Leuchars for the same period (October 2015) is summarised below.

Lowest Value	3
Lower Quartile	4
Median	6
Upper Quartile	9
Highest Value	22

Compare the Daily Mean Wind Speed in Hurn and Leuchars for October 2015.

How did you do?

7c

2 marks

A meteorologist wants to calculate the mean wind speed for Leuchars. The data in the large data set contains some entries recorded as "n/a".

State what "n/a" represents in the large data set and how the meteorologist should handle these entries.

How did you do?

Was this exam question helpful?

2a

1 mark

Stav is studying the large data set for September 2015.

He codes the variable Daily Mean Pressure, $x$ , using the formula $y = x - 1010$ .

The data for all 30 days from Hurn are summarised by

$\sum y = 214 \sum y^{2} = 5912$

State the units of the variable $x$ .

How did you do?

2b

2 marks

Find the mean Daily Mean Pressure for these 30 days.

How did you do?

2c

3 marks

Find the standard deviation of Daily Mean Pressure for these 30 days.

How did you do?

2d

2 marks

Stav knows that, in the UK, winds circulate

in a clockwise direction around a region of high pressure
in an anticlockwise direction around a region of low pressure

The table gives the Daily Mean Pressure for 3 locations from the large data set on 26/09/2015

Location	Heathrow	Hurn	Leuchars
Daily Mean Pressure	1029	1028	1028
Cardinal Wind Direction

The Cardinal Wind Directions for these 3 locations on 26/09/2015 were, in random order,

W NE E

You may assume that these 3 locations were under a single region of pressure.

Using your knowledge of the large data set, place each of these Cardinal Wind Directions in the correct location in the table.

Give a reason for your answer.

How did you do?

Was this exam question helpful?

4a

1 mark

Helen is studying one of the qualitative variables from the large data set for Heathrow from 2015.

She started with the data from 3rd May and then took every 10th reading.

There were only 3 different outcomes with the following frequencies

Outcome	$A$	$B$	$C$
Frequency	16	2	1

State the sampling technique Helen used.

How did you do?

4b

2 marks

From your knowledge of the large data set

(i) suggest which variable was being studied,

(ii) state the name of outcome $A$ .

How did you do?

4c

1 mark

George is also studying the same variable from the large data set for Heathrow from 2015.

He started with the data from 5th May and then took every 10th reading and obtained the following

Outcome	$A$	$B$	$C$
Frequency	16	1	1

Helen and George decided they should examine all of the data for this variable for Heathrow from 2015 and obtained the following

Outcome	$A$	$B$	$C$
Frequency	155	26	3

State what inference Helen and George could reliably make from their original samples about the outcomes of this variable at Heathrow, for the period covered by the large data set in 2015.

How did you do?

Was this exam question helpful?

12a

2 marks

An ice cream shop owner in Camborne is trying to use data from the large data set alongside their own past sales data to help them estimate future sales. The mean daily temperature per month, $T$ °C, is shown with the mean daily number of ice creams sold per month, $I$ , from 2015 in the table below.

Month	May	June	July	August	September	October
$T$	11.2	13.8	15.7	15.4	13.6	12.2
$I$	57	132	259	227	133	101

The equation for the regression line of $I$ on $T$ is $I = - 429.5 + 42.5 T$ .

Find an estimate for the expected total number of ice creams sold in the month of July if the average daily temperature for that month is 14.9 °C.

How did you do?

12b

1 mark

Suggest one other variable from the large data set which could be used to improve this model.

How did you do?

12c

1 mark

The ice cream shop owner claims that there is a causal link between $I$ and $T$ , and so if the shop sells more ice cream, the month will be hotter.

Comment on this claim.

How did you do?

Was this exam question helpful?

1a

6 marks

Roger has been looking at some data on the daily mean air temperature, t, in two different locations, Perth and Jacksonville, taken from the large data set. All the data is taken from the month of July in 2015.

	$n$	$Σ t$	$Σ t^{2}$	$\bar{t}$	$σ$
Location A	31	836.3	22593.0
Location B	31			13.3	2.167

Unfortunately, some of the information has been lost and Roger does not know which data is for which location.

Complete the table.

How did you do?

1b

1 mark

Using your knowledge of the large data set, state which of the locations is most likely to be Jacksonville, giving a reason for your answer.

How did you do?

Was this exam question helpful?

2a

2 marks

The table below shows the daily maximum relative humidity, rounded to the nearest per cent, for Leuchars between June and August $2015$ .

Daily maximum relative humidity, $x$ (%)	Frequency, $f$
80 – 89	7
90 – 95	21
96 – 98	21
99 – 100	43

Using your knowledge of the large data set, explain why roughly $70 %$ of these days contained fog and/or mist.

How did you do?

2b

2 marks

The data from the table are to be presented on a statistical diagram.

For a histogram, the frequency density for the $96$ – $98$ class is $7$ .

Find the frequency density for the $80$ – $89$ class.

How did you do?

2c

2 marks

For a cumulative frequency graph, state the coordinates of all the points that should be plotted.

How did you do?

2d

1 mark

Explain why an exact box plot cannot be drawn using only the information from the table.

How did you do?

Was this exam question helpful?

Daily Mean Temp. °C Beijing October 1987	20.6	19.1	21.1	20.4	19.8	19.3	17.1	16.5	18	18.9
Daily Mean Temp. °C Beijing October 2015	16.1	19.4	18.6	18.4	18.9	20.3	20.5	14.5	14.7	14

Daily mean total cloud cover (oktas)	0	1	2	3	4	5	6	7	8
Frequency (number of days)	0	1	4	7	10	30	52	52	28

$p$	1007	1023	1011	1022	1011	1019	1017	1016	1022	997	1030	1023
$s$	0	6.3	2.4	6.2	1.7	8.4	1.9	6.7	7.7	2.3	10.3	4.1

	Min	Max	Median	$Σ x$	$Σ x^{2}$
1987	2	16	5	185	1401
2015	3	10	6	197	1357

$x$	$a$	0.7	3.3	6.9	4.7	5.4	5.5	0.1	5.7	7.5
$y$	5	6	7	5	6	6	5	7	4	4

Dailymean total clou cover (oktas)	0	1	2	3	4	5	6	7	8
Frequency (number of days)	0	0	1	1	2	1	5	9	9

$x$	0	0.1-0.5	0.6-1.0	1.1-1.9	2.0-4.0	4.1-6.9	7.0-12.0	12.1-20.9	21.0-32.0	tr
Frequency	55	18	18	21	17	9	9	6	2	29

	Daily mean air temperature		Daily Mean Pressure
	$\bar{t}$	$σ_{t}$	$\bar{p}$	$σ_{p}$
Beijing	$a$	$b$	1004	3.81
Jacksonville	26.4	1.80	1017	1.88
Perth	14.8	2.37	1021	5.63

	Number of available days	Maximum value	$Σ x$	$S_{x x}$
1987	25	61	665	3462

Large Data Set (Edexcel A Level Maths: Statistics): Exam Questions