Collecting Data & Sampling (AQA Level 3 Mathematical Studies (Core Maths)): Revision Note
Exam code: 1350
Collecting Data & Using Samples
What is a population?
A population refers to the whole set of things which you are interested in
E.g. if a teacher wanted to know how long pupils in year 11 at their school spent revising each week then the population would be all the year 11 pupils at the school
Population does not necessarily refer to a number of people or animals
E.g. if an IT expert wanted to investigate the speed of mobile phones then the population would be all the different makes and models of mobile phones in the world
What is a sample?
A sample refers to a selected part (called a subset) of the population which is used to collect data from
E.g. for the teacher investigating year 11 revision times a sample would be a certain number of pupils from year 11
A random sample is where every item in the population has an equal chance of being selected
E.g. every pupil in year 11 would have the same chance of being selected for the teacher's sample
A biased sample is where the sample is not random
E.g. the teacher asks pupils from just one class
What are the advantages and disadvantages of using a population?
A census is when data is collected from every member of the whole population
Advantages of using a population
Accurate results as every member/item of the population is used
In reality it would be close to every member for practical reasons
All options/opinions/responses will be included in the results
Disadvantages of using a population
Time consuming to collect the data
Expensive due to the large numbers involved
Large amounts of data to organise and analyse
What are the advantages and disadvantages of using a sample?
The advantages of using a sample
Quicker to collect the data
Cheaper as not so much work involved
Less data to organise and analyse
The disadvantages of using a sample
A small sample size can lead to unreliable results
The reliability of a sample can be improved by taking a larger sample size
A sample can introduce bias
This is particularly a problem if the sample is not random
A sample might not be representative of the population
Only a selection of options/opinions/responses might be accounted for
The members/items used in the sample may all have similar responses
It is important to recognise that different samples (from the same population) may produce different results
A larger sample size will increase the reliability
Worked Example
Mike is a biologist studying mice and has access to 600 mice that live in an enclosure.
Mike wants to sample some of the mice for a study into their response to a new drug.
He decides to sample 10 mice, selecting those nearest to the enclosure's entrance.
(a) State the population in this situation.
The population is the 600 mice living in the enclosure
(b) State two possible issues with the sample method Mike intends using.
The sample size is very small - just 10 mice
The mice are not being selected at random - those nearest the entrance have a greater chance of being selected
(c) Suggest one way in which Mike could improve the reliability of the results from his sample.
Mike should increase the sample size to increase the reliability of the results
Sampling Methods
What sampling techniques do I need to know?
There are a number of different types of sampling methods, these include
Simple random sampling
Stratified sampling
Cluster sampling
Quota sampling
You should know
How each sampling method is carried out
The strengths and limitations of each method
What is simple random sampling?
In simple random sampling every member from the population has an equal probability of being selected for the sample
Method
Uniquely number every member of a population
Randomly select n different numbers using a random number generator
Strengths
Useful when you have a small population or want a small sample
Limitations
Time-consuming if the sample or population is large
May not give a sample that is representative of the whole population
Cannot be used if it is not possible to number or list all the members of the population
What is stratified sampling?
Stratified sampling is where the population is divided into groups based on characteristics that may affect the investigation, e.g. age, and a random sample is taken from each group
The proportion of a group that is sampled is equal to the proportion of the population that belong to that group
Method
Calculate the number of members sampled from each stratum
Take a random sample from each group
Strengths
Useful when there are very different groups of members within a population
The sample will be representative of the population structure
The members selected from each stratum are chosen randomly
Limitations
This can not be used if the population can not be split into groups or if the groups overlap
What is cluster sampling?
Cluster sampling is where the population already naturally falls into groups (clusters), e.g. streets in a town
A random number of clusters are selected and all members from within these clusters are used in the sample
Method
Identify the clusters within the population
Randomly select a number of clusters
Use all members in the selected clusters
Strengths
Easy to complete
Cheaper and quicker than some other types of sampling
When appropriate clusters are used, the sample will be representative of the population
Limitations
If appropriate clusters are not used, the sample may not be representative of the population
What is quota sampling?
Quota sampling is where the population is split into groups (similar to stratified sampling) and members of the population are selected until each quota is filled
Method
Calculate how many people you need from each group
Select members from each group until that quota is filled
The members do not have to be selected randomly,
e.g. members of the public walking past that fit that group may be selected
Strengths
Quick and inexpensive to complete
Useful when a sampling frame is not available
Limitations
Some members of the population might choose not to be included in the sample
Members may not have been selected randomly
Worked Example
Mike is a biologist studying mice in an open enclosure. He has access to approximately 540 field mice and 260 harvest mice. Mike wants to sample 10 mice and he wants the proportions of the two types of mice in his sample to reflect their respective proportions of the population.
(a) Calculate the number of field mice and harvest mice that Mike should include in his sample.
Calculate the total number of mice
Find the number of field mice required in the sample
Multiply the proportion of field mice in the population by the sample size
Find the number of harvest mice required in the sample
Multiply the proportion of harvest mice in the population by the sample size
Include 7 field mice and 3 harvest mice in the sample
(b) Given that Mike does not have a list of all mice in the enclosure, state the name of this sampling method.
A certain number of mice from each group has been selected
There is no list of the population so it cannot be a random sample
Quota sampling
(c) Suggest one way in which Mike could improve his sampling method.
Mike could improve his sampling method by increasing his sample size
You've read 0 of your 5 free revision notes this week
Unlock more, it's free!
Did this page help you?