Sampling Distributions for Differences in Sample Means (College Board AP® Statistics): Study Guide
Sampling distributions for differences in sample means
What is a one-sample problem?
When one random sample of size
has been taken from one population
with population mean
and population standard deviation
The sample mean is
This is a one-sample problem
What is a two-sample problem?
If one random sample of size
is taken from one population with population mean
and population standard deviation
then a different random sample of size
is taken from a different population (that is independent to the first population) with population mean
and population standard deviation
then this is a two-sample problem
The sample means are
and
What is the difference in sample means?
In a two-sample problem you can compare the sample means from separate samples of two independent populations
You can look at the difference in sample means,
e.g. if
then the mean of the first sample is greater than the mean of the second sample
What is the sampling distribution for differences in sample means?
You can find the differences in sample means, if
you take all possible samples of size
from the first population and calculate their sample means,
then take all possible samples of size
from the second population and calculate their sample means,
then work out all the possible values that the difference
can take
The collection of all these values is called the sampling distribution for differences in sample means
What are the mean and standard deviation of the sampling distribution for differences in sample means?
If the first population has a population mean of
and a population standard deviation of
and the second independent population has a population mean of
and population standard deviation of
Then the sampling distribution for differences in sample means,
has a mean of
and a standard deviation of
where
is the size of the first sample
and
is the size of the second sample
The standard deviation of
assumes sampling was done with replacement
If sampling without replacement, make sure that each sample size is less than 10% of its population size to be able to use
otherwise the standard deviation will be smaller
Examiner Tips and Tricks
The mean, , and the standard deviation,
, are given in the exam under 'Sampling distributions for means', in the row called 'For two populations'.
What conditions are needed for normality?
If in addition to the above, the two independent populations are also known to be normally distributed
then the sampling distribution for differences in sample means is also normally distributed
with mean
and standard deviation
You can use these properties to calculate probabilities involving differences in sample means,
, as they follow a normal distribution
Its standardized z-statistic is
,
,
and
will be given in the question
What do I do if the populations are not normally distributed?
If the populations are not normally distributed, then you cannot say the sampling distribution for differences in sample means is normally distributed
This means you cannot work out any probabilities
However, despite not knowing its shape, the sampling distribution for differences in sample means still has a
mean of
and a standard deviation of
i.e. you can always write these down, even though the distribution is unknown
Can I use the Central Limit theorem if populations are not normally distributed?
If the populations are not normally distributed, but both sample sizes are greater than or equal to 30 (
and
)
then the Central Limit theorem can be applied
meaning the sampling distribution for differences in sample means is approximately normally distributed with the parameters above
i.e. mean
and standard deviation
You can use these properties to estimate probabilities involving differences in sample means,
, as they follow an approximate normal distribution
Its standardized z-statistic is
,
,
and
will be given in the question
Worked Example
The average lifetime of bulbs from a company called Brite have a mean of 900 hours and a standard deviation of 25 hours. The average lifetime of bulbs from a company called Shine have a mean of 800 hours and a standard deviation of 15 hours.
Estimate the probability that the mean of a sample of 40 bulbs from Brite is at least 108 hours more than the mean of a sample of 50 bulbs from Shine.
Answer:
This a probability question about the difference in means of two samples, so requires the sampling distribution for differences in sample means
Start by labeling each population
Population 1 is the average lifetime of bulbs from Brite
Population 2 is the average lifetime of bulbs from Shine
You are not told the lifetimes of the bulbs are normally distributed but both sample sizes are greater than 30 so the Central Limit theorem can be applied
and
so use the Central Limit theorem
The difference in sample means follows an approximate normal distribution with mean and standard deviation
Substitute and
into
Substitute ,
,
and
into
The wording in the question asks for the probability that
Rearrange this to form the difference of sample means,
The difference in sample means follows an approximate normal distribution with mean 100 and standard deviation 4.4860896... from above
To find the probability that the difference in sample means is greater than 108, first calculate the z-score for 108
Then find , e.g. using the normal tables
The probability that the mean of a sample of 40 bulbs from Brite is at least 108 hours more than the mean of a sample of 50 bulbs from Shine is approximately 0.0375
You've read 0 of your 5 free study guides this week
Unlock more, it's free!
Did this page help you?