Sampling Distributions for Differences in Sample Proportions (College Board AP® Statistics): Study Guide
Sampling distributions for differences in sample proportions
What is a one-sample problem?
So far we've only considered one random sample of size
being taken from one population with a population proportion of
The sample proportion is
This is a one-sample problem
What is a two-sample problem?
If one random sample of size
is taken from one population with population proportion of
and a different random sample of size
is taken from a different population (that is independent to the first population) with population proportion of
then this is a two-sample problem
The sample proportions are
and
What is the difference in sample proportions?
In a two-sample problem you can compare the sample proportions from separate samples of two independent populations
You can look at the difference in sample proportions,
e.g. if
then the proportion of successes in the first sample is greater than the proportion of successes of the second sample
What is the sampling distribution for differences in sample proportions?
In a sample of size
taken from the first population
let
count the number of successes in the sample
so
is the number of successes in
trials
and each trial is either a success or a failure
follows a binomial distribution with probability of success
where
is the population proportion
The sample proportion,
, is given by
The number of successes in the sample divided by the total number of individuals in the sample
Similarly, for a sample of size
taken from the second population with
successes
The sample proportion,
, is given by
If the sample sizes are large enough such that the conditions
are all satisfied
then the difference in sample proportions,
, will follow:
an approximate normal distribution
with mean
and standard deviation
This is the sampling distribution for the difference in sample proportions
What else should I know about the sampling distribution for differences in sample proportions?
You need to know that
The standard deviation
assumes sampling was done with replacement
If sampling without replacement, make sure both sample sizes are less than 10% of their population size to be able to use
otherwise the standard deviation will be smaller
Because the distribution is approximately normal, you can use the normal distribution to calculate probabilities involving differences of sample proportions,
Its standardized z-statistic is
and
, the population proportions, will be given in the question
If the sample sizes are not large enough (i.e. the four conditions are not satisfied) then the sampling distribution is not approximately normal
but the mean and standard deviation formulas still hold
Examiner Tips and Tricks
The mean, , and the standard deviation,
, are given in the exam under 'Sampling distributions for proportions', in the row called 'For two populations'.
Worked Example
In Twiggy National Park, 35% of all eagles are male and in Dusty National Park, 20% of all eagles are male. A sample of 40 eagles is taken from Twiggy National Park and a sample of 50 eagles is taken from Dusty National Park.
Find the probability that the proportion of male eagles sampled in Twiggy National Park is less than the proportion of male eagles sampled in Dusty National Park.
Answer:
Start by labeling each population
Population 1 consists of all eagles in Twiggy National Park
Population 2 consists of all eagles in Dusty National Park
The question is about one sample proportion being less than another,
This can be rearranged into the difference of two sample proportions,
The difference in sample proportions follows an approximate normal distribution with mean and standard deviation
so long as
,
,
and
Test the four conditions with ,
,
and
The conditions are satisfied
Substitute and
into
Substitute ,
,
and
into
From above, you want to find
The difference in sample proportions follows (approximately) a normal distribution with mean 0.15 and standard deviation 0.094273538...
To find the probability that the difference in sample means is less than 0, first calculate the z-score for 0
Then find , e.g. using the normal tables
The probability that the proportion of male eagles sampled in Twiggy National Park is less than the proportion of male eagles sampled in Dusty National Park is 0.0559
You've read 0 of your 5 free study guides this week
Unlock more, it's free!
Did this page help you?