Randomized Block & Matched Pairs Design (College Board AP® Statistics): Study Guide

Syllabus Edition

First teaching 2026

First exams 2027

Mark Curtis

Written by: Mark Curtis

Reviewed by: Dan Finlay

Updated on

Randomized block design

What is a block?

  • A block is group of experimental units who have something in common (are similar) that may affect how they respond to a treatment

    • e.g. a group of participants who are smokers

  • Blocking is the act of dividing up the experimental units into different blocks

    • e.g. separating participants out into smokers and non-smokers

      • 'smoking or not' is the blocking variable

  • Blocking should only be done if the researcher believes the blocking variable could affect the results

Why is blocking used?

  • How experimental units respond to treatments varies naturally due to many different factors (variables)

    • e.g. age, diet, weight, ....

  • Blocking allows natural variations in responses to treatments to be distinguished from those variations that were due to the blocking variable

    • It removes the blocking variable from the list of interfering factors

      • which gives a clearer picture of the effectiveness of the treatment

      • and makes any differences between treatments more distinguishable

What is a randomized block design?

  • An experiment that has a randomized block design is one in which

    • experimental units are separated out into blocks

      • based on an identified blocking variable that could cause an issue

    • then experimental units are randomly assigned the different treatments within each block

      • Common methods for randomly assigning treatments can be used in each block

      • e.g. using random number generators or flipping a fair coin

  • If an experiment has more than two treatments

    • each block needs to be randomly assigned all of the treatments

Flowchart of a randomised block design: subjects split into blocks, then randomly into groups receiving Treatment A or B, then responses are compared.
Flowchart of a randomized block design

Is a randomized block design better than a completely randomized design?

  • In general, a randomized block design is better than a completely randomized design

    • making it easier to distinguish the effectiveness of the treatment

      • from any differences caused by the blocking variable

  • However, completely randomized designs should be used

    • if blocking variables are unknown

    • or if the sample size is very large

      • because larger samples tend to introduce more blocking variables

      • which means more blocking is required

      • which ends up reducing the sample size within each block

Matched pairs design

What is a matched pairs design?

  • A matched pairs design is a special type of randomized block design

    • The blocks usually have only two experimental units each (a pair)

    • which are matched either naturally or by the researcher based on some common factor

      • e.g. matching pairs of individuals who have similar heights

      • The blocking variable here is height

  • The experiment has two treatments

    • These are randomly assigned within each pair

      • One of the pair receives the first treatment, the other receives the second

  • A single unit can be used

    • Each experimental unit serves as its own pair

    • The order of treatments is randomized

Examiner Tips and Tricks

Exam questions may use the word pairing instead of blocking.

How do I randomly assign treatments to each pair?

  • One way is to use a random number generator as follows

    • Label one of the pair as 1 and the other as 2

    • Use a random number generator to generate a number between 1 and 2

    • Give the first treatment to the experimental unit whose number is selected

    • Given the second treatment to the experimental unit whose number was not selected

Is a matched pair design better than a completely randomized design?

  • A matched pair design is better than a completely randomized design

    • as it makes it easier to distinguish the effectiveness of the treatment

      • by removing any effects due to the blocking variable

Worked Example

A sports researcher wants to compare two new running shoe designs (Design A and Design B) to determine which one produces faster 100-meter sprint times. The researcher has recruited 40 volunteer runners for the study. Twenty of the volunteers are trained sprinters, and the other 20 are trained long-distance runners.

Which of the following describes the most appropriate experimental design for this study and provides the correct justification?

(A) A completely randomized design, because randomly assigning the 40 runners to the two shoe designs will completely eliminate the effects of any confounding variables.

(B) A randomized block design with the running discipline (sprinter vs. long-distance) as the blocking variable, because it separates the variation in sprint times caused by the runners' previous training from the variation caused by the shoe designs.

(C) A randomized block design with the shoe design (Design A vs. Design B) as the blocking variable, because the shoe design is the explanatory variable being evaluated in the study.

(D) A matched-pairs design in which each trained sprinter is paired with a trained long-distance runner, and within each pair, the shoe designs are randomly assigned.

Answer:

The purpose of this design is to separate the variation in the response caused by the blocking variable from the rest of the extraneous variation, allowing for more precise comparisons of the treatments

Because trained sprinters will naturally have much faster 100-meter sprint times than long-distance runners, the running discipline is a massive source of extraneous variation

  • (A) is incorrect because while a completely randomized design (where treatments are assigned completely at random) balances out extraneous variables on average, it does not eliminate them

    • In a sample of 40 runners, random assignment could still result in an unbalanced group (e.g., one shoe design gets 15 sprinters while the other gets 5), making it an inferior design to blocking in this specific scenario

  • (C) features a common conceptual error

    • You block on an extraneous source of variation, not on the explanatory variable (the factor) being tested

  • (D) misapplies the definition of a matched-pairs design

    • In a matched-pairs design, experimental units must be paired by matching them on similar extraneous sources of variation

    • Pairing a sprinter with a distance runner creates highly heterogeneous pairs, defeating the entire purpose of the design

    • A valid matched-pairs design here would instead have each individual runner run in both shoe designs in a randomized order

The correct answer is B

Summary of the different types of design

Feature

Completely randomized design (CRD)

Randomized block design (RBD)

Matched pairs design

Stratified random sampling

Type of Design

Experimental design

Experimental design

Experimental design

Sampling design

Primary Purpose

To explore an investigative question by assigning treatments completely at random, which reduces the potential for confounding variables.

To separate the variation in the response caused by an extraneous variable (the blocking variable), which allows for more precise comparisons across treatments.

To compare exactly two treatments by matching experimental units on extraneous sources of variation.

To divide a population into groups to ensure a sample is selected properly from all subgroups.

Role of "Groups"

There is no initial grouping of subjects; subjects are placed directly into treatment groups.

Experimental units are first grouped into homogeneous blocks according to similar values of an extraneous variable.

Experimental units are grouped into pairs of similar units, or a single experimental unit serves as its own pair.

The entire population is divided into non-overlapping, homogeneous groups called strata based on shared attributes.

Randomization Process

Treatments are assigned to experimental units completely at random.

Treatments are randomly assigned to experimental units within each individual block so that all treatments occur within every block.

Within each pair, one treatment is randomly assigned to one member and the other treatment to the second member, or the order of treatments is randomized for a single unit.

A simple random sample is selected within each stratum and then combined to form one final sample.

Examiner Tips and Tricks

Stratified sampling is a sampling method used to gather data from a population, whereas completely randomized design (CRD), randomized block design (RBD), and matched pairs are experimental designs used to assign treatments to experimental units. Students often confuse stratified sampling with randomized block designs because both involve dividing individuals into homogeneous groups based on shared characteristics. However, stratification is used to randomly select who is included in a study, while blocking is used to randomly assign treatments to subjects who are already in the study.

Unlock more, it's free!

Join the 100,000+ Students that ❤️ Save My Exams

the (exam) results speak for themselves:

Mark Curtis

Author: Mark Curtis

Expertise: Maths Content Creator

Mark graduated twice from the University of Oxford: once in 2009 with a First in Mathematics, then again in 2013 with a PhD (DPhil) in Mathematics. He has had nine successful years as a secondary school teacher, specialising in A-Level Further Maths and running extension classes for Oxbridge Maths applicants. Alongside his teaching, he has written five internal textbooks, introduced new spiralling school curriculums and trained other Maths teachers through outreach programmes.

Dan Finlay

Reviewer: Dan Finlay

Expertise: Maths Subject Lead

Dan graduated from the University of Oxford with a First class degree in mathematics. As well as teaching maths for over 8 years, Dan has marked a range of exams for Edexcel, tutored students and taught A Level Accounting. Dan has a keen interest in statistics and probability and their real-life applications.