Operant Conditioning: Skinner's Research (College Board AP® Psychology): Revision Note

Raj Bonsor

Written by: Raj Bonsor

Reviewed by: Claire Neeson

Updated on

Skinner's research

  • B.F. Skinner (1938, 1953) built on Thorndike’s Law of Effect by developing operant conditioning and providing a systematic, experimental analysis of how consequences shape behavior

    • To study this, Skinner designed the Skinner box - a controlled environment containing a lever or key, a food dispenser, and sometimes an electrified floor

  • Consistent with the behaviorist approach, Skinner focused on observable, measurable behavior, allowing learning to be studied scientifically

Procedure and key findings

  • Skinner used the box to demonstrate how different consequences influence behavior

    • Positive reinforcement:

      • A rat received food after pressing a lever. Over time, lever-pressing increased and became the dominant behavior

    • Negative reinforcement:

      • Pressing the lever stopped an electric shock. Rats quickly learned to press the lever to escape or avoid the shock

    • Discrimination:

      • Rats reinforced only when a light was on learned to press the lever only in that condition, showing that behavior depends on specific cues

Shaping

  • Complex behaviors are not learned all at once

    • Instead, they are built gradually through shaping

  • Shaping involves reinforcing successive approximations

    • These are small steps that move closer to the desired behavior

      • E.g. to train a rat to press a lever:

        • reinforce moving toward the lever

        • then reinforce touching it

        • then reinforce pressing it

    • Each step must be closer to the target than the previous one

Reinforcement schedules

  • Once a behavior is learned, the schedule of reinforcement influences the rate, pattern, and persistence of responding

    • Continuous reinforcement: every response is rewarded

      • E.g. a parent praises a child every single time they tidy their toys

        • This leads to rapid learning, but rapid extinction

    • Partial (intermittent) reinforcement: only some responses are rewarded

      • E.g. a parent praises a child for tidying their toys only occasionally

        • This leads to slower learning, but much greater resistance to extinction

  • Different reinforcement schedules produce distinct patterns of behavior:

Schedule

Rule

Behaviour pattern

Example

Fixed Ratio (FR)

Reward after a set number of responses

High rate with pauses

A parent praises a child after every 3 times they tidy their toys

Variable Ratio (VR)

Reward after an unpredictable number of responses

Very high, steady rate; most resistant to extinction

A parent praises a child after a random number of times they tidy their toys

Fixed Interval (FI)

Reward after a fixed time period

“Scalloped” pattern (increases near deadline)

A parent checks at the end of each day and praises the child if their room is tidy

Variable Interval (VI)

Reward after varying time intervals

Slow, steady rate

A parent checks the child’s room at random times and praises tidy behavior

  • These differences in response patterns are illustrated in graphs of responding over time:

    • ratio schedules produce higher response rates

    • interval schedules produce slower, more patterned responding

Graph showing number of responses over time for fixed and variable ratios and intervals, with lines marking reinforcers and a key for line types.
A graph comparing response rates across four schedules of reinforcement.

Instinctive drift

  • Although operant conditioning explains much behavior, biology can limit what can be learned

    • Instinctive drift is the tendency for conditioned behaviors to revert to innate, instinctive patterns

  • Breland and Breland (1961) demonstrated this:

    • A raccoon trained to deposit coins began rubbing them together, resembling natural food-washing behavior

    • A pig trained to deposit coins began rooting them along the ground, an instinctive foraging behavior

  • In both cases, instinctive behaviors interfered with the trained response

  • This shows that learning is constrained by biology, similar to biological preparedness in classical conditioning

Superstitious behavior

  • Sometimes behaviors are strengthened by coincidence rather than real cause-and-effect

  • Skinner (1948) showed this by giving pigeons food at fixed intervals regardless of their behavior

  • The pigeons developed repetitive actions (e.g. turning or head-bobbing) as if these behaviors caused the food

    • This occurred because whatever behavior happened just before reinforcement was strengthened

  • Human examples include:

    • athletes repeating rituals during winning streaks, such as wearing the same socks

    • gamblers using “lucky” behaviors, such as blowing on dice

  • These persist because they are reinforced unpredictably, making them resistant to extinction

Learned helplessness

  • Not all learning leads to increased responding

    • In some cases, organisms learn that their behavior has no effect

  • Learned helplessness occurs when an organism experiences uncontrollable negative events and stops trying to change the outcome

  • Seligman et al. (1967) found that:

    • dogs exposed to inescapable shocks later failed to escape, even when escape was possible

    • they had learned that their actions did not matter

    • this effect can transfer to new situations

  • Seligman argued that learned helplessness contributes to human depression

    • Repeated uncontrollable experiences may lead to a belief that effort is pointless

  • Learned helplessness can also apply to education

    • Students who have repeatedly failed despite effort stop trying even when success becomes possible

Examiner Tips and Tricks

  • For Skill 3.A, be able to identify which reinforcement schedule a graph represents by its distinctive shape

    • variable ratio = steepest, most consistent slope with no pauses

    • fixed interval = scalloped curve

    • fixed ratio = steep slope with brief post-reinforcement pauses

    • variable interval = slow, steady line

  • For Skill 1.A, when answering a question on learned helplessness, ensure you know both of the following components of this concept:

    • prior experience of uncontrollable aversive events

    • subsequent failure to act even when control becomes possible

Punishment vs. reinforcement

The four consequence types

  • To understand consequences, apply two key questions:

    1. Is a stimulus added or removed?

    2. Does the behavior increase or decrease?

Type

Mechanism

Effect on Behavior

Example

Positive Reinforcement

Add a pleasant stimulus after behavior

Increases behavior

Praise for completing homework

Negative Reinforcement

Remove an unpleasant stimulus after behavior

Increases behavior

Taking pain relief to remove a headache

Positive Punishment

Add an unpleasant stimulus after behavior

Decreases behavior

Reprimand for running in the hall

Negative Punishment

Remove a pleasant stimulus after behavior

Decreases behavior

Phone taken away after breaking curfew

  • The key difference lies in their effect on behavior:

    • Reinforcement increases the likelihood of a behavior

    • Punishment decreases the likelihood of a behavior

  • Both can be positive (adding a stimulus) or negative (removing a stimulus)

Key differences

Feature

Reinforcement

Punishment

Goal

Increase the frequency of a behavior

Decrease the frequency of a behavior

Types

Positive (add pleasant) / Negative (remove unpleasant)

Positive (add unpleasant) / Negative (remove pleasant)

Side effects

Generally minimal

May produce fear, aggression, avoidance, or anxiety

Use in practice

Preferred approach

Should be used cautiously and alongside reinforcement

When is punishment effective?

  • Punishment is only effective under specific conditions:

    • Timing: Must be immediate to create a clear link with the unwanted behavior

      • Delayed punishment loses its association with the behavior and becomes less effective

    • Consistency: Must be applied every time, as inconsistency can maintain behavior

    • Paired with reinforcement: Must be combined with reinforcing the desired behavior

Limitations of punishment

  • Punishment has several important limitations:

    • Side effects: Harsh punishment can produce fear, aggression, anxiety, and avoidance

    • Suppression, not elimination: Behavior may return when the punisher is absent

    • Does not teach alternatives: It shows what not to do, but not what to do

    • May reinforce avoidance: Unwanted behaviors can be strengthened if they help escape something unpleasant

      • E.g. a child throws a tantrum to avoid eating vegetables. If the parent removes the food, the tantrum is negatively reinforced

Examiner Tips and Tricks

Ensure that you understand these key points:

  • Negative reinforcement is not a type of punishment

    • Reinforcement always increases a desired behavior; punishment always decreases it

    • Negative reinforcement increases desired behavior by removing an unpleasant stimulus

    • 'Negative' means subtraction, not something bad

  • Learned helplessness is not simply giving up

    • The person has learned from experience that their actions do not affect outcomes, so they stop trying, even when success is possible

Unlock more, it's free!

Join the 100,000+ Students that ❤️ Save My Exams

the (exam) results speak for themselves:

Raj Bonsor

Author: Raj Bonsor

Expertise: Psychology & Sociology Content Creator

Raj joined Save My Exams in 2024 as a Senior Content Creator for Psychology & Sociology. Prior to this, she spent fifteen years in the classroom, teaching hundreds of GCSE and A Level students. She has experience as Subject Leader for Psychology and Sociology, and her favourite topics to teach are research methods (especially inferential statistics!) and attachment. She has also successfully taught a number of Level 3 subjects, including criminology, health & social care, and citizenship.

Claire Neeson

Reviewer: Claire Neeson

Expertise: Psychology Content Creator

Claire has been teaching for 34 years, in the UK and overseas. She has taught GCSE, A-level and IB Psychology which has been a lot of fun and extremely exhausting! Claire is now a freelance Psychology teacher and content creator, producing textbooks, revision notes and (hopefully) exciting and interactive teaching materials for use in the classroom and for exam prep. Her passion (apart from Psychology of course) is roller skating and when she is not working (or watching 'Coronation Street') she can be found busting some impressive moves on her local roller rink.