Operant Conditioning & Thorndike's Research (College Board AP® Psychology): Revision Note
Operant conditioning
Operant conditioning (OC) is a type of learning in which behavior is shaped by its consequences
Behaviors followed by rewards are more likely to be repeated, whereas behaviors followed by punishment are less likely to occur again
OC differs from classical conditioning (CC) as:
in CC, learning involves associating two stimuli
in OC, learning involves associating a behavior with its consequence
The term 'operant' reflects the fact that the organism operates on its environment, rather than responding passively to stimuli
The behavioral perspective sees operant conditioning as a key process through which behavior is acquired, maintained, and reduced through experience
The Law of Effect
The basic principle of OC was first described by Edward Thorndike (1898) in the Law of Effect, which states that:
behaviors followed by satisfying consequences are more likely to be repeated
behaviors followed by unpleasant consequences are less likely to be repeated
This principle underpins all of operant conditioning by showing that consequences control behavior
Reinforcement and punishment
To understand how consequences shape behavior, psychologists distinguish between reinforcement and punishment
Reinforcement increases the likelihood that a behavior will be repeated
Punishment decreases the likelihood that a behavior will be repeated
Both can be positive (adding something) or negative (removing something)
Reinforcers also vary in type:
Primary reinforcers satisfy biological needs (e.g. food, water)
Secondary reinforcers gain value through association (e.g. money, grades, praise)
Generalized reinforcers can be exchanged for many rewards (e.g. money)
One practical application of generalized reinforcement is a token economy
This is where individuals earn tokens for desired behaviors and exchange them for rewards
These are used in settings such as schools, prisons, and clinical environments
Reinforcement discrimination and generalization
Once behaviors are learned, organisms also learn when to respond
Reinforcement discrimination occurs when an organism responds only to a specific stimulus associated with reinforcement (a discriminative stimulus)
E.g. a dog learns that it only gets food when it sits after hearing a specific command (“sit”), not when it hears other words
Reinforcement generalization occurs when a response is produced in similar situations, transferring across related stimuli
E.g. the dog begins to sit not only to “sit” but also to similar-sounding commands or gestures
Thorndike's research
Aim
To investigate how animals learn new behaviors through trial and error
Participants
Hungry cats placed individually in a wooden 'puzzle box'
Procedure:
Food was placed outside the box
To escape and reach the food, the cat had to operate a specific mechanism (e.g. pressing a lever or pulling a string)
Thorndike recorded how long it took each cat to escape across repeated trials
Findings:
At first, the cats showed random, trial-and-error behaviors
Over time, escape times decreased, and the cats made fewer mistakes
Eventually, they escaped quickly by performing the correct action almost immediately
Conclusion:
Thorndike concluded that behaviors followed by satisfying consequences (e.g. food and escape) were “stamped in,” while ineffective behaviors were “stamped out”
This showed that learning is driven by consequences, rather than by insight or understanding
Significance
Thorndike’s research demonstrated that behavior can be explained and measured scientifically in terms of its consequences
It introduced trial-and-error learning as a key mechanism of learning
These findings laid the foundation for operant conditioning, which was later developed further by Skinner
The Law of Effect remains a central principle in behaviorism and is widely used in applied behavior therapies
Examiner Tips and Tricks
Ensure that you understand these key points:
OC and CC don't work the same way
In CC, an organism learns an association between two stimuli and produces a response passively
In OC, the organism actively produces a behavior and learning is driven by what consequence follows that behavior
The Law of Effect does not requires conscious awareness
Thorndike's cats did not consciously reason about consequences
OC can operate without conscious awareness
Reinforcement and negative reinforcement do not mean the same thing
Reinforcement is any consequence that increases the likelihood of a behavior
Negative reinforcement increases the likelihood of a behavior by removing an unpleasant stimulus
Unlock more, it's free!
Was this revision note helpful?