Validity (College Board AP® Psychology): Study Guide

Raj Bonsor

Written by: Raj Bonsor

Reviewed by: Claire Neeson

Updated on

Types of validity

  • Validity refers to the extent to which a study measures what it claims to measure and whether its findings accurately reflect real thoughts, feelings, and behavior

  • A study is valid if:

    • the results are due to the manipulation of the IV rather than the influence of confounding variables

    • the findings can be generalized beyond the research setting to the wider population

    • the measurement instruments used actually measure what they are intended to measure

  • Validity is essential to the scientific process - findings that lack validity:

    • cannot be meaningfully interpreted

    • will not survive peer review

    • cannot contribute to the evolution of psychological knowledge

Types of Validity

Internal validity

  • Internal validity measures the extent to which changes in the DV are due to the manipulation of the IV rather than the influence of confounding variables

  • A study has high internal validity when

    • confounding variables have been identified and controlled

    • random assignment has been used to distribute participant variables evenly across conditions

    • single-blind or double-blind procedures have been used to control for demand characteristics and experimenter bias

    • the procedure is fully standardized across all conditions

  • A study has low internal validity when

    • confounding variables have not been controlled — alternative explanations for the results exist

    • experimenter bias or demand characteristics have influenced participant behavior

    • The IV and DV are not clearly operationally defined

External validity

  • External validity measures the extent to which the results of a study can be generalized beyond the research setting

  • There are two key aspects of external validity:

    • Ecological validity

    • Population validity

Ecological validity

  • Ecological validity refers to the extent to which the findings reflect real-world behavior

    • Does the task participants are given feel natural and reflect genuine everyday experience?

  • If it has high ecological validity, the task closely mirrors a real-life situation

    • This means the findings are more likely to reflect how people actually think and behave outside of the research setting

  • If the task has low ecological validity, the task is artificial or contrived

    • This means the participants may behave differently than they would in real life, limiting the generalizability of the findings

  • For example:

    • A lab experiment asking participants to memorize a list of random words has low ecological validity — this does not reflect how memory is used in everyday life

    • A field experiment observing helping behavior on a real subway train has higher ecological validity — the situation closely mirrors a genuine real-world scenario

Population validity

  • Population validity refers to the extent to which findings can be generalized from the sample to the wider target population

    • A study has high population validity when the sample is representative of the target population

    • A study has low population validity when the sample is biased

      • E.g. a study conducted exclusively on college students cannot be confidently generalized to the broader population

Measuring validity

  • There are different ways of measuring validity depending on the type of research and the measurement instrument used

Face validity

  • Face validity measures whether a test appears to measure what it claims to measure on the surface

    • E.g. does this anxiety scale look like it is measuring anxiety, based on the content of its questions?

  • Face validity is the most basic form of validity assessment

    • It does not guarantee that the measure is actually valid, only that it appears to be

Concurrent validity

  • Concurrent validity measures how closely a new test agrees with an already established test of the same construct

  • Participants complete both the new test and the established test — if the scores show a strong positive correlation, this is evidence of concurrent validity

    • E.g.scores on a new IQ test are compared with scores on an established, validated IQ test — a correlation of +0.8 or above would indicate good concurrent validity

Improving validity

  • If validity is found to be low, the researcher must take steps to improve it before the study is conducted or repeated

  • The appropriate strategy depends on the research method being used

Lab and field experiments

  • Use clearly operationally defined IV and DV

    • Vague definitions allow for alternative interpretations of the results and reduce internal validity

  • Use a control group as a baseline for comparison

    • Without a control group it is impossible to determine whether changes in the DV were caused by the IV

  • Use random assignment to distribute participant variables evenly across conditions

    • This reduces the risk that pre-existing differences between participants act as confounding variables

  • Use single-blind procedures to control for demand characteristics

    • Participants who do not know which condition they are in cannot alter their behavior based on perceived expectations

  • Use double-blind procedures to control for both demand characteristics and experimenter bias

    • Neither the participant nor the researcher knows which condition the participant is in, preventing either from influencing the results

Observational studies

  • Use covert methods where ethically justifiable

    • Participants who are unaware they are being observed are more likely to display natural, unforced behavior, increasing ecological validity

  • Ensure behavioral categories are clearly operationally defined, observable, and mutually exclusive

    • Ambiguous categories introduce subjectivity and reduce the validity of the findings

Surveys and interviews

  • Ensure questions are clearly worded, neutral, and free from leading questions

    • Poorly worded questions introduce self-report bias and social desirability bias, reducing the validity of responses

  • Use closed questions or Likert scale items where appropriate to reduce ambiguity in responses

Validity & the evolution of scientific conclusions

  • Validity is central to how psychological conclusions evolve through peer review and replication:

    • During peer review, experts evaluate whether a study's methodology is sufficiently valid to support the conclusions drawn

      • Studies with low internal or external validity are likely to be challenged or rejected

    • When a study is replicated and produces consistent findings across different samples and settings, this strengthens both its internal and external validity

    • Findings that lack ecological validity or population validity are less likely to be accepted as meaningful contributions to psychological knowledge

      • This is because they cannot be generalized beyond the specific context in which they were obtained

    • Over time, as studies are replicated, peer reviewed, and refined, invalid measures are identified and replaced with more valid alternatives

      • This is how psychological measurement improves

Examiner Tips and Tricks

Ensure that you understand these key points:

  • Validity and reliability are not the same — a measure can be reliable without being valid

    • E.g. a stopwatch that consistently runs 10 seconds slow is reliable but not valid; for a measure to be valid it must also be reliable, but reliability alone does not guarantee validity

  • Ecological validity is not simply about conducting research in a natural setting — a study conducted in a natural setting can still lack ecological validity if the task itself is artificial or contrived

    • The key question is whether the task reflects genuine real-world behavior, not where it takes place

  • Internal validity and external validity are not the same

    • Internal validity is about whether the IV caused the change in the DV

    • External validity is about whether the findings can be generalized beyond the study

Unlock more, it's free!

Join the 100,000+ Students that ❤️ Save My Exams

the (exam) results speak for themselves:

Raj Bonsor

Author: Raj Bonsor

Expertise: Psychology & Sociology Content Creator

Raj joined Save My Exams in 2024 as a Senior Content Creator for Psychology & Sociology. Prior to this, she spent fifteen years in the classroom, teaching hundreds of GCSE and A Level students. She has experience as Subject Leader for Psychology and Sociology, and her favourite topics to teach are research methods (especially inferential statistics!) and attachment. She has also successfully taught a number of Level 3 subjects, including criminology, health & social care, and citizenship.

Claire Neeson

Reviewer: Claire Neeson

Expertise: Psychology Content Creator

Claire has been teaching for 34 years, in the UK and overseas. She has taught GCSE, A-level and IB Psychology which has been a lot of fun and extremely exhausting! Claire is now a freelance Psychology teacher and content creator, producing textbooks, revision notes and (hopefully) exciting and interactive teaching materials for use in the classroom and for exam prep. Her passion (apart from Psychology of course) is roller skating and when she is not working (or watching 'Coronation Street') she can be found busting some impressive moves on her local roller rink.