Type I vs Type II error shows up everywhere on USMLE—drug trials, screening tests, p-values, power questions—and it’s one of those topics that feels simple until the answer choices get sneaky. Here’s a one-page, quick-hit cheat sheet you can screenshot and actually use.
The core idea (one-liners)
-
Type I error (): You think there’s an effect, but there isn’t one.
“False positive” → Reject a true null hypothesis. -
Type II error (): You miss a real effect that actually exists.
“False negative” → Fail to reject a false null hypothesis.
The classic 2×2 table (memorize this)
Think of hypothesis testing as a “test” for whether the null hypothesis () is true.
| Reality | Your decision | Name | What happened |
|---|---|---|---|
| is true (no real effect) | Reject | Type I error () | False alarm / false positive |
| is true | Fail to reject | Correct | You appropriately found “no effect” |
| is false (effect is real) | Reject | Correct | You detected the real effect |
| is false | Fail to reject | Type II error () | Missed detection / false negative |
Mnemonics & visual devices (pick one and stick with it)
Mnemonic 1: “Type I = Incorrectly convict”
- Type I error: convict an innocent person → you claim “guilty/effect” when none exists (false positive).
- Type II error: let a guilty person walk → you miss a real effect (false negative).
Mnemonic 2: “Alpha = Alarm”
- is the false alarm rate → Type I error.
Visual: the “null hypothesis courtroom”
- : “The defendant is innocent” (no effect).
- Reject = “Guilty!” (there is an effect).
- Type I = you said “Guilty” but they were innocent.
- Type II = you said “Not guilty” but they were guilty.
High-yield equations & relationships
Key definitions
- Significance level: = probability of Type I error
- Type II error: = probability of Type II error
- Power: = probability your study detects a true effect
What changes , , and power?
| Move you make | What happens to | What happens to | What happens to power () | USMLE takeaway |
|---|---|---|---|---|
| Decrease (e.g., 0.05 → 0.01) | ↓ | ↑ | ↓ | Fewer false positives, more false negatives |
| Increase sample size () | — (set by you) | ↓ | ↑ | Big studies reduce random error, improve detection |
| Increase effect size (bigger true difference) | — | ↓ | ↑ | Easier to detect real effects |
| Increase variability/noise | — | ↑ | ↓ | Messy data hides real effects |
Super testable: If a study is underpowered, it’s prone to Type II error (false negative).
How it appears in USMLE-style question stems
Type I error clues (false positive)
- “They claim the new drug works, but it actually doesn’t.”
- “They found a statistically significant difference by chance.”
- “Rejecting a true null hypothesis.”
- “p-value threshold set at 0.05” (that chosen cutoff is )
Translate: They detected an effect that isn’t real → Type I ().
Type II error clues (false negative)
- “Study concludes there is no difference, but there actually is one.”
- “Failed to detect a true association.”
- “Sample size too small” / “insufficient power”
- “Failing to reject a false null hypothesis.”
Translate: They missed a real effect → Type II ().
Quick p-value connection (don’t overcomplicate it)
- p-value: probability of observing your data (or more extreme) if is true.
- If p < , you reject .
High-yield nuance:
- is chosen ahead of time (commonly 0.05).
- p-value is calculated from the data.
- A “significant” p-value does not tell you effect size or clinical importance.
The screening-test analogy (helpful but keep it straight)
Students often map hypothesis testing onto screening tests:
- Type I error () ≈ false positive
- Type II error () ≈ false negative
Just remember: in biostats, this is about truth of , not disease status—though the logic is parallel.
One-page summary (screenshot-ready)
-
Type I error () = false positive = reject true
“I thought there was an effect—but it was chance.” -
Type II error () = false negative = fail to reject false
“I missed a real effect.” -
Power =
“Chance you detect a real effect.” -
Decrease → fewer false positives but more false negatives (power ↓)
-
Increase → ↓ and power ↑