Linkage disequilibrium (LD) is one of those genetics concepts that sounds abstract until you frame it the way USMLE tests it: “Do these two variants travel together through generations more than you’d expect by chance?” If yes, you can use one as a proxy for the other—huge for mapping disease genes and interpreting GWAS-style questions.
The one-liner (what you should say in your head)
Linkage disequilibrium = nonrandom association of alleles at different loci in a population (a haplotype effect), often because they’re physically close and recombination hasn’t separated them yet.
USMLE translation: “Marker allele tags the disease allele because they’re inherited together.”
Visual + mnemonic device: “LD = Linked Dice”
Picture two dice glued together:
- If they’re glued (high LD): you keep rolling the same pairings (alleles occur together more than expected).
- If they’re unglued (low LD): they roll independently (alleles assort randomly).
Mnemonic: Linked Dice → Loci Don’t separate (often due to limited recombination).
Step-by-step flowchart (how to answer questions fast)
Step 1 — Are we talking about population patterns or family inheritance?
- Population-level nonrandom association of alleles → think LD
- Co-segregation in a pedigree → think linkage analysis (related, but not identical)
Key distinction
- Linkage: physical proximity on a chromosome (a recombination concept)
- Linkage disequilibrium: statistical association of alleles in a population
Step 2 — Do the variants occur together more than expected by chance?
Ask: Is a particular allele at locus A frequently paired with a specific allele at locus B?
- If yes → LD present
- If no → alleles are approximately independent
Step 3 — Why would LD happen?
High-yield causes that show up in stems:
- Physical proximity (low recombination fraction between loci)
- Recent mutation (not enough generations for recombination to break the association)
- Population bottleneck / founder effect
- Admixture (mixing populations can create temporary LD patterns)
- Selection (a favored allele can “drag along” nearby variants = hitchhiking)
USMLE phrasing clue: “Inherited together more often than expected” or “marker strongly associated with disease allele.”
Step 4 — What does LD allow you to do clinically/research-wise?
- Use a tag SNP to track a nearby causal variant
- Map disease genes using association studies (e.g., GWAS)
- Interpret why certain alleles cluster in specific populations
Classic testing angle: “A SNP is strongly associated with a disease in a population” → LD with causal variant.
Step 5 — What breaks down LD?
LD tends to decay over time due to:
- Recombination across generations
- Gene conversion (less commonly emphasized)
- Increasing time since the causal mutation arose
High yield: The farther apart two loci are, the more recombination, the lower LD.
Ultra-high-yield: Linkage vs LD (quick table)
| Feature | Linkage | Linkage Disequilibrium (LD) |
|---|---|---|
| Level | Families/pedigrees | Populations |
| Core idea | Physical co-inheritance due to proximity | Nonrandom allele association (haplotypes) |
| Depends on recombination? | Yes (recombination fraction ) | Often reflects recombination history + population forces |
| Used for | Mapping genes in pedigrees (LOD score) | Association studies, tag SNPs, GWAS |
| Can distant loci show it? | Rare (recombination breaks it) | Usually low unless special population history |
Mini-flowchart summary (shareable)
Two loci in a population → do specific alleles occur together > expected?
- No → not in LD
- Yes → LD
- Likely due to proximity, founder/bottleneck, recent mutation, or selection
- Use marker as proxy to map disease genes
Common USMLE-style pitfalls
- “LD = genes close together” is incomplete.
- Close loci often create LD, but LD is fundamentally statistical association in a population.
- Linkage can exist without LD in some scenarios (e.g., equilibrium over many generations with recombination breaking historical associations).
- Association ≠ causation: a tag SNP can associate with disease without being functional.
Clinical Genetics tie-in: why Step questions care
When a vignette says:
- “A specific SNP is strongly associated with a disease in people of Northern European ancestry…”
- “The SNP is not in a coding region…”
- “The SNP is used to track the disease allele…”
They want: That SNP is in LD with the causal variant (same haplotype block).