Clinical GeneticsApril 18, 20264 min read

Step-by-step flowchart: Linkage disequilibrium

Quick-hit shareable content for Linkage disequilibrium. Include visual/mnemonic device + one-liner explanation. System: Genetics.

Linkage disequilibrium (LD) is one of those genetics concepts that sounds abstract until you frame it the way USMLE tests it: “Do these two variants travel together through generations more than you’d expect by chance?” If yes, you can use one as a proxy for the other—huge for mapping disease genes and interpreting GWAS-style questions.


The one-liner (what you should say in your head)

Linkage disequilibrium = nonrandom association of alleles at different loci in a population (a haplotype effect), often because they’re physically close and recombination hasn’t separated them yet.

💡

USMLE translation: “Marker allele tags the disease allele because they’re inherited together.”


Visual + mnemonic device: “LD = Linked Dice

Picture two dice glued together:

  • If they’re glued (high LD): you keep rolling the same pairings (alleles occur together more than expected).
  • If they’re unglued (low LD): they roll independently (alleles assort randomly).

Mnemonic: Linked Dice → Loci Don’t separate (often due to limited recombination).


Step-by-step flowchart (how to answer questions fast)

Step 1 — Are we talking about population patterns or family inheritance?

  • Population-level nonrandom association of alleles → think LD
  • Co-segregation in a pedigree → think linkage analysis (related, but not identical)

Key distinction

  • Linkage: physical proximity on a chromosome (a recombination concept)
  • Linkage disequilibrium: statistical association of alleles in a population

Step 2 — Do the variants occur together more than expected by chance?

Ask: Is a particular allele at locus A frequently paired with a specific allele at locus B?

  • If yesLD present
  • If no → alleles are approximately independent

Step 3 — Why would LD happen?

High-yield causes that show up in stems:

  • Physical proximity (low recombination fraction between loci)
  • Recent mutation (not enough generations for recombination to break the association)
  • Population bottleneck / founder effect
  • Admixture (mixing populations can create temporary LD patterns)
  • Selection (a favored allele can “drag along” nearby variants = hitchhiking)

USMLE phrasing clue: “Inherited together more often than expected” or “marker strongly associated with disease allele.”


Step 4 — What does LD allow you to do clinically/research-wise?

  • Use a tag SNP to track a nearby causal variant
  • Map disease genes using association studies (e.g., GWAS)
  • Interpret why certain alleles cluster in specific populations

Classic testing angle: “A SNP is strongly associated with a disease in a population” → LD with causal variant.


Step 5 — What breaks down LD?

LD tends to decay over time due to:

  • Recombination across generations
  • Gene conversion (less commonly emphasized)
  • Increasing time since the causal mutation arose

High yield: The farther apart two loci are, the more recombination, the lower LD.


Ultra-high-yield: Linkage vs LD (quick table)

FeatureLinkageLinkage Disequilibrium (LD)
LevelFamilies/pedigreesPopulations
Core ideaPhysical co-inheritance due to proximityNonrandom allele association (haplotypes)
Depends on recombination?Yes (recombination fraction θ\theta)Often reflects recombination history + population forces
Used forMapping genes in pedigrees (LOD score)Association studies, tag SNPs, GWAS
Can distant loci show it?Rare (recombination breaks it)Usually low unless special population history

Mini-flowchart summary (shareable)

Two loci in a population → do specific alleles occur together > expected?

  • No → not in LD
  • YesLD
    • Likely due to proximity, founder/bottleneck, recent mutation, or selection
    • Use marker as proxy to map disease genes

Common USMLE-style pitfalls

  • “LD = genes close together” is incomplete.
    • Close loci often create LD, but LD is fundamentally statistical association in a population.
  • Linkage can exist without LD in some scenarios (e.g., equilibrium over many generations with recombination breaking historical associations).
  • Association ≠ causation: a tag SNP can associate with disease without being functional.

Clinical Genetics tie-in: why Step questions care

When a vignette says:

  • “A specific SNP is strongly associated with a disease in people of Northern European ancestry…”
  • “The SNP is not in a coding region…”
  • “The SNP is used to track the disease allele…”

They want: That SNP is in LD with the causal variant (same haplotype block).