V(D)J Recombination

V(D)J Recombination

V(D)J recombination is the somatic DNA rearrangement process by which developing lymphocytes assemble functional antigen receptor genes from variable (V), diversity (D), and joining (J) gene segments. This mechanism is the primary source of the extraordinary diversity in T cell receptors (TCRs) and B cell receptors (BCRs), enabling the adaptive immune system to recognize virtually any pathogen.

Overview

The adaptive immune system faces a fundamental challenge: it must be prepared to recognize an essentially unlimited universe of potential pathogens using genetic information that can only encode a finite number of proteins. V(D)J recombination elegantly solves this problem through a “combinatorial” strategy that generates millions to billions of unique receptors from a few hundred gene segments.

The Diversity Problem and Solution

The challenge: Humans need to recognize ~10^9 or more different antigens

The solution: V(D)J recombination achieves this through:

  1. Combinatorial diversity: Random selection from multiple V, D, and J gene segments
  2. Junctional diversity: Imprecise joining creates additional variation at junctions
  3. Chain pairing: Independent rearrangement of two chains multiplies diversity

The result: Potential diversity of >10^15 unique TCRs and >10^11 unique BCRs from ~400-600 germline gene segments.

The Gene Segments

TCR Gene Loci

LocusLocationV SegmentsD SegmentsJ SegmentsC Segments
TRA (α chain)Chr 14q11~700611
TRB (β chain)Chr 7q34~522132
TRG (γ chain)Chr 7p14~6052
TRD (δ chain)Chr 14q11*~8341

*TRD is embedded within the TRA locus

BCR (Immunoglobulin) Gene Loci

LocusLocationV SegmentsD SegmentsJ SegmentsC Segments
IGH (heavy chain)Chr 14q32~452769
IGK (κ light chain)Chr 2p12~40051
IGL (λ light chain)Chr 22q11~3004-54-5

The Recombination Machinery

Recombination-Activating Genes (RAG)

The RAG1 and RAG2 proteins form the core recombinase complex that initiates V(D)J recombination:

RAG1:

  • Contains the catalytic core (DDE motif)
  • Binds directly to recombination signal sequences (RSS)
  • Introduces DNA nicks and catalyzes hairpin formation
  • ~1,040 amino acids; core region essential

RAG2:

  • Enhances RAG1 binding specificity and activity
  • Contains a PHD domain that recognizes H3K4me3 (active chromatin marker)
  • Cell cycle-regulated: degraded outside G1 phase
  • ~527 amino acids

Expression Pattern:

  • Strictly limited to developing lymphocytes
  • Expressed in bone marrow (B cells) and thymus (T cells)
  • Turned off after successful rearrangement
  • Aberrant expression linked to lymphoid malignancies

Recombination Signal Sequences (RSS)

Each V, D, and J gene segment is flanked by recombination signal sequences (RSS) that direct the RAG complex:

RSS Structure:

[Coding Segment]—CACAGTG—spacer (12 or 23 bp)—ACAAAAACC
                 ↑                              ↑
              Heptamer                        Nonamer

Two RSS Types:

  • 12-RSS: 12 base pair spacer (approximately one turn of DNA helix)
  • 23-RSS: 23 base pair spacer (approximately two turns of DNA helix)

The 12/23 Rule: Recombination occurs efficiently only between a 12-RSS and a 23-RSS. This ensures proper segment ordering—V cannot join directly to V, only to D (which can then join to J).

RSS Distribution Example (TCRβ):

Segment5’ RSS3’ RSS
23-RSS
12-RSS12-RSS
23-RSS

This arrangement ensures: Vβ(23)—(12)Dβ(12)—(23)Jβ

Additional Recombination Factors

FactorFunctionDeficiency Phenotype
Ku70/Ku80Binds broken DNA ends; recruits DNA-PKcsSCID
DNA-PKcsActivates Artemis; facilitates end joiningSCID, radiosensitivity
ArtemisOpens hairpin-sealed coding endsRadiosensitive SCID
TdTAdds N-nucleotides at junctionsReduced diversity, viable
DNA Ligase IVJoins processed DNA endsLIG4 syndrome
XRCC4Partners with Ligase IVSCID, microcephaly
XLF (Cernunnos)Enhances NHEJImmunodeficiency

The Recombination Process

Step 1: Synapsis

The RAG complex brings two gene segments together:

  1. RAG1/RAG2 binds to one RSS (12-RSS preferred initially)
  2. Complex captures second RSS (23-RSS) forming the synaptic complex
  3. Both RSS must be present for cleavage to occur
  4. The 12/23 rule is enforced at this step

Step 2: Cleavage

The RAG complex introduces DNA breaks through a two-step mechanism:

First reaction—Nicking:

  • Single-strand cut exactly at the heptamer/coding segment border
  • Creates a free 3’-hydroxyl group (3’-OH)

Second reaction—Hairpin formation:

  • The 3’-OH attacks the phosphate on the opposite strand (transesterification)
  • Creates hairpin-sealed coding ends
  • Creates blunt signal ends
Before cleavage:
5'——CODING——CACAGTG——spacer——ACAAAAACC——3'
3'——CODING——GTGTCAC——spacer——TGTTTTTGG——5'

After cleavage:
Coding end: Hairpin structure       Signal end: Blunt

Step 3: Hairpin Opening

The enzyme Artemis (activated by DNA-PKcs) opens the hairpin-sealed coding ends:

Symmetric cleavage (at the tip):

  • Creates blunt end
  • No P-nucleotide addition

Asymmetric cleavage (off-center):

  • Creates single-strand overhang
  • Overhang is filled in by DNA polymerase
  • Results in P-nucleotide (palindromic) additions

Step 4: End Processing (Junctional Diversity)

Before joining, the opened coding ends are processed, creating junctional diversity:

P-nucleotide (Palindromic) Addition:

  • Result of asymmetric hairpin opening
  • Short palindromic sequences (0-4 nucleotides)
  • Template-dependent

N-nucleotide (Non-templated) Addition:

  • Added by Terminal deoxynucleotidyl transferase (TdT)
  • Truly random sequence
  • Can add 0-15+ nucleotides
  • TdT adds nucleotides without a template
  • Most important source of junctional diversity
  • TdT expression is developmentally regulated

Exonuclease Trimming:

  • Nucleotides may be removed from segment ends
  • Contributes to diversity but can cause frameshifts
  • Balances N-additions

Step 5: Joining

The Non-Homologous End Joining (NHEJ) pathway ligates the processed ends:

Coding Joint Formation:

  • Ku70/Ku80 binds and protects DNA ends
  • Polymerases fill gaps
  • XRCC4/Ligase IV complex ligates
  • Creates the functional V(D)J junction

Signal Joint Formation:

  • The two signal ends are also joined
  • Creates a signal joint (precise head-to-head joining)
  • Usually on excised circular DNA (excision circles)

Signal Joints and Excision Circles

When segments are oriented head-to-head (most common), the intervening DNA is excised as a circular episome:

  • TRECs (T cell receptor excision circles): Byproduct of TCR rearrangement
  • KRECs (κ-deleting element recombination circles): Byproduct of B cell development

These circles:

  • Do not replicate during cell division
  • Dilute out as cells proliferate
  • Serve as markers of recent thymic/bone marrow emigrants
  • Used clinically for SCID screening in newborns

Sources of Receptor Diversity

1. Combinatorial Diversity

The random selection of gene segments provides a baseline of diversity:

TCRαβ Example:

ChainCombinationsCalculation
TCRα~4,27070 V × 61 J
TCRβ~1,35252 V × 2 D × 13 J
Combined~5.8 × 10^64,270 × 1,352

BCR Heavy + Light Example:

ChainCombinationsCalculation
Heavy~8,26245 V × 27 D × 6 J
Light (κ+λ)~639(40 × 5) + (30 × 5)
Combined~5.3 × 10^68,262 × 639

2. Junctional Diversity

The imprecise joining process adds enormous additional diversity at each junction:

MechanismDescriptionContribution
P-nucleotidesFrom asymmetric hairpin opening0-4 nucleotides
N-nucleotidesRandom addition by TdT0-15+ nucleotides
Exonuclease trimmingRemoval of germline nucleotidesVariable loss

Each junction can have dozens of possible outcomes, multiplying diversity by factors of 10^6 or more.

3. Chain Pairing

Two independently rearranged chains pair to form the complete receptor:

  • TCR: α chain + β chain (or γ + δ)
  • BCR: Heavy chain + Light chain (κ or λ)

Since chains rearrange independently, pairing multiplies diversity.

Total Diversity Calculation

TCR Diversity:

Diversity = (Vα × Jα × junctional_α) × (Vβ × Dβ × Jβ × junctional_β)
          = (70 × 61 × ~10^6) × (52 × 2 × 13 × ~10^6)
          ≈ 10^15 - 10^18 unique TCRs possible

BCR Diversity (before somatic hypermutation):

Diversity = (VH × DH × JH × junctional) × (VL × JL × junctional)
          ≈ 10^11 - 10^13 unique BCRs possible

Note: Actual repertoire size in any individual is limited by the number of lymphocytes (~10^11-10^12 T cells).

The CDR3 Region

The CDR3 (Complementarity-Determining Region 3) is the product of V(D)J junctional diversity and serves as a molecular “barcode” for each lymphocyte clone:

CDR3β Structure:

[End of Vβ]—[N-nts]—[Dβ]—[N-nts]—[Start of Jβ]
           ↑________________CDR3________________↑

CDR3α Structure:

[End of Vα]—[N-nucleotides]—[Start of Jα]
           ↑_______CDR3________↑

Significance:

  • Most hypervariable region of the receptor
  • Primary determinant of antigen specificity
  • Unique sequence identifies each clone
  • Target of repertoire sequencing
  • Diagnostic marker for tracking specific clones

Developmental Timing and Regulation

TCR Rearrangement Order in αβ T Cells

StageTCR LocusEventsCheckpoint
DN2-DN3TRBDβ→Jβ, then Vβ→DJββ-selection
DN3-DN4Proliferation after successful β chain
DPTRAVα→Jα (multiple attempts allowed)Positive/Negative selection

BCR Rearrangement Order

StageIg LocusEventsCheckpoint
Pro-BIGHDH→JH (both alleles), then VH→DJHPre-BCR
Pre-BProliferation after successful heavy chain
Pre-B (small)IGK/IGLVL→JL (κ first, then λ)Central tolerance

Allelic Exclusion

Each lymphocyte expresses only one productive receptor:

Mechanism:

  1. Successful rearrangement on one allele signals to stop
  2. RAG expression downregulated
  3. Prevents further rearrangement
  4. Ensures monospecificity (one receptor per cell)

Strictness:

  • TCRβ and IGH: Strict allelic exclusion
  • TCRα: Less strict; secondary rearrangements possible during thymic selection
  • IG light chains: Allows receptor editing

Chromatin Accessibility

V(D)J recombination requires accessible chromatin:

  • RAG2 PHD domain recognizes H3K4me3 (active chromatin mark)
  • Transcription through V regions indicates accessibility
  • Developmental stage determines which loci are accessible
  • Sequential opening explains ordered rearrangement

Clinical Implications

Diagnostic Applications

TRECs as Biomarkers:

  • Measure recent thymic output
  • Newborn screening for SCID (severe combined immunodeficiency)
  • Monitor immune reconstitution after transplant
  • Assess thymic function in immunosenescence

Clonality Assessment:

  • Monoclonal V(D)J rearrangement → suggests malignancy
  • Polyclonal pattern → reactive/benign process
  • Used in lymphoma/leukemia diagnosis
  • PCR-based or sequencing-based methods

Diseases Affecting V(D)J Recombination

DiseaseGene AffectedImmune Phenotype
SCID (T-B-NK+)RAG1 or RAG2 (null)Complete absence of T and B cells
Omenn syndromeRAG1 or RAG2 (hypomorphic)Limited repertoire, oligoclonal T cells, autoimmunity
Artemis deficiencyDCLRE1CRadiosensitive SCID
DNA-PKcs deficiencyPRKDCSCID with radiosensitivity
LIG4 syndromeLIG4SCID, microcephaly, developmental delay
XLF/Cernunnos deficiencyNHEJ1Immunodeficiency, growth retardation

Aberrant Recombination and Cancer

RAG occasionally targets sequences resembling RSS in non-antigen receptor loci:

  • Chromosomal translocations in lymphoid malignancies
  • BCL2-IGH t(14;18): Follicular lymphoma
  • MYC-IGH t(8;14): Burkitt lymphoma
  • TCR-TAL1/SCL: T-ALL

These “off-target” events highlight both the power and danger of the V(D)J recombination machinery.

Key Concepts

  1. V(D)J recombination is the somatic DNA rearrangement that creates antigen receptor diversity from germline gene segments

  2. RAG1/RAG2 form the recombinase that recognizes RSS and initiates cleavage; expression is restricted to developing lymphocytes

  3. The 12/23 rule ensures proper segment joining—only 12-RSS can join to 23-RSS

  4. Junctional diversity from P-nucleotides, N-nucleotides (TdT), and exonuclease activity is the major source of receptor variability

  5. CDR3 is the hypervariable product of junctional diversity and serves as a unique clonal identifier

  6. TRECs/KRECs are circular DNA byproducts used as biomarkers for recent lymphocyte production

  7. Allelic exclusion ensures one receptor per cell; this is enforced by checkpoints during development

References

  1. Schatz DG, Swanson PC. (2011). V(D)J recombination: mechanisms of initiation. Annual Review of Genetics, 45:167-202.

  2. Roth DB. (2014). V(D)J Recombination: Mechanism, Errors, and Fidelity. Microbiology Spectrum, 2(6).

  3. Bassing CH, et al. (2002). The mechanism and regulation of chromosomal V(D)J recombination. Cell, 109:S45-S55.

  4. Lieber MR. (2010). The mechanism of double-strand DNA break repair by the nonhomologous DNA end-joining pathway. Annual Review of Biochemistry, 79:181-211.